In December, an Atlanta company called NuTec Sciences acquired the fastest supercomputer in commercial use, an IBM machine capable of 7.5 trillion calculations per second. Only a few computers on the planet possess such speed, and many of those were built to simulate nuclear explosions and test the viability of atomic arsenals. But NuTec is using its supercomputer for biology, not bombs.
The mapping of the human genome has triggered an explosion in data about the nature of life. As pharmaceutical and biotech research companies like NuTec struggle to understand the workings of tens of thousands of genes and the hundreds of thousands of proteins they produce, biology is overtaking nuclear weapons as the field demanding the most sophisticated computers. Every other day seems to bring a new discovery - like last week's announcement that a biotech company located a gene thought to be responsible for heart-attack-inducing cholesterol. NuTec will deploy its supercomputer to analyze cancer patients' individual genetic profiles to find the most effective treatments for their particular disease.
To capitalize on such advances, technology companies like IBM, Compaq and Sun Microsystems (SUNW) are pouring money into their life-sciences divisions and courting the biotech industry. Compaq and IBM recently launched dueling bio-supercomputer projects that will result in two of the world's most powerful computers. Like genomic research, bio-supercomputing is a leap into the unknown: It remains unclear exactly how these staggeringly expensive machines will be used and who will be willing to pay for them. But finding a market for bio-supercomputers is almost beside the point for the companies. Rather, the drive to push the boundaries of biological computing is behind the quest.
"Who knows what will come next," says Caroline Kovac, chief of IBM's life-sciences division. "That's the kind of high risk but incredibly high value" of the bio-supercomputer.
The biological and computing challenges posed by mapping the genome are daunting: What is the function of the more than 30,000 human genes decoded last summer? How many human proteins exist? (Scientists think there are at least a million.) Which proteins keep the heart beating; which ones repair damaged tissue; which ones help digest food? Just as important, how does a protein behave when a disease like Alzheimer's or colon cancer takes root? And which genes are generating those flawed proteins?
Some of these tasks, such as determining the function of different proteins, can be tackled with existing computer power - albeit lots of it. Others, such as understanding the role of proteins in promoting disease, can require a computer of a different magnitude. A bio-supercomputer is not radically different from a conventional supercomputer, but it does require certain strengths, including the ability to recognize complex chemical and biological patterns and carry out complicated data searches.
That is where Compaq's Red Storm and IBM's Blue Gene come in. Currently under development, these bio-supercomputers aim to perform far faster than any existing supercomputer. "What we're seeing," says Kovac, "is a trend that the fastest computers in the world are starting to be driven by people looking at life sciences as opposed to physics calculations."
IBM was the first to announce its bio-supercomputer plan. When completed in 2004, Blue Gene will be capable of performing 1,000-trillion calculations per second, or 1,000 teraflops in supercomputer lingo. That's a quantum leap over the company's current champion, a supercomputer housed at Lawrence Livermore National Laboratory in California that operates at the relatively leisurely pace of 12 trillion calculations per second. In fact, Blue Gene is faster than the top 500 computers in the world put together. The machine will also be deceptively compact. While the Livermore Lab supercomputer compares in size to two basketball courts filled with refrigerators, Blue Gene will be contained in two fridge-size units that could be stored in a walk-in pantry.
IBM, which is spending at least $100 million on Blue Gene, also has chosen to build it for a narrowly defined purpose: mathematically predicting the changing shape proteins take - a process known as protein folding - before they go about their business regulating the human body. Misshapen proteins are thought to be a major trigger of disease. Determining the shape of a protein is crucial to developing drugs that can latch on to a damaged protein and prevent it from causing illness.
Competitor Compaq is taking a different path. In January, the company announced plans to develop a 100-teraflop bio-supercomputer dubbed Red Storm in partnership with Celera Genomics, the Rockville, Md., company that mapped the human genome, and Sandia National Laboratories in Albuquerque, N.M. Although Blue Gene will be 10 times faster than Red Storm, a Celera executive stresses that the company's machine could eventually match IBM's speed.
Unlike Blue Gene, though, Red Storm is being designed for a broader array of life-science experiments and may be used to conduct nuclear research. The supercomputer, set to begin operating in 2004, will cost an estimated $125 million to $150 million to build.







Hosted by Tom Sullivan, stay abreast of the latest IDG content covering IT news, product reviews, best practices, and white papers.