Software Emulates Organism's Entire Lifespan 86
An anonymous reader
"Scientists have developed a software simulation, running on 128 computers, of an entire organism, a step toward carrying out full experiments without traditional instruments (abstract). 'For their computer simulation, the researchers had the advantage of extensive scientific literature on the bacterium. They were able to use data taken from more than 900 scientific papers to validate the accuracy of their software model. Still, they said that the model of the simplest biological system was pushing the limits of their computers. "Right now, running a simulation for a single cell to divide only one time takes around 10 hours and generates half a gigabyte of data," Dr. Covert wrote. "I find this fact completely fascinating, because I don’t know that anyone has ever asked how much data a living thing truly holds. We often think of the DNA as the storage medium, but clearly there is more to it than that." In designing their model, the scientists chose an approach that parallels the design of modern software systems, known as object-oriented programming. Software designers organize their programs in modules, which communicate with one another by passing data and instructions back and forth. Similarly, the simulated bacterium is a series of modules that mimic the different functions of the cell.'"
Re:OO vs real life (Score:5, Informative)
Simplified answer:
These models tend to be object-oriented in the sense that a genetics "module" interacts with a protein signaling module, etc. In each module, you'd have the member data (say, a list of all proteins) and member functions (say, a model of the reaction network that discretizes the massive system of ODEs).
The objects then interact. You have well-defined interfaces between these modules to codify currently known (or hypothesized!) biology. For example, members of the proteins module activate certain genes in the genetics module to (eventually) drive synthesis of more proteins.
You write the rules based upon our current state-of-the-art in understanding cell biology, simulate, and see what happens. To the extent that it quantitatively matches experiments, we can assess the underlying hypotheses, refine them, or toss them out.
In this work, it looks like they pulled information from 900 papers on this species of bacterium to simulate 525 genes, God knows how many proteins (genes can encode multiple proteins), and 28 processes.
Notably, there is no spatial component (e.g., transport of proteins, RNAs, cell volume changes, cell mechanics, etc.), but it's an incredible set of work. And to be able to predict phenotype solely based upon the emergent behavior of this network is pretty incredible.