

Using Bacterial DNA For Data Storage 212
NPV writes "January ACM Communications has an article on the use of DNA in genetically modified bacteria to store information. This is an attempt to achieve the ultimate in archival storage (one of the modified bacteria can tolerate 1000X more radiation than a human being). Now just suppose that the "junk DNA" in the human genome is the documentation package for the machine code. Who wrote that manual?" Here's the article abstract.
Of course its junk DNA... (Score:3, Interesting)
Re:Of course its junk DNA... (Score:2, Insightful)
We may not immediately be able to make natural organisms "better" in terms of natural fitness, but we can still make plenty of modifications which are beneficial to us. We can do it even without the use direct genetic engineering; we call that "domestication".
Re:Of course its junk DNA... (Score:2)
Re:Of course its junk DNA... (Score:5, Interesting)
Yet I don't see this hitting the market in the next ten years.
I remember about eight years ago an article about how the future of storage was going to be in a frozen solid containing bacteria that change shape when a certain intensity of light hits them -- two lasers, each with half the requisite amount of light, would shine in to cause the bacteria to change shape where they met. Terrabytes in a little cube. Never happened.
Re:Of course its junk DNA... (Score:2, Informative)
Its NOT junk DNA... (Score:2, Informative)
_
Gaia (Score:2)
Reminds of that STTNG Episode (Score:4, Interesting)
Planet P Blog [planetp.cc] - Liberty with Technology.
Re:Reminds of Gel Packs (Score:2)
Who wrote that manual? (Score:5, Funny)
I think the important question is... who has IP rights over it?
We wrote the manual! (Score:3, Funny)
I'll probably write this code in sometime in the future. Human cloning is stealing and I will sue your ass for infringement.
Re:We wrote the manual! (Score:2)
I'll probably write this code in sometime in the future. Human cloning is stealing and I will sue your ass for infringement.
Yeah, but then someone will hack your code, call it DeDNA, post it onto Kazaa, and then it's all down hill from there.
mutations? (Score:1, Interesting)
Who wrote that manual? (Score:5, Funny)
Re:Who wrote that manual? (Score:2, Interesting)
Ha! Raelians [rael.org] can't read biology and think p53 is a new gene (first published 1984) that makes evolution impossible, cos its a DNA repair enzyme, which makes mutation and hence evolution impossible (1).
Which is true in that DNA repair exists and p53 is involved in it (although its more involved in getting cells to commit suicide [rcn.com] if there feeling a bit precancerous), but it won't stop genes mutating as all it does is checks/corrects DNA base pairing [rcn.com] sometimes correcting it the wrong way creating a new mutation
(1) under Evidence -> Science & Future -> Alt theorys of Evolution.... (F***ing frames)
PS Being involved in human gene nomenclature [ucl.ac.uk] I feel duty bound to mention p53 approved symbol is TP53.
No spy! You can't have it (swallow) (Score:2, Insightful)
Doctor, my stomach hurts!
(1 year later)
Plague Plague Plague!
Great! (Score:5, Funny)
The aliens beat us to it (Score:2)
All we have to do now is search the junk DNA in our own genome until we find that the encoded message reads:
First Post!
Cheating possiblities (Score:2, Interesting)
This is interesting though. What if the entire human population became just a storage bank? What if EVERY LIVING THING on Earth became part of this bank? That would be an interesting scenario. For now, though, I'll just stick to normal HD's. A big problem, I suppose, is in changing the data. I wonder how many bacteria they had to go through to get it right.
Re:Cheating possiblities (Score:3, Funny)
Re:Cheating possiblities (Score:2, Funny)
Re:Cheating possiblities (Score:2, Funny)
Re:Cheating possiblities (Score:2)
The beowulf cluster allready exists....
They found out the answer is 42....
Then they made the cluster to find out what the questions really is.
hmmm (Score:5, Funny)
Uhh, perhaps not. (Score:4, Interesting)
I haven't read the article (don't have access to where I am) nor have I thought about this subject much, but one question I have is how the authors keep the sequences under selective pressure. DNA sequences are only conserved over many years if evolution needs them. Non-coding regions (So called "junk-DNA", poor choice of words, btw) would easily mutate into other sequences. One could imagine sequencing many cells, and infer the original sequence, but this gets more expensive as time goes on (as the number of sequences you need to sequence goes up).
-Sean
Re:Uhh, perhaps not. (Score:2, Interesting)
There is a kind of bactera (Score:5, Interesting)
You could also put error checking (parity, checksums, etc) so once you found some bactera you could check to make sure they had the right version and not a mutation
Can tolerate lots of radiation, but... (Score:2)
Re:Uhh, perhaps not. (Score:2)
-Sean
Don't wash your hands... (Score:2)
Re:Don't wash your hands... (Score:2)
Let me save
1. [Please insert one of
smelly unwashed ego-maniacal GNU/lixux anti-Stallman
or
smelly un-washed Moutain Dew-swilling linux geek
or
smelly never-showering always-surrendering Frenchman
'will be the only ones able to use this bacterial memory'
joke here.]
2. ???
3. Profit.
Yipes! (Score:1)
No, you mean a virus (Score:2)
hrm...
Bad news (Score:1, Funny)
So, ah... (Score:1)
Or the holographic/crystal whatever storage that's being developed?
This seems, ah, messy.
In other news... (Score:5, Funny)
Re:In other news... (Score:2)
Doctors Visit (Score:1)
Patient: I think I'm dying.
Doctor: RTFM!
dna in violation of dmca (Score:1)
Re:dna in violation of dmca (Score:2)
It is only a matter of time before this becomes a violation of the DMCA.
Now that you mention it, all the President has to do to get his way with contraceptives is get a law passed that says that every person immediately gets copyright over their DNA (grammar?). Then contraceptive devices themselves and even talking about them would be a violation of the DMCA.
So that's why (Score:2, Funny)
Bacteria Have No Introns and Other Considerations (Score:5, Informative)
Also note that the introns in eukaryotes are highly mutable (look up 'tandem repeats' if you have the inclination), so the fidelity of the data would be sacrificed by putting it there. The longest lifetime for the data would be achieved by tricking the replication machinery into thinking the segment was an exon, which would involve tying it to a functional protein that would be absent were the sequence to be mutated.
Duplication of the data would also work, but it would only hammer down the probability of mutation, since the probability of a point mutation of a base at the same location in two widely separated sequences is roughly 10^-18 to 10^-17 per year for exons.
Re:Bacteria Have No Introns and Other Consideratio (Score:5, Informative)
junk---junk---junk---exon-intron-exon-intron-ex
The junk DNA often referred to is mainly intergenic DNA, and this is where most of the non-coding DNA is found. This also makes up the majority of the eukaryotic genome. Prokaryotes (bacteria) do contain intergenic DNA, but no introns.
Re:Bacteria Have No Introns and Other Consideratio (Score:2)
Note I never said that introns were junk-DNA, and I don't even like that term. Perhaps I should have said 'non-functional' vs. 'functional' DNA.
Re:Bacteria Have No Introns and Other Consideratio (Score:5, Insightful)
My first impluse was that this is way off. I'm used to working with plasmids where frequently like 60% of the sequence is junk. They use E. Coli and D. radiodurans in the study mentioned in the article. A brief survey of E. Coli K12 (the parent of most common lab strains) sez that about 5-10% of it is non-coding. The old initial reference claims about 11% is non-coding, but a good chunk of that may be regulatory. The radiodurans genome is about 9% non-coding. The up shot is that there is actually a fair amount of 'junk-DNA' in (at least the Coli) bacterial genomes. Not a lot by human standards but enough to be able to squeeze in a chunk here or there if you're careful.
Another impulse was 'gad... that made it into Nature!?' (the journal, the article cited is a self congratulatory summary of their Nature paper). A lot of it follows a well duh kind of reasoning. 'Well duh' science is often the really good kind, but I wasn't particularily amazed by this. The DNA manipulation methods are beyond standard now, the only really clever thing was proposing the use of radiodurans as the host. Even that was sort of obvious (a blazingly well studied organism that is transformable). The DNA -> text using a 6 bit space? Well if you've ever designed linker regions in proteins I'm sure you were at least thought about spelling out you name or something in amino acids (unless your name is BOB). In part this is because every one learns the amino acids by doing stupid things like spelling out their name. Few people actually do this, mind you, as it usually would have some deleterious effect, but the point is I'm sure they weren't the first ones to try something like this, probably just the first to get funded to do this explicitly. Their big addition was to come up with a 3-letter code that includes all the letters and, ooo, punctuation. Then they spelled out bits of 'It's a small world.' My point is that it's not that far fetched and a bit surprising (to me) that it made it to Nature.
As to the utility of these things for information carriers... Mutation would be a problem in the long term. Sure radiodurans would survive nuclear war (these guys put cockroaches to shame) but they do it using lots of mismatch repair and recombinatorial repair methods. These are not perfect repair systems, they can and frequently do introduce many errors, especially in non-essential DNA space. Tying it to a functional protein isn't a bad idea, but unless the added sequence adds some survival advantage it won't enhance the lifetime of the measage (ie. if uncorrputed data gives an advantage then it is statistically less likely to propagate). Also, as you mentioned, the bacterium might notice long chunks (they're using 100 characters here) of useless DNA and excise it. For that kind of text, it might be better to just etch it into stone or something, at least you have some hope of seeing it intact in 2000 years.
True .... but what message to send?? (Score:2, Insightful)
Your right about Nature, to me its more New Scientist article (I recall seeing a paper in Biotechniques about encoding text in DNA some 5-6 years ago I think that was for copyright messages)
Mutation may not be too much of a problem as you could reconstruct the data by sequenceing many different strains of the bug (sort of bacterial TCP protocol if the packet is corrupted sequence a different strain)
What I'd like to know is what sort of data would you send? Encoding the data would be a bit of a fiddle.... but extracting the data would be a expensive, soul destroying project, reqireing late 20th early 21st centuary tech and if target decendants have that sort of tech there must be better ways of sending messages./P
Re:Bacteria Have No Introns and Other Consideratio (Score:2)
Photolithography on aluminum plates for long-term data storage. It's been done.
Re:Bacteria Have No Introns and Other Consideratio (Score:2)
This is fascinating. Still, I wouldn't say that regulatory DNA is 'junk'. And the other small fraction whose purpose is not understood may well be functional, right? It would be an interesting experiment anyway.
The Nature connection (Score:2)
What are you referring to? In the article that appears in ACM Communications, it says:
The cited Nature article has a completely different set of authors (Taylor, Risca, Bancroft) from the ACM article (Pak Chung Wong, Kwong-Kwok Wong, Harlan Foote). Based admittedly on the authors' own claims, the ACM article seems to go significantly beyond the Nature article (the latter sounds like as much a test of the US Postal Service than anything else!)Sorry there are intron-like things in prokaryotes (Score:3, Interesting)
And if you consider RNA editing (where the wacking out or modification of nucleotides prior to translation), you gain a tremendous amount of flexibility in the smaller genomes of these bugs.
Of course, the long term storage they're looking at is best done by the spores of gram positive bugs, like Bacillus subtilis. When they're in this non-replicative stage, there is little chance of sequence alteration. And by having, some 10^8 spores around, even if there were a few mucking things up, the majority would maintin the original sequence.
But engineering a bug to not alter sequences is much more difficult than knocking out RecA.
Re:Sorry there are intron-like things in prokaryot (Score:2)
Are you sure about RNA editing in bacteria? The rate of ribosomal attachment to a free RNA strand is very high, and it is unlikely that you can preserve the free mRNA long enough (without a nucleus) to edit it. At least that's the dogma I was taught. If you know a way, then please tell
Re:Bacteria Have No Introns and Other Consideratio (Score:2)
I think there might be a slight error in reasoning here. The mutation rate in exon DNA is probably about the same as the rate in most other regions of DNA. The reason you don't observe that many mutations in these regions is that this DNA tends to be very critical to the proper functioning of the cell, and if it changes the cell is going to be at a disadvantage, and is likely to die out. I don't think there is much evidence that cells control the mutation rate of coding sequences specifically. (There are known exceptions - such as genes used by animal (including human) immune systems which are intentionally scrambled during early development to ensure that animals can generate antibodies to just about anything, and that each individual has a unique set of cellular markers that identify cells as belonging to "self" (and hence not subject to immune attack). This is why organs from even close family members can be subject to rejection.)
Of course, bacteria don't have introns, so the better analogy there is coding and non-coding regions of DNA.
Here is the problem in a nutshell: The coding DNA in bacteria is HIGHLY optimized to do its job in the best possible way. If you want to store data in this region you would have to alter it somehow. There are two ways you can alter it:
1. Changes that do not affect the biological interpretation of the DNA (called silent mutations).
2. Changes that do affect the interpretation of the DNA.
If you do #1, then there is no selective pressure for the mutations to stay around - they can mutate back just as easily as they could in non-coding DNA.
If you do #2, then your new DNA is going to have a biological affect. It will either confer an advantage or a disadvantage or will be neutral. If it confers a disadvantage the cell won't be able to compete against the natural strain of the bacteria. I think you'd be hard-pressed to come up with a sequence that confers an advantage - bacteria are probably the most efficient machines on the planet and it is unrealistic that you're going to be able to come up with an algorithm that systematically improves them while being able to code information into their DNA. If the mutation is neutral you then have the issue of random mutations wiping out your data as in scenario #1 - the mutants wouldn't be at a disadvantage.
The only way you can create stable sequences of DNA is to ensure they confer a selective advantage to the cell. The existing processes of mutation and natural selection have pretty much guaranteed that this isn't going to be easy to do - if mutations that confer advantages were trivial to generate they'd already have been generated in the past.
Re:Bacteria Have No Introns and Other Consideratio (Score:2)
Re:Bacteria Have No Introns and Other Consideratio (Score:2)
I think you missed my point (or I didn't explain it well). If the addition of the data changes the evolutionary fitness of the organism, it is almost certain that it will be in a negative way. If that is the case, then your new organisms will tend to mutate back to the state they started in if possible (though that will be slow). If released into the wild, your engineered bacteria would be overrun by wild-type bacteria, which don't have the crippled genes.
As far as histones go - just try to introduce a non-silent mutation into an exon of one of those genes. They are likely to be VERY picky about changes, and you might be lucky to get anything to grow at all...
Re:Bacteria Have No Introns and Other Consideratio (Score:2)
Re:Bacteria Have No Introns and Other Consideratio (Score:2)
The "Junk DNA" (Score:2)
The Manual (Score:2, Funny)
And where the hell did they hide it? I've been trying to figure out the human race (more specifically, the female of the species) for years. Chicks are always telling me to RTFM, so hurry up and fork that thing over so I can get ahead (bad pun intended) in the world!
What? (Score:2, Interesting)
What kind of comparison is that? Are human beings presently used as archival storage in irradiated areas?
Seems that the punched metal tape the Army uses for ultimate reliability is the way to go. Even if the stuff rusts, is radioactive and glowing red, you can still read it.
After much testing... (Score:3, Funny)
What I really want to know is, can the same be done with the DNA of a bug? Because if it can, I'm going to buy some MSFT shares...
RMN
~~~
kind of ironic isn't it? (Score:2)
it's only good hollywood movie justice that we should play switch up and start storing our information in THEM.
No, you didn't steal someone's yogurt (Score:3, Funny)
KFG
The Matrix... (Score:4, Funny)
Morpheus coulda pointed to a SAN/NAS box!
Instead they make a duracell [duracell.com] commercial and mumble about the "human body generating more bio-electricity than a 120-volt battery and over 25,000 BTUs of body heat."
Ok I'll quit ze bitching... it was spiffy anyway.
Re:The Matrix... (Score:4, Funny)
Just remember to feel sorry for the guy that gets slashdotted
The Matrix wasn't all that good (Score:2)
I started laughing out loud when they did the power generation explanation.
And when they started doing the "phones mysteriously transport you in and out of the Matrix" bit, the image that came to mind was the people first adapting to phones and thinking people could do things like poison them or reach through the phone across the phone line.
I mean, as tech movies go, *Tron* was more plausible. Does none of this come off as *stupid* to anyone else?
Re:The Matrix... (Score:3, Interesting)
They coulda used some wonky vague ass stuff about the machines figuring out a method of harvesting the untapped power of the human consciousness and I would've been happier... the mind could generate the power itself somehow (emotional energy perhaps?) or maybe act as a conduit for drawing energy from extradimensional space. It would also give em a reason to stimulate and develop helathy human brains via their Matrix simulation instead of just keeping em doped all the time.
Re:The Matrix... (Score:2, Funny)
The matrix would have been easy as hell to code: a big field, lots of grass -- that's it. I don't think you'd have any bovine Keanu Reeves breaking out (although a cow that knows Kung-Fu would be pretty damn funny).
Re:The Matrix... (Score:3, Funny)
Right...the leaping from building to building, spider robots, and 'faster than 5 speeding bullets' were fine...it was the human battery plot that made it seem like fantasy...
Re:The Matrix... (Score:2)
Well yeah, actually it was fine...
But using human bodies to generate more energy than you put into them? Sorry, can't buy that. That's not a question of being too far-fetched, it's simply not mathematically possible.
Re:The Matrix... (Score:2)
Morpheus coulda pointed to a SAN/NAS box!
I think that was some of the subtext of The Matrix movie. Humans were used as storage and processing as well as batteries and capacitors.
Remember the "power from fusion" line? Where exactly would you dump the extra electrical output until you required it later? You'd store it in a stable chemical form (catalytic thermal salts) or highly combustible forms (gasoline or hydrogen).
Why else would the A.I. even BOTHER keeping the humans alive in the first place unless it was required to do so for its own survival. Note that The Matrix guardians were seeking the keys to the "Zion Mainframe" so they could "leave this world". That means it was trapped there because it could not leave of its own free will.
I assume the sequels will go into more detail on this. Of course I COULD be reading the subtext wrong and misinterpret the "Zion Mainframe" as a mobile computer storage & processing ship in place perhaps of a worldwide space defense system in orbit or on the moon (thus preventing The Matrix's A.I. from leaving the planet in a more literal sense as all attempts to leave end up with it being shot from the sky).
Re:The Matrix... (Score:2)
Bacteria dna code? (Score:2)
So, they decode some of the bacteria dna at some point thinking that maybe there's some important information left there, and they come up with:
Now just suppose that the "junk DNA" in the human genome is the documentation package for the machine code. Who wrote that manual?
The article posting was obviously just someone using it as a steppingstool to push their own preconceived notions of science upon us. I declare the article a troll.
Since these are bacteria, it'd probably be like: (Score:2)
#include <unistd.h>
int main(void)
{
int i = 0;
for (i = 1; i <= 20; i++) {
fork();
}
return 0;
}
/* Whatever made you think that bacteria wouldn't be ANSI compliant? */
Re:Since these are bacteria, it'd probably be like (Score:2)
#include <unistd.h>
int
main(void) {
fork();
main();
}
/* It should be written like this: simple, accurate and destructive. */
what's kinda neat about that junk dna... (Score:2)
On a more realistic note, that junk DNA is probably more like a revision history of life. Many scientists are of the opinion that a significant portion of the junk DNA is really the product of virus infection way back in the evolutionary tree. Many viruses can copy their DNA into the host's genome where it will be propagated throughout life, and potentially into offspring. If this infection happens in the wrong section of a host's genome, the DNA is never read and while the virus doesn't propagate, its DNA will. Do this over a time scale of millions to billions of years and you get a lot of leftover virus DNA hanging around silent.
So basically, some large percentage of your DNA is really just virus turds.
-Ted
Prankster-prone technology. (Score:4, Funny)
Is there a connection here? (Score:2)
Sounds like a primitive form of Slashdot ratings!!
Junk DNA isn't the manual for our genetic code... (Score:2)
*ducks*
Old technology... (Score:2)
Great, we get a memory fault and.. (Score:2)
.
Great (Score:2)
The right data "saved" creates some sort of deadly super-bacteria.
Ok, maybe not. But it still seems like a bad idea for reasons I can't quite think of right now. . .
Duplicate? (Score:2)
in George W Bu... ooops, I've said too much
Instead of bugs in the code... (Score:2)
The 'dog-ate-my-homework' of the 21st Century... (Score:2)
DNA and the secret of life (Score:2)
ADDGCTCTCTDONTPISSITAWAYAGGGDTTDONTPISSITAWAYGG
Shoulda, Coulda, Woulda.... (Score:2)
Talk about a great way to smuggle/hide data....in your own DNA..
Junk DNA isnt junk DNA (Score:2)
ref Nature Genetics, 1 June 2002, pp. 159-165
Documentation in junk DNA: (Score:3, Funny)
So I don't jave a cold I have a database? (Score:2)
Um... (Score:2)
I mean, five minutes before they finally translate the data coded into these junk DNA, the Vogons are going to destroy Earth to make way for a Hyperspace Bypass.
Well, at least Arthur will survive.
Great. (Score:3, Funny)
Open Source software downloaded by a simple handshake or sneeze!
Then, when Microsoft gets in on the new industry (2 years too late as usual) all life on earth will be wiped out by an unchecked buffer overflow in blank bacteria media as it is sequenced by default when accessed by any device.
Seriously though, I wonder what the maximum storage capacity of something like that would be? How much data could be packed into a bacteria sequence? Would there be a really high read/write time to sequence the DNS? What about seek time? "Godammit come back here you bug!"
A new excuse not to go to work (Score:2)
Good grief--this got past the ACM editors? (Score:2)
It's inappropriate to refer to organisms as "bugs and weeds" in a biological context, not because it might hurt someone's feelings, but because it is biologically meaningless.
The idea itself is old and has been bounced around by SciFi writers as well as scientists. Whole stories have been written about ancient civilizations or space aliens encoding messages in DNA.
We're just storage for supa-smart aliens (Score:2)
Or maybe our DNA is being used as storage for some supa-smart aliens -- probably ones who can spell super...or galaxy's [slashdot.org] ;)
Re:This brings up a concern (Score:1)
A little on-topic, messing with DNA in such a way seems a little... sketchy.
Re:I wonder what bacteria would look like (Score:2)
After a year, a new EULA pops out. If you want the Service Pack that fixes the compromised immune system DNA, you have to agree to the EULA, which installs the auotmatic apoptosis DNA, forcing what the EULA euphemitically calls a "planned obselescence of all cellular function" just in time for the rollout of MS-DNA 2010.
Re:this idea was proposed in NYT millenium issue (Score:2)
Unless I'm totally wrong, of course.
Re:quaternary vs. binary (Score:3, Informative)
Now, although there are two strands in most DNA molecules, only one actually codes for proteins- the two strands are sometimes referred to as sense and nonsense (or antisense) strands. Both are involved in replication, however- a DNA helicase splits the two strands, each acts as a template for a new complementary strand. And both can and usually do contain all four bases, with the concentration of each base in either strand being totally independent. Since the two strands in a double helix are complementary, the amount of adenine must equal the amount of thymine and the amount of cytosine must equal the amount of guanine in both strands . In fact, recognizing this relationship led to the realization that complementary base-pairing occurs. The original IAAB is correct though- the genetic code is indeed base 4- although nature has chosen to not use it to its full potential (i.e. code for 64 different amino acids) in favor of building in some redundancy.
Re:quaternary vs. binary (Score:2)
Let me club you on the head with some ASCII art: According to your argument, this sequence contains no information.
Re:Jesus fucking christ on a vibrating bed. (Score:3, Insightful)
This has to be the 434340930493rd article where the presenter considers himself clever because he sees an insight... that everyone else does, too. Give it up. The abstract is interesting, if lacking in news or useful information, but its presentation is nothing but annoying.
The easiest way to disprove the "junk DNA" is to remove the "junk DNA" and see if the organism still works. Take for example a computer program where "junk code" is removed. If the program still runs then the code might not be important. However, the "junk code" could be comment code not removed by the compiler, error checking code (which will not activate unless the program hits an overflow then all heck breaks loose), or even just graphic data which would allow a program to run (but with a corrupted image display).
The basic truth of "junk DNA" is that unless somebody has a "decompile into a higher level language" device then removed code could case all sorts of things to go GOOEY later on when certain conditions are met. Heck, if we look back at the early days of BBS protocols you'd remember the FOO junk padding code at the end of many ZIP files just to compensate for buggy data transmission protocols. That padding allowed a certain amount of send errors at the end of a file to be tolerable while keeping the important parts of the file intact.