New Method To Revolutionize DNA Sequencing 239
An anonymous reader writes "A new method of DNA sequencing published this week in Science identifies incorporation of single bases by fluorescence. This has been shown to increase read lengths from 20 bases (454 sequencing) to >4000 bases, with a 99.3% accuracy. Single molecule reading can reduce costs and increase the rate at which reads can be performed. 'So far, the team has built a chip housing 3000 ZMWs [waveguides], which the company hopes will hit the market in 2010. By 2013, it aims to squeeze a million ZMWs [waveguides] onto a single chip and observe DNA being assembled in each simultaneously. Company founder Stephen Turner estimates that such a chip would be able to sequence an entire human genome in under half an hour to 99.999 per cent accuracy for under $1000.'"
99.3% accurate? (Score:5, Insightful)
That's, what, 28 incorrect base pairs out of 4000? I'm not a biologist, but is this considered an acceptable error rate? Even the hopes of 99.999% accuracy seems really awful when there are about 3 billion base pairs in a human genome.
I realize that we aren't going to be trying to make a cloned copy from this data, but what uses is this "good enough" for?
Re:99.3% accurate? (Score:5, Insightful)
I realize that we aren't going to be trying to make a cloned copy from this data...
What makes you so sure? Who knows where this will lead?
Re: (Score:2)
Re: (Score:3, Funny)
Sorry, you're asking for the impossible - I've never seen a well-organized mammoth hunt.
Re: (Score:3, Funny)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re:99.3% accurate? (Score:4, Funny)
Oh well. At least it's not the first time somebody missed a point on /.
Don't you mean "Oh well. At least it's not the first time somebody missed a point on /".
Re: (Score:2)
It's not too bad. I don't think the human version of the polymerase has a better error rate. However, while being in a biological entity, DNA replication also has other integrity checks.
Re: (Score:2, Insightful)
That's, what, 28 incorrect base pairs out of 4000? I'm not a biologist, but is this considered an acceptable error rate? Even the hopes of 99.999% accuracy seems really awful when there are about 3 billion base pairs in a human genome.
I realize that we aren't going to be trying to make a cloned copy from this data, but what uses is this "good enough" for?
More than good enough for forensic work at least, I'd wager.
Re: (Score:2)
Re: (Score:2)
What's this cloning fascination all over this page? If you want to clone, why would you need to first read out the entire genome? Nature is doing pretty well in copying DNA if you only find a nice cell to start with.
Further, in DNA-matching suspects they are doing pretty fine now with a handful of typical pieces of DNA. Don't need the whole genome for the first step.
Re:99.3% accurate? (Score:5, Interesting)
Re:99.3% accurate? (Score:5, Informative)
Re: (Score:2)
Talk about medicine for healing rather than profiteering eh?
Re:cost of sequencing is a reasonable determinant (Score:2, Insightful)
In medicine, the cost of a study, as well as its reliability, availability, and predictive val
Re: (Score:2)
Little I hope, the girl who wants to mate with me has to be really retarded and I don't want that to affect our kids :D
Re:99.3% accurate? (Score:4, Informative)
It's common practice in bioinformatics to measure the same data repetitively in an effort to reduce the error. While 0.993 isn't very good, (0.993)^3 is pretty awsome. In practice, the errors might be correlated (as in a flaw in the measuring system), so the benefit of re-measuring might not be exponential...however it should be darn close.
Re: (Score:2)
Re: (Score:3, Funny)
If you got B on the second run you'd be pretty sure it was incorrect.. ;-)
Re: (Score:2)
Re: (Score:2)
Re:99.3% accurate? (Score:5, Insightful)
There is a saying from the old sailing days. "Never set sail with two compasses". One is ok, three is better. But never two. The paralysis from not knowing which is right is far worse than being wrong and correcting later.
Re:99.3% accurate? (Score:4, Funny)
Unless of course all three of your compasses are giving you different readings. In that case, you simply yell "Where the hell is my sextant."
Re: (Score:2)
Posting to undo moderation. Modded parent Overrated by accident.
Re: (Score:2)
Unless of course all three of your compasses are giving you different readings. In that case, you simply yell "Where the hell is my sextant."
Astrolabe FTW!
Re:99.3% accurate? (Score:5, Funny)
If you were sequencing DNA and got a B then you'd seriously need to recheck the equipment (or the competence of the operator). Perhaps a T or a G, or even a C but never a B.
Re: (Score:3, Interesting)
This assumes that the method simply has a random chance of getting each data point wrong. What if it is something systematic with the method that causes it to read one gene wrong? In other words, it reads the gene as a 'T' every time despite it really being an 'A'. No matter how many tests you run, it will still result in a wrong answer.
Re: (Score:2, Informative)
If you RTFP (requires subscription), no systematic errors were detected
http://www.sciencemag.org/cgi/content/full/323/5910/133
nitpick (Score:3, Informative)
One base-pair does not a gene make.
Re: (Score:3, Insightful)
One base-pair does not a gene make.
But a one base-pair change can unmake the gene pretty well.
Tons of major debilitating mutations are due to a point mutation.
Re: (Score:2, Interesting)
Using a large sample, like the proposed Personal Genome Project [unsure if they have gotten in touch with any of those who expressed interest in participating] could be useful in showing any systematic mis-reads, as long as the Personal Genome Project is using another method to sequence the participant's DNA.
Re: (Score:2, Informative)
It's common practice in bioinformatics to measure the same data repetitively in an effort to reduce the error.
It's common practice on Slashdot to read the article before posting. From the abstract of the Science article:
Consensus sequences were generated from the single-molecule reads at 15-fold coverage, showing a median accuracy of 99.3%, with no systematic error beyond fluorophore-dependent error rates.
So that's 99.3% after averaging 15 reads. Not exactly replicating the same read 15 times..more like taking random starting points and aligning the results where they overlap, so that each base is covered in 15 different reads.
Don't get me wrong - this is really cool, and a massive speed-up over current "next-gen" sequencing. And I'm sure that it will get better.
To answer the GP - yes, this i
Re: (Score:2)
In what alternate universe are you reading slashdot?
prograde (1425683)
Oh. You'll figure it out, given time.
Re: (Score:2)
You must be new here :)
Re: (Score:2)
No, 0.993^3 is only 97.9%; how about 1-(1-0.993)^3 :)
Re: (Score:2)
It's common practice in bioinformatics to measure the same data repetitively in an effort to reduce the error. While 0.993 isn't very good, (0.993)^3 is pretty awsome. In practice, the errors might be correlated (as in a flaw in the measuring system), so the benefit of re-measuring might not be exponential...however it should be darn close.
I don't mean to trample your point, but three iterations wouldn't give a (0.993)^3, (which would equal 97.9%). The odds of an error would be 1-(0.007)^3, which would actually be 99.9999657%.
Re: (Score:2)
lol, you got moderated 5 informative but your math is flawed.
Do it three times to get 97.9% accuracy instead of 99.3? Great! :D
Guess it should be something like: :)
1 - ((1 - 0.993)^3) = 0.999999657 ? =P
Actually I have no idea, too tired to think about it but not 0.993^3 atleast
Re: (Score:2)
No, your formula results in an accuracy of .999, not .9999997. Sure, it's better than .993, but nowhere near as accurate as you wish.
.993 was accurate to .9930000?
Did you assume that
Furthermore, this entire formulaic analysis is useless without considering statistical error margins.
Re: (Score:2)
Given the expense of doing an entire genome, alternative is a 25% accuracy rate. What 99.3% does is let you do a bulk scan looking for interesting areas. Prospecting. Now you can adjust therapies the match likely genome sequences. "Ah Ms. X, I see you likely have gene XYZ. Medication A, B, and C won't work for you so let's try D."
In other words, you don't need perfect results to now bias the odds in you favor.
Re:99.3% accurate? (Score:5, Insightful)
If they can sequence the whole thing in less than 30 minutes one time with a 0.001% "read" error rate, my guess is that they can get it probabilistically near 100% correct in 2 hours or so.
By the way, what's the current error rate? Is it 0? (just asking)
Re:99.3% accurate? (Score:5, Interesting)
Or you could run a parallel processing setup, 3-5 sequencing chips all given the same sample at the same time. More expensive, but you'd get that effective 100% rate in the half hour time.
$5k for a genetic sequencer that could give effectively 100% accuracy in half an hour would be pittance for pretty much every hospital in the US.
Hell, the first malpractice lawsuit it prevents (detect a disorder that would make a commonly used treatment crippling or fatal to the patient) would pay for the machine 1000 times over.
Re: (Score:2)
Don't know - but it doesn't need to be 0 to be reliable. The genome is quite resistant to random mutation, having been subjected to it for all its existence.
Re:99.3% accurate? (Score:5, Funny)
1 Hour Genome Sequencing: 30,000 errors or less or YOUR MONEY BACK!
Re: (Score:2)
Do it several times over with different cells and "vote" on the inconsistencies between trials. If 5 out of 7 copies of the DNA look like the base at position X is tyrosine, then it's most likely that it's tyrosine.
Re: (Score:2)
I suppose that running it twice or trice will increase the accuracy a lot more.
Which makes it still blazing fast.
Re: (Score:2)
Re: (Score:2)
When I have sequences done the conventional way, I get less than 1000 base pair reads back. Generally 2 or 3 are ambiguous enough that the machine reads them incorrectly or not at all. 28 out of 4000 is the same as 7 out of 1000, so this is roughly the same magnitude of error. Less accurate than what we use now, but more economical to do really large sequences.
I don't know how the method works (site is slashdotted anyone got a DOI for the paper?) so it's hard to tell whether repeating the reads would
Re: (Score:2, Insightful)
Re: mistakes and inaccuracies...
You run two or three trials and do "a check sum" ...a la Raid inter leafing...errors stand out and are discarded..
Re: (Score:2)
That's, what, 28 incorrect base pairs out of 4000? I'm not a biologist, but is this considered an acceptable error rate? Even the hopes of 99.999% accuracy seems really awful when there are about 3 billion base pairs in a human genome.
That's a very good question, but consider that 100% is impossible. Even the cell's own machinery, under development for millions of years, makes mistakes at a frequency that would be lethal if that's all there was.
In this case, the error rate seems in the neighborhood of rival techologies. The way to deal with it is the same way the cell uses: redundancy. Sequence segments or the whole thing more than once, the likelyhood of bases in error is significantly decreased. If you run 3 sequencings, there's ev
Re: (Score:2)
... and to prove that last point I just realized that I was redundant with the non-coding DNA and introns. I think. No wait, I meant to do that, this way if you misread "introns" it will still be covered by the "non-coding DNA" bit. And that's the last biochemistry joke out of me today.
Re: (Score:2)
There's an easy and obvious way around this - just run 3 simultaneous instances and error-check by consensus. Still able to run the whole thing in under a half hour and still pretty cheap at ~$3000.
Re: (Score:2)
The idea is to sequence each portion of the genome many times over. With enough redundancy you can detect these errors, so your final annotated copy would have a much higher accuracy.
Re: (Score:2)
It probably also will find a niche, in spite of its error rate; because there is just so damn muc
Re: (Score:2)
Re: (Score:2)
So, instead of $1000, you must spend $3000 and get the same result in one hour that we needed 10 years and several millions to get at the 90's... I see, that is useless.
Re: (Score:2)
Re: (Score:2, Insightful)
I realize that we aren't going to be trying to make a cloned copy from this data, but what uses is this "good enough" for?
It's most likely good enough to deny you health coverage. Pre-existing condition? Now risk can be assessed on pre-existing genes.
Re: (Score:2)
The final estimate is 99.999%, presumably including some redundancy or error checking.
This is still one error in 100,000 base pairs, which is tens of thousands of errors in the 3 billion pairs of the human genome.
Of course, we're not talking about computer code here - this code has withstood (or rather adapted to) millions of years of inaccurate copying. Most of the errors would probably hit the vast majority of inactive areas, or be compensated by the natural redundancy mechanisms.
Re: (Score:2)
Run the data among multiple chips for verification. Run amongst enough chips, the errors should be detectable/correctable.
Re: (Score:2)
That's, what, 28 incorrect base pairs out of 4000? I'm not a biologist, but is this considered an acceptable error rate?... but what uses is this "good enough" for?
three-eyed fish?
Re: (Score:2)
Re: (Score:2)
3,000,000,000 * 0.99999 = 2,999,970,000
So your error count is 30,000.
Sub-$1000 genome sequencing (Score:4, Funny)
Sub-$1000 genome sequencing will put the creation of 'designer' kids into the realm of the affordable for much of the middle class. Scary stuff. Now we just need to combine that with cheap and reliable cloning techniques and my plans for world domination will be comlete!
Re:Sub-$1000 genome sequencing (Score:4, Funny)
Hopefully you'll fix that nasty intercalary deletion [wikipedia.org] bug first!
Re: (Score:2, Funny)
I will, but I gotta P first!
Re: (Score:3, Funny)
Hopefully you'll fix that nasty intercalary deletion bug first!
As long as it's not a missing G, T, C, or A, he'll be OK.
Re: (Score:2)
Sorry, it comes to my attention that you are missing a 'p' in the word 'complete', without a 'p', the world is never completed.
P, as in...I think you will figure that out by checking the spam.
Reading a sequence is not the same as creating one (Score:2, Insightful)
Real applications of this, however, include looking for gene sequences in adults which predispose them to diseases (e.g. breast cancer) and then providing counseling and monitoring commensurate with that risk, a far less expensive effort than monitoring everyone for the same disease, even if they aren't at risk. Also, one could use
Real-Time DNA Sequencing from Single Polymerase Mo (Score:2, Informative)
Abstract:
Kicks ass on Moore's Law... (Score:5, Insightful)
I think this qualifies as a true 'technological singularity' [wired.com]
piggy backs on Moore's Law (Score:2)
Re: (Score:2)
error correction (Score:3, Insightful)
Something like the error correction on an audio compact disk ?
Re:error correction (Score:5, Insightful)
Yes. It's called "natural selection". :P
Article in Science (Score:2, Informative)
I assume that the hardware at Science can withstand a slashdotting better than the crappy blog linked in the summary:
http://www.sciencemag.org/cgi/content/abstract/323/5910/133 [sciencemag.org]
GATTACA here we come! (Score:2)
1/2 hour for $1000, eh? And in another 5-10 years we'll cut that in half or more, both time and cost. It looks like the instant gene sequencing tech from GATTACA will be with us in most of our lifetimes. But even with this announced breakthough it'll be functionally the same.
maybe 60 to 1000 are significant? (Score:2)
Although humans differ from one another in about 0.1% base pairs for a total of 3 million, the number of difference that describe human variability may be vastly smaller than this. First you discard non-coding DNA which gets you done to 30,000.
Re: (Score:3, Insightful)
Although humans differ from one another in about 0.1% base pairs for a total of 3 million, the number of difference that describe human variability may be vastly smaller than this. First you discard non-coding DNA which gets you done to 30,000.
Except that when our differences are so small, the non-coding regions are even more important. They control what genes are active and to what degree. That's nearly as important as the genes themselves.
Genes are only part of the puzzle. You need to know what to do with them, and non-coding regions provide some of that along with the cellular machinery.
Scientists used to call them "junk" DNA where junk == "I can't figure it out". Why would cells spend all that energy maintaining something useless? Not very li
I guess I can drop my X prize plans (Score:3, Informative)
ForeSight.Org Down (Score:2)
I guess they didn't have the foresight [foresight.org] to use a real host.
I'll be here all night folks try the steak.
not for cloning, but therapy (Score:2)
Slashdotted: Abstract and Fulltext (Score:3, Informative)
re the error rate thing and over view (Score:2)
This is just an historical accident; 99.9% was what could be done with what people judged "reasonable" effort and cost a few years ago; unless you know whatyou are going to use the sequence for, you don't know what error rate is acceptable
There are medical test that rely on dna sequence, eg myriad makes a fortune from sequencing the gene that gives women hereditary breast cancer. I don't know what there claimed error rate would be, but that would show you what is acceptable in todays clinical market place.
A
Grammar ambiguity (Score:3, Interesting)
Company founder Stephen Turner estimates that such a chip would be able to sequence an entire human genome in under half an hour to 99.999 per cent accuracy for under $1000
Does that mean that the chip costs $1000 or that each human genome processed costs $1000?
Re: (Score:2)
Both. Each chip processes one human genome.
Actual link to scientific article (Score:2)
http://www.sciencemag.org/cgi/content/abstract/323/5910/133 [sciencemag.org]
Abstract:
We present single-molecule, real-time sequencing data obtained from a DNA polymerase performing uninterrupted template-directed synthesis using four distinguishable fluorescently labeled deoxyribonucleoside triphosphates (dNTPs). We detected the temporal order of their enzymatic incorporation into a growing DNA strand with zero-mode waveguide nanostructure arrays, which provide optical observation volume confinement and enable parallel, sim
New Scientist Write Up (Score:3, Informative)
Error Correction (Score:2, Insightful)
Furthermore, if you're using a technique like this to map a person's genome, you can be clever about
454 Sequencer isn't ~that~ bad. (Score:2)
Re: (Score:2)
Re: (Score:2)
We have Uma Thurman NOW...
Re: (Score:2)
Unless we perfect cloning, there are going to be a few billion people who find little consolation in that.
Re: (Score:3, Insightful)
Using 454 sequencing you get average read lenghts of ~400-500 bp
I suspect someone had confused 454 with the other popular next-gen sequencing technique from Illumina, which does give very short reads.
Read lenghts around 20 bp would be pretty much useless. At least for de novo sequencing..
Not necessarily. If you can drive the cost/base down far enough, you can make short reads worthwhile if you use a shotgun approach and try for large-scale coverage. Especially if you can produce the short reads at a lower rate of time/base.
Re: (Score:3, Insightful)
The use of short reads for de novo assembly only makes sense if you want a rough draft of a genome, not the complete thing. There are way too many transposable elements, repeats, variation, etc. to accurately reconstruct even a bacterial genome with short reads. Nowadays, people don't even bother trying to piece it all together. They get down to a few dozen large fragments and say "good enough". It just costs too much to get the last 1-2% with a random sequencing approach.
Re: (Score:3, Funny)
Gattaca was supposed to show us a dark future. It was supposed to be a cautionary tale. The message was, "if your DNA isn't good enough, you'll have to make do banging Uma Thermon - poor you."
I don't think the producers thought their cunning plan all the way through.
Re: (Score:3, Insightful)
Re: (Score:3, Funny)
error's
That character you're using... I don't think it means what you think it means...
You must be new here. You see, here on teh interweb, many of us are terribly afraid of word's ending in "s" - Plural's, possessive's, contraction's with the word "is", and occasionally even name's. The apostrophe is a polite way of warning the general reading public that an "s" is approaching so that we can brace ourselve's accordingly.
Re: (Score:2)
99.3% of your base are belong to us!
Fixed it for you..