Slashdot Log In
The 1000 Genomes Project
Posted by
kdawson
on Wed Jan 23, 2008 03:18 AM
from the reaching-for-statistical-significance dept.
from the reaching-for-statistical-significance dept.
jd writes "An international consortium of specialists in genetics has announced the 1000 Genomes Project, in which at least 1,000 people from around the world will have their genomes fully sequenced as part of an effort to discover the relationship between genetics and disease. At present, over 100 regions of DNA are known to be related to illnesses, but the maps that exist are vague and are drawn from an extremely small population pool. According to the article, this results in the need for slow, expensive, and laborious studies to pinpoint causes, especially for rarer conditions. This project aims to find conditions that might only appear once in every 2,000 people (though how they intend to do that with half that number is unclear). The researchers hope to massively speed up the diagnosis of genetically linked illnesses and to improve the reliability of such diagnoses."
Related Stories
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Chinese (Score:2, Informative)
You can see the list of all participants (including funders) here [1000genomes.org].
Re: (Score:3, Informative)
Selection (Score:5, Insightful)
Well, they could sequence the DNA of people known to have rare diseases.
Re: (Score:2, Informative)
From TFA:
RTFA (Score:4, Insightful)
Parent
Rare Conditons (Score:4, Informative)
Parent
1 in 2000 people (Score:5, Informative)
Let's try to make it clearer, then.
The probability that a given condition appears in an individual is 1 in 2000, or 0.0005. The probability that it does not appear in that individual is 0.9995. The probability that it does not appear in any of 1000 individuals is 0.9995^1000 = 0.6 approximately; and the probability that at least one of the 1000 individuals has it is 0.4. Not bad at all. (If you used 2000 people, the probability that at least one of them would have it would improve to about 0.6.)
Suppose you aren't interested in just one conditions, but in lots of conditions -- say, ten of them. The probability that at least one individual would have at least one of those conditions is 1 - 0.9995^(1000*10) = 0.993 == ie, practically certain.
They really ought to teach basic probability theory in schools...
Re:1 in 2000 people (Score:5, Insightful)
Let's try to make it clearer, then.
The probability that a given condition appears in an individual is 1 in 2000, or 0.0005. The probability that it does not appear in that individual is 0.9995. The probability that it does not appear in any of 1000 individuals is 0.9995^1000 = 0.6 approximately; and the probability that at least one of the 1000 individuals has it is 0.4. Not bad at all. (If you used 2000 people, the probability that at least one of them would have it would improve to about 0.6.)
Suppose you aren't interested in just one conditions, but in lots of conditions -- say, ten of them. The probability that at least one individual would have at least one of those conditions is 1 - 0.9995^(1000*10) = 0.993 == ie, practically certain.
They really ought to teach basic probability theory in schools...
Parent
Re: (Score:3, Funny)
Re: (Score:3, Insightful)
you neglect the fact that each person has two sets of genes, one inherited from their mother, the other from their father. that brings the total number of genes to 2000 sets. and it's also likely they're interested in many more than ten conditions. so you should think more in terms of a probability density function of conditions found versus their rarity.
Re: (Score:3, Informative)
Furthermore another issue is that the genome is one huge causality network - for all but the most simplest disorders you'll need to have a cascade of genes to get a p
Re: (Score:2)
Re: (Score:3, Interesting)
Not quite... (Score:3, Insightful)
Re: (Score:2)
when one sequences a person, one captures the variation on two human genomes at once.
Of course, this all relies on the coverage you sequence at, and one option for
the 1,000 genomes project is doing this at low (2x?) coverage, using pretty sophisticated
methods to combine statistical power between sample datasets.
The "1,000" though is more a round number that is in the right range. it might well be
1346 people or something like th
Biology geek solves your problem for you. (Score:3, Informative)
They really ought to teach basic probability theory in schools...
Or maybe basic biology maybe? The Hardy-Weinberg equation plus a little basic algebra solves the problem:
p + q = 1
p^2 + 2pq + q^2 = 1
P and q are the frequency of a specific gene (assuming there are only two variants, but lets KISS.) Each organism has two copies of a given gene. They can be pp, pq, or qq. So the number of p genes and q genes must equal 100%. And the number of people who are pp, qp, or qq must equal 100%, hence the two equations.
In the case of a simple autosomal recessive gene, t
1000 Gnomes Project? (Score:4, Funny)
Re:1000 Gnomes Project? (Score:4, Funny)
Parent
Re: (Score:3, Funny)
This is made of win (Score:2)
This is fantastic news (Score:5, Insightful)
From a lab level where what used to be a weeks work with lots of chemicals and processing is now usually a 20 minute protocol with a kit from Quagen. what used to be massive amounts of work with hundreds of gels and digestions and labeling steps to analyse nucleic acid sequences is now a few days with an affymetrix kit, giving far more accurate and useable results. Across every step this progress has been rapid.
And in the future, near-term like within a decade, all these methods will become outdated and replaced with near-realtime analysis and diagnosis. The best point in all of this is that no matter how advanced medical tech has become, the limiting factor has been that it's necessary to actually BRING your disease ridden body to the hospital or doctor. The rise of companies like www.decodeme.com [decodeme.com] is what i expect DNA assesment to be like in the future. You send off some samples you scrape off your cheek yourself, and within a few days you get a full diagnosis on any known predisposition to disease or genetic problems.
Which is why a lot more attention should be put into the debate on morality and genetic profiling. It's going to be here before you can blink, it might be nice to know what you think about using embryo selection to wipe out CF before it becomes a possibility.
Where do I sign up? (Score:2, Interesting)
www.marfan.org
Not 1 in 2000 (Score:4, Informative)
Re: (Score:2)
Not in the sense you probably mean: your DNA does not adapt or "change" during your lifetime. Some cells have some changes to their DNA, either by accident or on purpose, but that generally amounts to inactivating or removing genetic material that a specialized cell won't be needing anymore before its death.
Re: (Score:2)