Slashdot Log In
The 1000 Genomes Project
Posted by
kdawson
on Wed Jan 23, 2008 02:18 AM
from the reaching-for-statistical-significance dept.
from the reaching-for-statistical-significance dept.
jd writes "An international consortium of specialists in genetics has announced the 1000 Genomes Project, in which at least 1,000 people from around the world will have their genomes fully sequenced as part of an effort to discover the relationship between genetics and disease. At present, over 100 regions of DNA are known to be related to illnesses, but the maps that exist are vague and are drawn from an extremely small population pool. According to the article, this results in the need for slow, expensive, and laborious studies to pinpoint causes, especially for rarer conditions. This project aims to find conditions that might only appear once in every 2,000 people (though how they intend to do that with half that number is unclear). The researchers hope to massively speed up the diagnosis of genetically linked illnesses and to improve the reliability of such diagnoses."
Related Stories
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Chinese (Score:2, Informative)
You can see the list of all participants (including funders) here [1000genomes.org].
Re: (Score:3, Informative)
Re: (Score:2)
Agreed, folks in the US seem to think they hold a monopoly on scientific acumen, when in fact, our hold isn't "slipping", it's passed into "slipped". Other nations have been rocketing past us in terms of not only scientific growth, but social and financial as well. The people in the US are told on a daily basis that they liv
Re: (Score:2)
Listen, John Edwards, I know the campaign isn't working out too well for you, but you're not going to save it by posting on Slashdot.
Re: (Score:2)
You must be new here.
Selection (Score:5, Insightful)
Well, they could sequence the DNA of people known to have rare diseases.
Re: (Score:1, Troll)
Re: (Score:2, Informative)
From TFA:
Re: (Score:2)
RTFA (Score:4, Insightful)
Parent
Rare Conditons (Score:4, Informative)
Parent
Re: (Score:2)
IANA Scientist, but the article seems to portray it as if they are simply trying to make a catalog of what normal variations exist.
But, there is one part that still does confuse me. From TFA:
Wouldn't it make much more sense to have a detailed listing of their medical history? For example, if one of those people has Alz
Re: (Score:2)
1 in 2000 people (Score:5, Informative)
Let's try to make it clearer, then.
The probability that a given condition appears in an individual is 1 in 2000, or 0.0005. The probability that it does not appear in that individual is 0.9995. The probability that it does not appear in any of 1000 individuals is 0.9995^1000 = 0.6 approximately; and the probability that at least one of the 1000 individuals has it is 0.4. Not bad at all. (If you used 2000 people, the probability that at least one of them would have it would improve to about 0.6.)
Suppose you aren't interested in just one conditions, but in lots of conditions -- say, ten of them. The probability that at least one individual would have at least one of those conditions is 1 - 0.9995^(1000*10) = 0.993 == ie, practically certain.
They really ought to teach basic probability theory in schools...
Re:1 in 2000 people (Score:5, Insightful)
Let's try to make it clearer, then.
The probability that a given condition appears in an individual is 1 in 2000, or 0.0005. The probability that it does not appear in that individual is 0.9995. The probability that it does not appear in any of 1000 individuals is 0.9995^1000 = 0.6 approximately; and the probability that at least one of the 1000 individuals has it is 0.4. Not bad at all. (If you used 2000 people, the probability that at least one of them would have it would improve to about 0.6.)
Suppose you aren't interested in just one conditions, but in lots of conditions -- say, ten of them. The probability that at least one individual would have at least one of those conditions is 1 - 0.9995^(1000*10) = 0.993 == ie, practically certain.
They really ought to teach basic probability theory in schools...
Parent
Re: (Score:3, Funny)
Re: (Score:2, Funny)
Re: (Score:3, Insightful)
you neglect the fact that each person has two sets of genes, one inherited from their mother, the other from their father. that brings the total number of genes to 2000 sets. and it's also likely they're interested in many more than ten conditions. so you should think more in terms of a probability density function of conditions found versus their rarity.
Re: (Score:3, Informative)
Furthermore another issue is that the genome is one huge causality network - for all but the most simplest disorders you'll need to have a cascade of genes to get a p
Re: (Score:2)
Re: (Score:2)
Knowing them they would attack the problem at right angles to how its normally tackled.
Just like how they made their translation service by feeding stacks of data in to a probability engine to get a extremely accurate translator, they could throw in tons of genetic data and out will pop the answers with pretty good accuracy.
These sorts of problems are ideal for brute force techniques like that.
Only problem is you need one of Google's datacenters to do it properly.
Re: (Score:3, Interesting)
Re: (Score:2)
And if you, on top, follow the idea that physical conditions (starting at the molecular level, INGBER [childrenshospital.org]) are a major determinant of gene expression you might eventually come to your own conclusions regarding the value of simply linking 'the genome' to conditions of ill health.
CC.
Re: (Score:2)
Re: (Score:2)
Um, they did back when I went to high school. What they really need to do is do a better job of teaching stat math so terms like 1/X won't confuse those that made it through high school educated people when the sample size is smaller than X.
In college, you learn most stats, polls, and surveys are just plan lies to push you in the desired direction.
Not quite... (Score:3, Insightful)
Re: (Score:2)
when one sequences a person, one captures the variation on two human genomes at once.
Of course, this all relies on the coverage you sequence at, and one option for
the 1,000 genomes project is doing this at low (2x?) coverage, using pretty sophisticated
methods to combine statistical power between sample datasets.
The "1,000" though is more a round number that is in the right range. it might well be
1346 people or something like th
Re: (Score:2)
Re: (Score:2)
Biology geek solves your problem for you. (Score:3, Informative)
They really ought to teach basic probability theory in schools...
Or maybe basic biology maybe? The Hardy-Weinberg equation plus a little basic algebra solves the problem:
p + q = 1
p^2 + 2pq + q^2 = 1
P and q are the frequency of a specific gene (assuming there are only two variants, but lets KISS.) Each organism has two copies of a given gene. They can be pp, pq, or qq. So the number of p genes and q genes must equal 100%. And the number of people who are pp, qp, or qq must equal 100%, hence the two equations.
In the case of a simple autosomal recessive gene, t
Re: (Score:2)
Many replies in one, here; and a day late so I guess nobody will read it.
If a gene exists in 1/50 people and you sample 1000, the odds that you wouldn't find it is pretty remote.
But you'd only care if you see the disease. That's the point you and other geneticists above are missing. Otherwise you'd never know whether the gene is linked with a disease in its variant form, or is harmless.
For the story to be convincing, in your sample population, you need to show that one person has the disease AND tha
1000 Gnomes Project? (Score:4, Funny)
Re:1000 Gnomes Project? (Score:4, Funny)
Parent
Re: (Score:3, Funny)
Re: (Score:2)
Hmmm... They could use the 1000 gnomes as a baseline data to determine:
- how many gnomes are needed to be joined end - to - end to encircle the earth
- at the equator
- at the horse latitudes
- how many are needed to drill from the america and out to china
- the number of gnomes needed to reach the moon and back
- the volume of gnomes needed to fill a library of congress
- the difference between a metric gnome and t
This is made of win (Score:2)
This is fantastic news (Score:5, Insightful)
From a lab level where what used to be a weeks work with lots of chemicals and processing is now usually a 20 minute protocol with a kit from Quagen. what used to be massive amounts of work with hundreds of gels and digestions and labeling steps to analyse nucleic acid sequences is now a few days with an affymetrix kit, giving far more accurate and useable results. Across every step this progress has been rapid.
And in the future, near-term like within a decade, all these methods will become outdated and replaced with near-realtime analysis and diagnosis. The best point in all of this is that no matter how advanced medical tech has become, the limiting factor has been that it's necessary to actually BRING your disease ridden body to the hospital or doctor. The rise of companies like www.decodeme.com [decodeme.com] is what i expect DNA assesment to be like in the future. You send off some samples you scrape off your cheek yourself, and within a few days you get a full diagnosis on any known predisposition to disease or genetic problems.
Which is why a lot more attention should be put into the debate on morality and genetic profiling. It's going to be here before you can blink, it might be nice to know what you think about using embryo selection to wipe out CF before it becomes a possibility.
Where do I sign up? (Score:2, Interesting)
www.marfan.org
Not 1 in 2000 (Score:4, Informative)
Astonishing (Score:2, Insightful)
This whole idea of "early detection" pisses me off; it just reminds me of the drug industry. It really does come down to the almighty human thinking they know what they're doing. Hopefully we
Re: (Score:2)
And as we know more, genetic therapy will become more effective. But feel free to go to live in a tribe somewhere if you hate progress that much.
Re: (Score:2)
Someone didn't... (Score:2)
(though how they intend to do that with half that number is unclear)
Here's how... [wikipedia.org] Alleles.
misleading science (Score:2, Informative)
Even within the "normal", euchromomatic, sequencable DNA, there are gaps that have not been sequenced.
Beyond this, you need to know haplotypes - that is, for most of your DNA
Invaluable to a cure for auto-immune diseases. (Score:2)
Before the survey, only one genetic marker was known for the disease: HLA-B27, uncovered over three decades ago. If you have that genetic marker, you're almost certain to get AS
Chromosomes are diploid! (Score:2, Informative)
Remember that human chromosomes are diploid - we have two copies of (most) genes. (A few of the genes on the male Y chromosome have no analogue on the X chromosome, but that's a very small percentage of the human genome). So in total they will have roughly 2000 samples for each gene - 2 for every individual.
Of course, that doesn't provide a correlation with specific genetic diseases - but here classical genetics techniques allow you to get an insight on how some of those diseases might be related to spec
This makes me wonder (Score:2)
I think I'd make a good mad scientist...
Re: (Score:2)
Not in the sense you probably mean: your DNA does not adapt or "change" during your lifetime. Some cells have some changes to their DNA, either by accident or on purpose, but that generally amounts to inactivating or removing genetic material that a specialized cell won't be needing anymore before its death.
Re: (Score:2)
Re: (Score:2, Informative)
I believe you are talking about the DNA sequence, and not the structure of DNA itself? The DNA sequence is relatively unchanged throughout your life. The only things that changes it, are spontaneous mutations and pathogen-induced mutations (Bacteria, but especially viruses). Most of the time, cells with lethal malfunctions in their DNA undergo self-killing, known as apoptosis. Others that behave unnormally, either due to infection,