Chinese Lab Speeds Through Genome Processing With GPUs 408
Eric Smalley writes "The world's largest genome sequencing center once needed four days to analyze data describing a human genome. Now it needs just six hours. The trick is servers built with graphics chips — the sort of processors that were originally designed to draw images on your personal computer. They're called graphics processing units, or GPUs — a term coined by chip giant Nvidia. This fall, BGI — a mega lab headquartered in Shenzhen, China — switched to servers that use GPUs built by Nvidia, and this slashed its genome analysis time by more than an order of magnitude."
The Future Is Here!! (Score:5, Funny)
Re:The Future Is Here!! (Score:5, Insightful)
If one reads to page 2 of tfa, they only claim the technique works well in this instance. They go on:
Even for computer-intensive aspects of analysis pipelines, GPUs aren’t necessarily the answer. “Not everything will accelerate well on a GPU, but enough will that this is a technology that cannot be ignored,” says Gollery. “The system of the future will not be some one-size-fits-all type of box, but rather a heterogeneous mix of CPUs, GPUs and FPGAs depending on the applications and the needs of the researcher.”
and
GPUs have cranked up the speed of genome sequencing analysis, but in the complicated and fast-moving field of genomics that doesn’t necessarily count as a breakthrough. “The game changing stuff,” says Trunnell, “is still on the horizon for this field.”
So yes, the article is a bit breathless, but if utilizing GPUs helps cure my potentially impending genetic disorder, I'm all for it.
Re: (Score:1)
Yes, unfortunately, you will be unable to pay for that cure, unless you own the business you work for.
Re:The Future Is Here!! (Score:4, Insightful)
Unless you are fortunate to live in a civilised part of the world with a universal healthcare system.
Re: (Score:1)
You stinking communist!
Don't think you can keep giving away the cures to your patients that we develop so we can gouge our victims er patients.
Re: (Score:2)
Your point? *snicker*
Re: (Score:1)
I used to work with Texas Instruments TMS34010/32020/34082 processors in the 1990's. These were surface mounted onto a VGA graphics board, along with a number of TMS34082 vector processors and a few megabytes of memory (Hercules Graphics Station Card as an example). They had this really neat feature where you could cross-compile, download and execute programs on these boards as "extensions". You could do anything from encryption/decryption, image-processing to drawing lines and rendering triangles.
Initially
Re: (Score:1)
TROLL FIGHT!
Re: (Score:1)
you're worthless.
Re: (Score:1)
cower in my shadow in your paid in full shanty some more, feeb.
you're completely pathetic.
Re: (Score:1)
you're worthless.
you've obviously never been informed (Score:1)
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
cower in my shadow some more, feeb.
you're completely pathetic.
Re: (Score:1)
Re: (Score:1)
if you believe you're being attacked, you're even dumber than that... but considering i merely responded to your idiocy with a statement of fact concerning it's idiotic content, it's already clear you're an idiot and an ignorant hypocrite.
you are not defending yourself. you are denying the existence of yourself. you don't un
Re: (Score:1)
cower in my shadow in your paid in full shanty some more, feeb.
you're completely pathetic.
Re: (Score:2)
LMAO -> http://www.youtube.com/watch?v=JrUGYT_gl9I [youtube.com] = MikeK, lol...
APK
P.S.=> How many accounts is that you're up to now? Well, lol, let's see: I said 500 on a guess, & MichaelKristopeit412 looks like your 412th, lmao... I was wrong, but (rotflmao)... who cares?? Why??
Well - You're the "Man..." (See song above, lol, same quality as you, lol).. /quote)... apk
Troll fight! Humongous host file vs. pig fucker.
The only possible winner is - everybody else.
Re: (Score:1)
you are NOTHING
Re: (Score:1)
clone53421 = stephen alongi... i am not stephen alongi... i am michael kristopeit. you don't understand the difference, because you're an idiot.
cower in my shadow some more, feeb.
you're completely pathetic.
Re: (Score:1)
clone53421 is stephen alongi. i am michael kristopeit. stephen alongi =/= michael kristopeit. you can't comprehend that simple application of the transitive property because you're an idiot.
cower in my shadow some more, feeb.
you're completely pathetic.
Re: (Score:1)
clone53421 is stephen alongi. i am michael kristopeit. stephen alongi =/= michael kristopeit. you can't comprehend that simple application of the transitive property because you're an idiot.
you're also an ignorant hypocrite.
cower in my shadow some more, feeb.
you're completely pathetic.
Re: (Score:1)
cower in my shadow some more, feeb.
you're completely pathetic.
Re: (Score:1)
you're an ignorant hypocrite.
clone53421 = stephen alongi. i am michael kristopeit. stephen alongi is no-one's ma. michael kristopeit =/= stephen alongi... you don't understand the difference because you're an idiot.
cower in my shadow some more, feeb.
you're completely pathetic.
Re: (Score:1)
alex, p, k?... is that what your mom tells you to when you've been alone in the basement too long without a potty break?
still living in the bushes in the middle of shantytown, alex? [google.com]
you are NOTHING
you've obviously never been informed (Score:1)
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
cower in my shadow some more, feeb.
you're completely pathetic.
ur mum's FACE're going to regret you (Score:1)
cower in my shadow some more, feeb.
did jay little regret you claiming that he libeled you? you said that he would too, but it seems his website is still up after many years, and no act [jaylittle.com]
Re: (Score:1)
you're worthless.
Re: (Score:2)
Oh geez. For some reason that was posted anonymously. Old habits die hard...
By the way, sedans aren't sports cars.
Re: (Score:2)
trolllolllollloll.
Wanna see the .40? Okay sure. [ompldr.org]
you've obviously never been informed (Score:1)
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
cower in my shadow some more, feeb.
you're completely pathetic.
ur mum's FACE're going to regret you (Score:1)
cower in my shadow some more, feeb.
did jay little regret you claiming that he libeled you? you said that he would too, but it seems his website is still up after many years, and no act [jaylittle.com]
News for nerds (Score:5, Funny)
I always wondered what GPUs are. Thanks Slashdot!
Re: (Score:1)
Genome Processing Unit
Re: (Score:3, Funny)
1. Steal underpants
2. Process underpants
3. Profit
Re: (Score:1)
I thought it was 'Gnomes Processing Underpants' and that we finally had that elusive missing step
1. Steal underpants 2. Process underpants 3. Profit
I knew Bitcoin mining smelled funny...
Re: (Score:2)
We know what they are, but we've been using them to play Battlefield 3 or create phony "untraceable" currency for drug dealers instead.
Re:News for nerds (Score:5, Funny)
No, it's "Guinea Pig Units"
Re: (Score:1)
And there I was thinking there was a chance it could be "Guanxi Publicity Units".
Re: (Score:1)
Re: (Score:2)
Re: (Score:2)
I always thought it stood for General Processing Units!
Didn't you read the summary title? It's obviously Genome Processing Unit.
Summary dumbed down enough for you? (Score:3, Insightful)
Explaining what a GPU is in a slashdot summary? Come on.
This is similar to someone telling you a story about something funny happening to them while shopping at the store, pausing mid-story to inform you that a 'store' is a business where goods are displayed and exchanged for a papery substance called 'money'.
Re: (Score:1)
It might have some use. "store" is chiefly American English. British would prefer "shop", though they should definitely be able to understand what you are talking about. But in both BE and AE "store" would also mean "to keep".
Re: (Score:1)
I don't mind the explanations in the submitter's summary too much: it's better than some of the jargon/acronym laden summaries that totally obfuscated some summaries, and abstracts need to avoid jargon in order to pull in interested readers. I do, however, mind that the summary just plagiarizes the first few sentences of the Wired article. I'm also unhappy with the watered-down article; summaries and abstracts need to avoid jargon for clarity, but articles need to use the right words to convey their points
This article is almost painfully dumbed down... (Score:2)
Re: (Score:3)
Hell, even the summary is condescending.
This is Slashdot. You don't have to explain what a GPU is.
Re: (Score:1)
I'd say there are too many summaries that fail to explain the acronyms used. Not all readers have the exact same knowledge.
Re: (Score:1)
Re: (Score:1)
That's an unfair comment. Acronyms can stand for more than one thing, and a good writer's intent is not to show how much smarter they are than their readers.
Re: (Score:2)
I agree, and it would be a better policy to define acronyms the first time they are used. The same could be said about the names of software packages in other summaries. I'm mystified that so many commenters are miffed that GPU is explained.
Re:This article is almost painfully dumbed down... (Score:5, Informative)
The summary is pulled directly from the top of the article.
Here's the article from HPC Wire [hpcwire.com] and some details from nvidia [nvidia.com] as well as the nvidia press release [nvidia.com]
Re: (Score:2)
And then we get a verbose definition of "GPU"... one everyone is familiar with. The lack of consistency might be explainable, but it's kinda funny.
Reader's Digest (Score:1)
A reminder (Score:5, Insightful)
Re: (Score:3)
The only reminder should bethat processors designs for different types of math can do that math faster than processors designed for other types of math.
I don't understand why companies don't realize that. Running graphics on a floating point processors is like using a train to go across an ocean. Sure you can do it doesn't mean that it is a good idea.
Re: (Score:3)
"The only reminder should bethat processors designs for different types of math can do that math faster than processors designed for other types of math."
Not all kinds of math can be parallelized.
Re: (Score:2)
you have proof of this ?
I don't have room to write it in the margins of this website. The borked .js keeps killing it.
Re: (Score:2)
http://en.wikipedia.org/wiki/Amdahl's_law [wikipedia.org]
Wonder the speed for using AMD (Score:3)
Re: (Score:2)
Re: (Score:3)
I always wondered why FPGA's aren't used for this kind of stuff, or if they already are. I would imagine they would even be faster because you can design a circuit specifically optimized for the problem.
I think they are to some degree, but there is a major barrier to adopting them: they require specialized programming knowledge which you won't find in most genomics centers. GPUs are commodity technology and APIs like CUDA are easier to tackle (and more transferable to other fields) than FPGA programming.
Re: (Score:1)
Re: (Score:2)
From the article..... (Score:1)
According to Jackson Lab’s TeHennepe, the feat BGI and NVIDIA pulled off was porting key genome analysis tools to NVIDIA’s GPU architecture, a nontrivial accomplishment that the open source community and others have been working toward.
Can anyone familiar with current efforts shed more light on this? Who is working on open source bioinformatics and how much work has been done?
Re: (Score:2)
What the article doesn't say:
A better article (Score:5, Informative)
Excerpt:
At BGI, he says, they are currently able to sequence 6 trillion base pairs per day and have a stored database totaling 20 PB.
The data deluge problem stems from an imbalance between the DNA sequencing technology and computer technology. According to Dr. Wang, using second-generation sequencing machines, genomes can now be mapped 50,000 times faster than just a decade ago. The technology on track to increase approximately 10-fold every 18 months. That is 5 times the rate of Moore's Law, and therein lies the problem.
Obviously it would be impractical to upgrade one's computational infrastructure at that rate, so BGI has turned to NVIDIA GPUs to accelerate the analytics end of the workflow. The architecture of the GPU is particularly suitable for DNA data crunching, thanks to its many simple cores and its high memory bandwidth.
Re:A better article (Score:5, Informative)
Part of the problem is Low Standards (Score:4, Informative)
BGI is certainly one of the biggest offenders (Cucumber and Pigeonpea are both examples of the sort of terrible genomes-in-name-only BGI puts out) but I think the real problem is that Illumina sequence data is so cheap people keep trying to use it to sequence genomes, thinking if they throw enough raw data and enough mate-pair libraries at the problem it'll eventually make up for the fact that Illumina reads are so short. Illumina data is great for a lot of things. Calling SNPs, measuring gene expression, studying methylation patterns.
But, at least for any genome significant transposon content, it simply does not work.
Re: (Score:2)
Re: (Score:2)
you need to get about 300x coverage just for a bacterial genome
OUCH. Wasn't the original high-quality human genome sequence (using Sanger technology) only about 10x? And doesn't having only 20bp per read basically rule out de novo sequencing of any eukaryote? Even for bacteria that sounds tricky without a closely-related reference sequence.
Re: (Score:2)
Re: (Score:2)
The problem with next generation sequencing is that it produces a lot of garbage as well. There is no free lunch. And that is why a lot is passed on computers to handle that garbage. Also, computation speed hours per genome annotation does not make sense without reference to what exactly and at what reliability is being annotated,
c'mon intel (Score:2)
Re: (Score:2)
The Answer Lies in Parallel Computation (Score:2)
Genomic analysis involves extensive use of recursive techniques, which are well suited to parallel processing and combinatoric problems. GPU's are small independent components originally designed to handle large matrices of pixel elements for video programming very quickly for video display and refresh. Thus, they can when suitably programmed, for example using CUDA, in parallel to compute solutions required to map problems of high combinatoric dimensionality onto a one dimensional space (sequence) very q
Re: (Score:3)
the Chinese are picking up on the technology and on genomic data mining far faster and with more intensity than is the broader US tech community.
You're forgetting that the vast majority of countries actually developing this technology, and making it available to consumers, are based in the US (and Britain, to some degree). One recent article about the BGI that I read last year noted the irony of seeing several crates of sequencing machines stamped "MADE IN THE USA" waiting to be unloaded in Shenzhen. The
What if they pull an Apple? (Score:2)
Except that genomics has as of yet proven minimally useful for drug development. Until they actually develop significant amounts of homegrown technology (which, to be fair, they are actually doing in the bioinformatics arena, as opposed to sequencing), I'm not convinced that they're that much of a threat.
What if they simply avoid competing by patenting the sequence for Caucasians and then pulling an Apple and suing us out of existence? ;^)
Re: (Score:2)
Sounds a lot like wishful thinking to me. I agree that virtually all technology is global these days. Its just the rate of uptake that is astounding. About 35% of all US PhD students are Chinese an other 35% are Indian, while US grads are diminishing as a percentage. Major cutbacks in the UK now as well, but China is growing in double digits in most technology areas. I don't read Chinese myself, but the number of journals in the genomics area for Chinese readers is growing fast.
Beg, borrow, steal, coll
Re: (Score:2)
In any event, the throughput for NVIDIA's Tesla product lines are quite impressive. They really are revolutionizing computational biology, where there are many NP complete and NP Hard problems that can only be tackled with very past processors (in parallel) and with heuristic rather than exact algorithms. Do you know if these are manufactured here or in Asia?
I don't know where they're manufactured; my impression was that most of the really powerful chip-fabrication technology was still essentially based in
Re: (Score:2)
Like much of science, we often only see the small pieces that we are most familiar with. Biology is not rocket science, the underlying mathematics is much more difficult than rocket science. A small combinatoric problem in a biological context can contain a larger solution space than all the electrons in the known universe.
I'm at the other end of Biology myself, Systematics (at the intersection of Taxonomy, Morphology and Machine Vision). The bridge between the two is where the action is in terms of func
Re: (Score:2)
First question: Why do you want it from Intel, versus anybody else? They've always struck me as moderately evil - the Microsoft of the chip world, looking out for their own sales numbers and not much else.
Second question: Which do you want in your chip; a fast CPU that can run your web browser and E-mail client, or a fast parallel computing unit that's good for gene sequencing, multimedia processing, etc? You can't have both. Well, you can, but both parts will be slower. The top-of-the-line chips you get th
Re: (Score:2)
There is: AVX. The difference is that to cope with the workloads GPUs are NOT good at, a lot of the CPU transistors are dedicated to things other than AVX units and registers so the peak is lower.
SIMD for the Win! (Score:1)
I've been Folding for years on GPUs (Score:2)
A nice list of distro computing projects [wikipedia.org].
Another nice list of such projects [distribute...uting.info].
Re: (Score:2)
So the visiting politician asks, "What are GPUs?" (Score:2)
"GPUs . . . ? . . . I was informed that this project was powered by GNUs . . . ?"
". . . now where is that Apple MAC chip that generates the GPL number that allows the PC to connect to the Internet . . . ?"
Show Me the Monkey (Score:1)
Re: (Score:2)
That seems to be the general attitude across the board in the US, but it seems unlikely to be warranted any more. They keep growing their economy at between 8-10% per year. Estimates are their GDP will overtake that of the US in about 2025, if not sooner. The days of resting on laurels will have been gone by then. In any event if we are that far ahead, it seems hard to get a sense of that on slashdot judging from the sophistication of most comments.
Besides, I be curious to know what specific research ha
Re: (Score:2)
Besides, I be curious to know what specific research has been falsified?
Here's a decent summary of the problem:
http://www.nature.com/news/2010/100112/full/463142a.html [nature.com]
Its not as if the US fossil fuels industry hasn't been doing the same here with respect to climate science.
"Microsoft has put out some faulty software, so I'm gonna buy my next operating system from VaporWare Inc"
They keep growing their economy at between 8-10% per year
That's part of the problem:
A new study from Wuhan University, for instance, estimates that the market for dubious science-publishing activities, such as ghostwriting papers on nonexistent research, was of the order of 1 billion renminbi (US$150 million) in 2009 - five times the amount in 2007.
Now where are the HTX slots / HTX GPU cards (Score:2)
Just thing how cool it will be to have cards that can do this on the CPU BUS.
For the curious... (Score:5, Funny)
Even faster (Score:1)
Re:first (Score:5, Insightful)
So, a site dedicated to nerds needs to explain what a GPU is? Are we not nerds anymore?
Re: (Score:2)
(Hint: the summary is a direct quote from the enterprise-y TFA [wired.com].)
Re: (Score:2)
"The world's largest genome sequencing center once needed four days to analyze data describing a human genome. Now it needs just six hours. The trick is servers built with...GPUs — a term coined by chip giant Nvidia. This fall, BGI — a mega lab headquartered in Shenzhen, China — switched to servers that use GPUs built by Nvidia, and this slashed its genome analysis time by more than an order of magnitude."
Re: (Score:1)
GPUs are nowhere near new. What's relatively new is GPGPUs (General-Purpose computation on Graphics Processing Units).
Re: (Score:2)
First, I didn't read the AC's comment. Then I thought the FP in question was pretty relevant to the quality of the summary. Not a troll in my view and I explained why.
Of course, it depends your point of view I guess. My post was probably badly worded.
Re: (Score:1)
Only if you have a Beowulf cluster of them!
Re: (Score:1)
Re: (Score:2)
Commodore didn't. The C64 had the "VIC-II" (video interface chip II). The Amiga had Agnus and Denise and later the AGA.
I'm don't think Atari did either. Sure they had a Blitter chips and there was graphics accelerators and so on. But I don't think the term "GPU" was used.
I don't recall anyone arguing with it at the time either: http://www.tgdaily.com/hardware-brief/18947-nvidia-launches-worlds-first-gpu [tgdaily.com].
Still PU was common enough already.
Re: (Score:3)
A search of Usenet reveals the Atari Jaguar had a unit called a "GPU" in 1993, considerably before NVIDIA's "first GPU" in 1999. The Amiga unit was also called a GPU.
The term's generic, and NVIDIA knows it... they don't have it registered as a trademark.