Follow Slashdot stories on Twitter

 



Forgot your password?
typodupeerror
×
Biotech Science Technology

Grid Computes 420 Years Worth of Data in 4 Months 166

Da Massive writes with a ComputerWorld article about a grid computing approach to the malaria disease. By running the problem across 5,000 computer for a total of four months, the WISDOM project analyzed some 80,000 drug compounds every hour. The search for new drug compounds is normally a time-intensive process, but the grid approach did the work of 420 years of computation in just 16 weeks. Individuals in over 25 countries participated. " All computers ran open source grid software, gLite, which allowed them to access central grid storage elements which were installed on Linux machines located in several countries worldwide. Besides being collected and saved in storage elements, data was also analyzed separately with meaningful results stored in a relational database. The database was installed on a separate Linux machine, to allow scientists to more easily analyze and select useful compounds." Are there any other 'big picture' problems out there you think would benefit from the grid approach?
This discussion has been archived. No new comments can be posted.

Grid Computes 420 Years Worth of Data in 4 Months

Comments Filter:
  • Re:Malaria? (Score:5, Informative)

    by Soporific ( 595477 ) on Saturday February 17, 2007 @01:16AM (#18048234)
    According to:

    http://archive.idrc.ca/books/reports/1996/01-07e.h tml [archive.idrc.ca]

    Malaria kills quite a few people every year so I don't think it's a waste.

    ~S
  • by Raul654 ( 453029 ) on Saturday February 17, 2007 @01:55AM (#18048454) Homepage
    (Preface - My research group specializes in parallel computing) There are classes of problems so computationally intensive that the computers that can do them in a reasonable amount of time won't be invented for decades. Almost all of these are simulations of physical reactions (invitro drug simulation, climate simulation, biomolecular engineering sims, physics sims, 'etc). As a general rule, these problems scale weakly (meaning that as you add more computers, you can simulate more datapoints, and get more accurate results). If memory serves, the hardest problem I can recall involved hydrogen fusion simulations, requiring computers 10-1000 times faster than the best in the world today.

  • by nick_davison ( 217681 ) on Saturday February 17, 2007 @02:45AM (#18048662)
    Worse...

    It's over 4 months, not a fraction of a second.

    If I have a task that takes 100 seconds to run and I want it completed in under a second, scalability becomes a challenge... I have to figure out how to break it in to at least 100 distinct parts and deal with all of the communication lags associated. To have any kind of fault tolerance, I probably want to break it in to at least 1,000 tasks so that if one processor is running fast, it can get fed more and if one processor corrupts its process, I don't find out right at the end of the second, with no room to compensate, that I have to run re-run that full second's worth of processing elsewhere to make up for it. That's where the challenge comes in.

    If I have a task that takes 100 seconds to run and all I'm trying to do is run it a lot of times over a period of time that's many times greater, I can run it 864 times a day per system with absolutely no scalability issues whatsoever and simply send the relatively small complete result sets back. With 100 systems, if each one can run a distinct task from start to finish, I'd be expecting pretty much dead on 100 times the total number crunching as there are absolutely no issues with task division, synchronization or network lag.

    In this case, they ran 5,000 computers over 4 months. Assuming a single task is solvable in under 4 months by a single system, they should have had no difficult task division problems to solve, absolutely minimal synchronization issues and next to no lag issues to address. In short, even a pretty inefficient programmer should be able to approach 1:1 scalability in that easy of a scenario.

    Efficiency of algorithms is a challenge when you want a single result fast. When you want many results and are prepared to wait so long as you're getting very many of them, that's an incredibly easy distributed computing problem.
  • by Crazy Gilmore ( 1065234 ) on Saturday February 17, 2007 @02:52AM (#18048688)
    Much of this discussion is totally misdirected because the writers are confusing a distributed computing project like SETI or BOINC - http://en.wikipedia.org/wiki/BOINC_client-server_t echnology [wikipedia.org] - with a grid system - http://en.wikipedia.org/wiki/Grid_computing [wikipedia.org]. They are completely different things.
  • Re:Malaria? (Score:2, Informative)

    by iminplaya ( 723125 ) on Saturday February 17, 2007 @04:03AM (#18048966) Journal
    After a little poking around, I found what looks like the stuff [realgreenlawns.com] they put on the nets. It was under their noses all along. And I would venture to think that the they (the Africans) have known about it for a very long time. I'm too lazy to find out how well it controls these guys [ox.ac.uk], something even more neglected in mass media. Nature triumphs the computer again. Okay, now I'm drifting off topic, but it at least I did it seamlessly and gracefully :-)
  • Re:Malaria? (Score:3, Informative)

    by God'sDuck ( 837829 ) on Saturday February 17, 2007 @05:38AM (#18049396)
    quinine cures malaria strains not yet resistant to quinine.
  • Re:Wikipedia? (Score:2, Informative)

    by elchuppa ( 602031 ) on Saturday February 17, 2007 @06:00AM (#18049508)
    Well this is an excellent question. Actually Van Jacobson is on google video [google.com] with a presentation on this precise pet peeve of yours. The main concern I have with the idea, at least with how Van Jacobson presents it is that with information addressed by content rather than location, it's slightly more challenging to locate it. At least with the IP system you can route closer towards your destination at each hop up and then down... But data without an authoritative source is basically lost. If you don't have it, you don't really have any reason to inquire about it with any one node over any other. There is a space for peer to peer data systems, and he does have a point over those live media feeds getting saturated. The truth is that all data should be potentially torrented. That's why bittorrent may be one of the most fascinating and potentially effective inventions in the modern(internet) software era (last 10 years). Bugger. so I don't have much constructive to say what with my current state of mind, except that most of the other replies are rudely and stupidly dismissive of the idea. It both resonates and feels like the future, but it's not a trivial problem. Actually it most certainly is... it's just a matter of stating it so that it is trivial.
  • by p3d0 ( 42270 ) * on Saturday February 17, 2007 @11:08AM (#18051166)
    There have been lots of responses already, but I would like to add another...

    There seems to be a widespread fallacy that all human resources should be applied to the One Biggest Problem facing humanity at any given moment. Overlooking for a moment the obvious problems inherent in trying to choose the One Biggest Problem, and assuming we could actually rank all human problems in a well-defined order, there are still two huge problems with this approach:

    1. Diminishing returns. Putting twice as many people on a problem doesn't solve it twice as quickly. The extra people could well be more productive working on a separate problem. This is the well-known fallacy of the Mythical Man-Month.

    2. Misplaced priorities. The majority of people in the world do not have cancer. If all the resources of humanity were spent on cancer, where would that leave the rest of us that don't have cancer? "Sorry, we've stopped making antibiotics, insulin, toothpaste, books, and clothing so we can focus on fighting cancer."

    In addition, there's an implicit assumption in the parent poster's position that the researchers who are looking for a cure for malaria have been wasting their time. I'd like to ask, what has *he* been doing during this time? I hope he has been looking for a cancer cure, or else he's nothing but a hypocrite.

You knew the job was dangerous when you took it, Fred. -- Superchicken

Working...