Open-Source Biology
nicholast writes with this "article describing the growing use of open-source collaboration methods in biology. The subtitle and main question is: Can a band of biologists who share data freely out-innovate the corporate researchers who hoard it?"
Sources of funding (Score:3, Insightful)
Personally, I think that corporate-funded science is a must... though I'm glad to see that some scientists are freely sharing information. The best way to do science is to combine both (which will never happen) and show corporations that they can get rich even when they share their data.
Re:Sources of funding (Score:3, Insightful)
Re:Sources of funding (Score:2, Insightful)
Corporate lackeys will win every time (Score:2, Insightful)
Re:Corporate lackeys will win every time (Score:2, Insightful)
Re:Corporate lackeys will win every time (Score:1)
Open Source Biology isn't limited to biogenetics (Score:5, Interesting)
Re:Open Source Biology isn't limited to biogenetic (Score:3, Informative)
At least some of the bioinformatics stuff (e.g. bioperl) includes some phylogeny stuff. Or so I was told by someone on #bioinformatics on OPN.
Re:Open Source Biology isn't limited to biogenetic (Score:2)
Answer (Score:4, Funny)
Answer: No, certainly not. That will never work. Why do you ask such silly questions?
Sincerely,
NiftyNews
Automated Response Unit for Data-Hoarding Corporate Researchers, Inc.
Answer, AND REASON [for gene stuff] (Score:1)
Corporations are patenting human genes as they find them, without even knowing what they do. Hence, any gene-related answers found by "open source" methods of research are pretty much screwed if there's a related patent. Just take a look at this [theregister.co.uk] for an example of how that method of profiteering already works, albeit for M$ vs OpenGL.
You know, I'm pretty sure I heard an American (!!!) president say something to the tune of "The human genome belongs to the human race!"... Hmmm, never mind.
Ali
The subtitle is misleading corps out market not -- (Score:2)
The purpose of a corporation is to create wealth for the shareholders. Wealth is created in markets. Corporations produce marketing first and foremost. People commonly confuse marketing with innovation. Just because a label says "New and Improved" or "Upgrade" doesn't represent a genuine innovation in the academic sense. To the consumer it is often enough to produce a sale and so it's a valuable corporate tool. But confusing this image of innovation with genuine innovation as the term is used in universities is foolish in the extreme.
Moreover, the patent strategy of major corporations, particularly since the formation of the pro-monopoly Court of Appeals for the Federal Circuit in 1982, has been to stifle innovation by staking a claim around certain financially rewarding intellectual avenues and flooding those areas with patents to be used as offensive weapons. This is not innovation by any means, although it involves applying for many patents, which can be used to support the false argument that such corporations are innovators when they are, in fact, guided by financial and legal experts rather than technicians.
Almost all innovation takes place in schools and not by professors, but by their students. It has been this way all along and people who don't understand this can be forgiven because the history of education is a boring subject for many of today's youth. However, it is not mysterious by any means and the subtitle of the paper suggests the authors are uninformed at best.
A wicked new excuse (Score:2)
I wish we studied using open-source methods (Score:3, Funny)
(drumroll please)
female anatomy.
Re:I wish we studied using open-source methods (Score:2)
If you meant that you wanted your high school introductory biology class to cut open live women so that you could look at a fallopian tube, you are all kinds of sick.
If you wanted to stare in non-comprehension at a naked female in class, you would have been better off with a figure studies class. Those are art classes, FYI.
Re:I wish we studied using open-source methods (Score:1)
When are they going to make Dinosaurs? (Score:3, Funny)
What I really want is for greedy out the ass corporations to start cloning dinosaurs and creating new kinds of pets like squirrels with four asses.
That and some kind of new food that's blue. Ever notice there isn't any naturally occurring blue food? Don't say Blueberries cause the darned things are really just purple.
Sure things didn't work out that time.... (Score:1)
Mmm...blue food (Score:1)
While I'd love to see more blue foods, I'd be content with a personal blue slushee machine. Ah, to be back at my alma mater [hmc.edu], with the blue slushee machine in the dining hall...
Some of my favorite recipes:
-Greg "C'mon, try it! It's really good!" Mulert
Re:When are they going to make Dinosaurs? (Score:3, Funny)
Possibly...How cold? (Score:1)
If it was rendered cold and thus turned blue naturally, then you are still in the game. However, if it had to become so cold that it was rendered inedible in order to get the color, then it would be hard to consider it food. Artificially making it cold enough to turn blue would put it in the same category as a multitude of man-made blue foods that are all better tasting and more palatable than cold raw chicken.
It could be chicken-sickle on a stick I guess.
Re:Blue killer lobsters! (Score:1, Funny)
Anyway, I'd been thinking one of you'd mention the new blue M&Ms (hey, around me these occur naturally!)
JMR
Re:When are they going to make Dinosaurs? (Score:1)
These blueberries look pretty blue. Blue berries [ismennt.is]
Scientific Journals Obsolete (Score:1)
Re:Scientific Journals Obsolete (Score:1)
PubMed [nih.gov]
Re:Scientific Journals Obsolete (Score:1)
Along those lines, I suspect that an equally important factor preventing "open source biology" will be the policy of scientific journals to not publish anything that has been "published" previously. That is, if you contribute your data to an open source database, you may not then be able to publish it.
Re:Scientific Journals Obsolete (Score:2, Informative)
There is also OVID, which is an online database of journals available at most universities. Not completely open source, but for all practical purposes (at least from the perspective of the scientist), it is "open source".
"policy of scientific journals to not publish anything that has been "published" previously"
Science literally changes hourly. There are things I thought were true on Monday that I know are patently untrue today (seriously, specific things). There's no point in writing them down until you reach a reasonable degree of certainty. Publication is the last step before it leaves your hands entirely; it finalizes what you have to say.
Presentation at conferences, retreats and workshops; poster sessions; informal review (passing your manuscript around to all of your buddies before publication); and the all-important coffee room are what comprise the "open source" community of science for works in progress. Sharing prior to publication is like sharing prior to ever trying to compile your code. You'd look like an idiot.
Ron is your answer (Score:1)
Ron Jeremy doesn't hoard his biology and he's as successful as they come... Ohhh, scientists... well, Ron was a school teacher before he became a star; maybe he was a biology teacher.
A boon for the do-it-yourself types! (Score:2)
Who says you won't be able to have your very own clone of yourself someday? Now if we could just get some of the other fields to go this route, we could make our own antibiotics at home, have a horde of clone slaves, and power our houses with cold fusion reactors.....
Of course the biologists will win... (Score:2, Interesting)
I work in the power industry. While we do not necessarily share our source code outright (well, no one has asked), we share data like crazy. We don't hoard it because it wouldn't make sense. Our matrices and equations and system solving methods HAVE to be shared with our clients so they can ensure that we're actually giving them something that does what they think it does. Does this mean that other people are going around doing what we do? Not really, because we do it better.
If you are really good at something, other people aren't going to bother trying to do what you do because it's not worth the trouble. It's just like the way I paid for someone to move my piano today. I could have gotten a bunch of friends together to do it and rented a dolly and truck, but that would have taken way too much effort.
Convenience, much like advancements in science, is going to benefit everyone. Someday, everyone who doesn't add value to society in some way is going to operate on the fringe (well, they already do to some extent), and all information will be shared for the greater good of all. If we collaborate with other industry leaders to come up with new features for our software or to help our clients do things better, it will always benefit us in the long run. The same will be true for the biologists, and eventually corporations will get a clue as well.
It depends... (Score:2)
Free Flow of information? (Score:2, Interesting)
One reason why this won't work is that researchers depend on their discoveries in order to write papers and grants, which gives them more money to make more discoveries. If a small lab of a few people were to share all their information, and then right before completion a huge corporation came in and took all the credit, that little lab would be screwed.
Another problem is that researchers can go months, even years, on wrong information and theories. If these were published, yes, there's a possibility they could be discounted, but they could also be perpetuated, with lots of wrong data all over the place.
Finally, there is a lot of information being shared within the biomedical industry. Many experiments are based on those found in published papers, and there are guest speakers bi-weekly telling people about their successful research. We can take that information and test it, or we can tell them we have found that the method doesn't work.
I believe that adding an extra layer of communication would further cloud things.
Re:Free Flow of information? (Score:2, Insightful)
Another problem is that researchers can go months, even years, on wrong information and theories. If these were published, yes, there's a possibility they could be discounted, but they could also be perpetuated, with lots of wrong data all over the place.
How is this different from any other science? I mean, in physics, there are lots of papers out there that will eventually be shown to be wrong. That's how science is supposed to work.
It's a shame that biology has become so profitable. Hoarding data and discoveries is not how science advances. The histories of chemistry and physics are ample illustrations of that fact.
Re:Free Flow of information? (Score:1)
E
Re:Free Flow of information? (Score:3, Informative)
Incidentally, I can't recommend Ensembl [ensembl.org] highly enough. Not only have I been able to significantly further my research with their tools, but they have open-sourced the entire code behind their site! And the documentation is even in Wiki [wiki.org]! I really think what they have done is incredible and should be one of the first projects anyone mentions when expounding the virtues of open-source software as well as sharing information in the field of Biology.
-Ryan
an example, and some experiences. (Score:5, Informative)
1. Working in groups can be very difficult... i.e., when people don't share the same priorities, or see the same sources of bias as important.
2. It can be very helpful... often times getting other people's perspective is very informative. Generally in science we get feedback at the end (publication review), but here it happens at all stages, including data collection. This is really good.
3. People tend to start off thinking that they need to protect and hide data, but once they start to share data they tend to become big fans of sharing data.
4. Data transparency is essential to good science, and these types of projects make that more and more possible. It does not take people long to realize how useful it is to have open and easily accessible data.
5. It is very important to open code used in analyses. I am in the process of working on a couple of papers where we have written some code to perform some fairly complex calculations. While I would like to say I am a great programmer, reality has a way of intruding. Collaboration has vastly improved the code, and I fully intend to post the code when I am finished with it. (For fear of being slashdotted I will not post the URL here.)
Innovative Technologies (Score:1)
Out-compete - not really relevant (Score:3, Interesting)
Competition is sometimes relevant, but not in general. The scientist seeks to further the knowledge base, to reinforce his hypotheses, and their sub-hypotheses, etc. The corporation wants to make money.
Now, the scientist, in his quest to further knowledge, has no responsibility to avoid the intellectual property of others. If something is copyrighted, he cites it. If something is patented, he uses it anyway for research, with no necessity to pay to use the patent (unless, of course, there is no other way to get the invention).
The corporation, in its quest to make more money, need not even establish that something will work before it can establish IP. Knowledge is not directly relevant - they only need to set up tolls on the highway to commerce in the form of patents and copyright.
Sometimes, conflict exists. Celera is patenting genes, but only if they find them before NIH (which makes their database public). But in general the goals are different. Patents can come from the work of scientists without interfering in the future work of the scientist (it is important to avoid conflicts of interest, usually by the scientist having no control or material interest in patent licensing, though this is often not the case), and corporations establish their tolls without even paying attention to knowledge.
This has already been proven... (Score:3, Insightful)
Unless this isn't 'biology', the Human Genome Project has already succeeded in beating out a private commercial project with the human genome map.
Re:This has already been proven... (Score:1)
Unless this isn't 'biology', the Human Genome Project has already succeeded in beating out a private commercial project with the human genome map.
This is not true for a number of reasons:
The company in this case, should get the credit they deserve for the innovative methods and their application to this enormous problem to get the first completed, high quality human genome version in such a short period of time.
Re:This has already been proven... (Score:2)
And they definitely should get the credit for doing all their own sequencing and not using any of the GenBank data...except, well, they didn't.
The "genome race" was never a race, because you can't have a race where one side can never win (Celera could use the Human Genome Project's data as well as their own) but also could not lose (since the real goal was to guarantee a freely available version of the genome).
Re:This has already been proven... (Score:2)
And was based on the publicly available version.
"the quality (and therefore usability) of the public version is still below that of the private version."
Ditto.
"Without the private version to stress the academic guys, "
This is the "academics can never do anything" argument, and it's a fallacy. The public effort has still produced more sequence than the private.
"All I'm saying is that sometimes some good old competition, can help to accomplish research goals."
Celera did force the pace and this was no bad thing. But competition is not unique to the private sector; it's present in academia as well.
"The company in this case, should get the credit they deserve for the innovative methods"
Indeed. Celera's main innovation was their generation of the best techniques for contig assembly, and their techniques still outstrip those publicly available. Of course most of the work for this was done in the public sector, so the public/private argument is somewhat fallacious. But producing data which is not freely accessible is of limited value. Nature is complex enough without adding lawyers into the mix.
Phil
New Sourceforge Project (Score:2)
The only thing that worries me is the six day schedule to ship...
Re:New Sourceforge Project (Score:1, Offtopic)
I'm announcing a new project, ADAM 2.0.
I'm sorry, but as the father to one Adam Lewis Remy, I'm afraid you are infringing on my trademark.
You have 30 days to alter the name to something that will not be confusing to any person in the human race, or I will sic my battalion of trained attack weas^H^H^H^Hlawyers on you.
Please don't hurt me... (Score:1)
Keywords (Score:2, Interesting)
"And that's what I think about Australian jumping elephants... oh and:
Linux, Open Source, and Microsoft Sucks"
"Open Source" science was practiced long before "Open Source" programming. In fact, it was the rule, rather than the exception until just recently. Bonding science to the corporate marketplace, while extremely profitable, also leads to a great deal of "Re-inventing the wheel". Hopefully this is the beginning of a trend back toward a detached scientific community.
Open Source Biology? (Score:1, Offtopic)
I await the day... (Score:2, Insightful)
I think there is a sort of un-uttered agreement that the journal review process exists, and that data is kept tightly protected, because the establishment ensures proper quality of published output. In days gone by, perhaps that was a feasible approach, but with the advent of open source, and the thousands of developer forums throughout the web, I believe that finally there is a tangible example that argues for a complete overhaul in approach.
Despite the massive size and wide distribution of the community, there is still some, nay much, order to open source. There is some sort of consensus on the best distro, or the best app for this or that. Reliable, secure and stable software abounds in open source; the bolts of excellent software are clear despite the storm. Open source has turned arguments for the Cathedral, or a stuffy establishment that upholds integrity, on their head. The Bazaar works. People can be rewarded for the value of their output, and not their ability to hoard.
In terms of a long term world view, I believe open information is the future in all areas of human endeavour. How does a company create value by selling the same data, the same idea, a billion times? Let them be fairly recompensed for their effort in gathering the data, or putting the idea into a serviceable form, and leave it at that.
Open source has demonstrated that the Bazaar is able to sort the elite from the mundane, and what's more, the volume and value of the output would not have been possible within a single closed establishment.
Oh, I wish, I wish, I wish that data and information were open. I wish that governments would legislate against data and idea hoarding; I believe such intervention passes the test of expediency. All hoarding does is hold back the ability of so many people to produce so much valuable output. This is the future for all important areas of human endeavour if we ever want to make more of our precious progress before the earth is consumed by the sun.
I have much more to rant about, but you've probably stopped reading by now anyhow.
Re:I await the day... (Score:1)
In economics, the scientific standard of replicability of results has made people more willing to share their data for that purpose. (Happened much later in econ than in natural and physical sciences) But it is a long way from open source.
So how do we pay for it?
Re:I await the day... (Score:1)
Replicability of results is one issue. But this is really a technical issue; the real questions of interest are: how do your findings fare out of sample, and how does the exact same data look under different analytical methodologies?
Yes, you are correct, some journals and other institutions do endeavour to make data publicly available, but only some (Review of Financial Studies is the only journal I can recall at the moment).
How do we pay for it? I don't think people would have a problem with providing funding for efforts to make data open source, but US$25,000 per year for one database is just way too much for some universities.
Pretty ignorant article (Score:4, Interesting)
1) The idea of sharing information within and beyond the scientific community is hardly a new one. On the contrary, if anything secrecy has increased in recent decades, partly because of industrial applications but mostly because science is much bigger and more cutthroat than when it was a wealthy gentleman's pursuit.
The sort of collaboration described here is new and is driven by the Internet and by the large data sets in current biology. But to say that scientists got the idea to share information from Linus Torvalds is idiotic. That's not even where computer scientists got the idea.
2) Claiming that patents encourage "hoarding" of information is the sort of thing you expect to see in +1 Slashdot comments. The whole point of patents is to _encourage_ the sharing of information instead of relying on secrecy. Inventors trade disclosure for a temporary monopoly.
Again, there's a germ of truth here, in that companies after a patentable discovery need to be closed-mouthed about their progress, but that's probably giving the author too much credit.
Re:Pretty ignorant article (Score:1)
1) In the absence of patents, companies wouldn't reveal anything about the compound or the pathway it affects. The purpose of patents is to allow inventors to at least release information after they've filed.
2) Publishing a paper on gene Y is the way science has worked for over a thousand years. Maybe financial pressures have reduced that (although the article greatly exaggerates how much work is done by industry or by patent-minded academic researchers) but it's silly to suggest that the idea of sharing information came from Linux development.
Actually, it just occurred to me that the article suffers from Eric Raymond Syndrome. It treats openness and collaborative work as the same thing, which they're absolutely not. If it had argued that collaborative research is becoming exponentially more common, it would be absolutely correct.
Re:Pretty ignorant article (Score:1)
Re:Pretty ignorant article (Score:1)
Regarding your first point, of course it's true that science had an ethic of sharing before software. In the first draft, there were several paragraphs explaining the role in distributing information that scientists such as Nicolas Peiresc played in the 17th century and noting that RMS was originally inspired by scientific ethics. See the middle of this for his clearest explanation: http://www.gnu.org/philosophy/shouldbefree.html.
I cut that out because it seemed obvious, the Einstein quote gives some context, and it's really important and challenging not to bore people when writing about science in a political magazine. Maybe that was an editing mistake. But I certainly don't disagree with you or think that Linus influenced Einstein.
WRT your second point, as you surely know, patents have a mixed impact on sharing. They do allow more information to get into the public domain, but they also allow for hoarding and blocking other research (see the Costa Rican rice example). I certainly don't think that they are entirely nefarious. My contention is simply that the overall trend is clearly going in the wrong direction (the lack of sharing among geneticists is the clearest evidence) and that overuse of patents, particularly upstream patents, makes the problem worse.
Re:Pretty ignorant response (Score:3, Insightful)
I agree with you that the point of patents is to promote disclosure in return for a limited monopoly. The problem is that because of the running time of the monopoly (about 10-15 yrs for a drug, because the first 5-10 years deals with the approval process), the patents will be left until the last moment.
The other issue is the non end-use related patents. Large companies can swap licenses on this but smaller companies and the third-world need to carefully avoid the intermediate steps that have been patented.
Again you are right that the Open Source movement is a bit of a Johnnie-Come-Lately as regards disclosure, but the use of copyleft is something that has come from the computer field and has given vast leverage to developers. You are right about the purpose of patents, but essentially they have become a way of sowing a legal minefield in a competitor's path.
Re:libertarian atheist reply (Score:1)
The EULA for genetic material (Score:2)
1. GRANT OF LICENCE. $PERSON1, along with the cooperation of $DEITY (hereafter known as the GRANTOR) grants you, $PERSON2 (hereafter known as the GRANTEE) a non-exclusive, non-transferable licence to the GRANTOR's genetic material. GRANTEE accepts that she cannot transfer the EULA to anyone without GRANTOR's consent.
2. RESTRICTIONS. The GRANTOR 0wnz j00. GRANTOR can change the licence at any time and can withhold all genetic material at any time without the GRANTEE's consent. The GRANTOR has all rights. The GRANTEE has none. Amen.
3. LIMIT OF LIABILITY. THE GRANTOR TAKES NO GODDAMN RESPONSIBILITY FOR ANYTHING. THE GRANTOR IS ALWAYS RIGHT WITH EVERYTHING AND ANYTHING. SHOULD YOU DISAGREE GO FUCK YOURSELF. IF THERE IS ANYTHING YOU WILL GET, IT IS THE BURDEN OF RAISING ANY DERIVATIVE OF THE GRANTOR'S AND GRANTEE'S GENETIC MATERIAL. OFFER VOID FOR EVIL DOERS AND ANYONE GRANTOR THINKS IS UNACCEPTABLE.
Do you agree to the licence terms? [I have no choice] [Screw you, I actually read the EULA!]
EZ metrics indeed. (Score:2)
Huh. Man, you must be one of these fucking geniuses.
Linux for Biotechnology - open source collection (Score:1)
Cart before horse? (Score:2)
Cheers,
IT
Is it really that new? (Score:1)
Open source bioinformatics tools (Score:2, Informative)
bioinformatics.org [bioinformatics.org]
bioperl.org [bioperl.org]
biojava.org [biojava.org]
and even www.cvbig.org [cvbig.org] for a talk on bioinformatics with PHP/Ming
It doesn't matter who "wins" (Score:2, Interesting)
The parallels to open source programming run rather deep. There are many advantages to open source software (or open data science). However, there will always be room for people to work on proprietary projects. Micro$oft currently co-exists with open source developers. There is room for both. Sure, they don't always get along, but that's another story.
The way I see it, every scientist builds upon the research of others. That's just how science works. The open sharing of data only serves to speed up that process. Ultimately, every scientific discovery will be "borrowed" by some other scientist down the line. So what if a company takes some of that data and uses its additional monetary resources to turn the data into a pill that can cure a disease? The point is, science and mankind benefit in the long run.
There is another side to this as well. Corporate research tends to be very focused on solving a specific problem or answering a specific question. While this type of science is very valuable and necessary, there is another type of science: the kind where researchers study a phenomenon just to find out what's going on and how things work. Often this type of "basic research" uncovers answers to problems that previously seemed unrelated. It appears that open data sharing would help to promote this kind of "basic research".
Open Source (Score:2)
Well, it might be Open Source, but it certainly doesn't seem to be accessible... Did a web search and was unable to find any website or other access to any information.
Lando
Scientists (Score:1)
How about open-sourced businesses (Score:2)
Also -- this one's nothing new -- how about transparency in ICANN?
Ok, back to biology. Open Sourced Biology is nothing new: it's as old as science itself. Back in the good old days, we didn't call it that because it was just assumed. It was assumed that when something was discovered it would be shared with the world. Watson & Crick & Franklin made their findings available to the public for free upon the presumption that that information would be used to generate more information which would be freely dispersed, ad infinitum. Of course, now the scientific community is moving towards a proprietary model -- the dark side of the force, so to speak. But don't be fooled. The overwhelming majority of science is still transparent and "open sourced".
E-mail any professor at a university about the method they use for a particular protocol, or a finding they've found, and they'll more likely than not respond helpfully if they have time. Try e-mailing Celera on their protocols and see what you get.
You can see the parallels between the science world and the software world. Both started out completely open. Both migrated towards the proprietary closed model (in the case of software, almost completely). Now, both are seeing backlash movements evangelizing for transparency, open source, free software/biology, etc.
open-source science: funding/promotion/timetables (Score:1)
For example, it is common in my field (oceanography) that funding agencies require that data be shared openly, but only after the original investigator has had a certain amount of time to work with the data and to publish findings.
It makes sense to give the data originator "first dibs" on the data because
However, it's often the case that there is information in the data that the originator had not thought of, or that becomes clear only by integrating the data with the results of other investigations. That's why open sharing is crucial. Indeed, we would know very little about the state of the ocean, and of the climate system, without open sharing according to established "publish, then share" principles.
PS: throughout the above, please take "data" to mean either the results of measurement or the results of calculation.
Dan Kelley, Dalhousie University, Halifax, NS, Canada.
Flamebait, granted. (Score:2, Interesting)
Research is hard
Experiments aren't cheap
Many excellent biologists will hoard their work out of enlightened self interest. There will of course be individuals who see farther than that.
Some of those embracing the open source idea in this area (and anywhere an approach like that is used) will be riding coat tails and merely opportunistic. There will of course be a greater number of these men and women who are genuine in their desire to collaborate regardless of their abilities.
The real opportunity for something exceptional to come of this is when someone who IS better than his peers at what he does shares what he has with those other biologists.
Then you have a chance for one or more of those people to see that information from a different perspective, and that's when the benefits are going to be obvious. It's a more optimistic take, but I think in this case it's also more realistic.
"hoarding" research (Score:2)
One of the nice points of the current scientific system is that there's no (academic) reward for hiding your results or failing to disclose some special secret technique you invented. No results = no publication. No publication = no reputation. No reputation = no grant funding and no tenure, or at least inglorious obscurity if you're already tenured.
Of course, once money is involved, there's a tendency to want to keep everything under wraps until it's patentable. And to be sure, often labs will withhold their data from competitors until it's been submitted to a journal. But that's only fair, and even competitors will usually cooperate on some level in "academic" science. I remember a friend saying how her lab and another lab agreed to publish their very similar papers in the same issue of a journal, so neither one was "scooped". As a result, they reinforced each others' credibility instead of fighting over it.