Follow Slashdot stories on Twitter

 



Forgot your password?
typodupeerror
×
Science

Computers Uncover 100,000 Novel Viruses in Old Genetic Data (science.org) 50

sciencehabit writes: It took just one virus to cripple the world's economy and kill millions of people; yet virologists estimate that trillions of still-unknown viruses exist, many of which might be lethal or have the potential to spark the next pandemic. Now, they have a new -- and very long -- list of possible suspects to interrogate. By sifting through unprecedented amounts of existing genomic data, scientists have uncovered more than 100,000 novel viruses, including nine coronaviruses and more than 300 related to the hepatitis Delta virus, which can cause liver failure. "It's a foundational piece of work," says J. Rodney Brister, a bioinformatician at the National Center for Biotechnology Information's National Library of Medicine who was not involved in the new study. The work expands the number of known viruses that use RNA instead of DNA for their genes by an order of magnitude. It also "demonstrates our outrageous lack of knowledge about this group of organisms," says disease ecologist Peter Daszak, president of the EcoHealth Alliance, a nonprofit research group in New York City that is raising money to launch a global survey of viruses. The work will also help launch so-called petabyte genomics -- the analyses of previously unfathomable quantities of DNA and RNA data.

That wasn't exactly what computational biologist Artem Babaian had in mind when he was in between jobs in early 2020. Instead, he was simply curious about how many coronaviruses -- aside from the virus that had just launched the COVID-19 pandemic -- could be found in sequences in existing genomic databases. So, he and independent supercomputing expert Jeff Taylor scoured cloud-based genomic data that had been deposited to a global sequence database and uploaded by the U.S. National Institutes of Health. As of now, the database contains 16 petabytes of archived sequences, which come from genetic surveys of everything from fugu fish to farm soils to the insides of human guts. (A database with a digital photo of every person in the United States would take up about the same amount of space.) The genomes of viruses infecting different organisms in these samples are also captured by sequencing, but they usually go undetected.

This discussion has been archived. No new comments can be posted.

Computers Uncover 100,000 Novel Viruses in Old Genetic Data

Comments Filter:
  • Photo size!? (Score:5, Insightful)

    by darkain ( 749283 ) on Thursday January 27, 2022 @04:37PM (#62213325) Homepage

    I know its not related to the main article, but I just wanted to call out a quick bullshit to this line, as a former semi-pro photographer.

    "A database with a digital photo of every person in the United States would take up about the same amount of space."

    If my math is serving me correctly, I'm over-estimating USA population at 350 million. And we're talking 16 petabytes of data? That would be roughly 61MB per photograph. WHAT THE HELL format are you using to require that much data? High quality/resolution RAW photos are ~30MB. Decent JPG images are ~1-3MB for the same resolution.

    You're off quite a bit on your arbitrary comparison metric there!

    • Fine. Whatever. Back to how many Volkswagen Beetles the photographs would all fit in. Is that precise enough for you?

    • by Anonymous Coward
      canon raw files (as would be used by a journalist, the kind of person who writes news stories) are about 45-50MB each. the analogy is pretty spot on. nobody gives a shit about how many 1024x1024 wavelet-compressed micro-images would take up that much space.
      • Most people are not journalists, nor do they care about RAW files.
        • Not sure your point here. RAW type formats are preferable for digital imaging (particularly if your going to be doing some deep analysis on them).

          Its not about "Most people", its about "Most algorithms".

          • I guess my point was that if you're going to make a comparison, make a comparison with something that people are familiar with. A digital photo, to most people, is not a 50 MB RAW file. More generally, if you make a stupid comparison like that and don't specify exactly what kind of photo you are basing the comparison on, it's really completely arbitrary and meaningless.
    • The rest of the data are for the HP printer driver, should someone need a hardcopy photo print.
    • by AmiMoJo ( 196126 )

      Professional journalists copy/paste photos into Word documents so that their editor can open them, hence each one is about 60MB.

    • Raw, uncompressed bitmaps of 20 megapixel images.

      Doesn't everybody use those?

  • by ffkom ( 3519199 ) on Thursday January 27, 2022 @04:41PM (#62213341)
    ... then the viruses were old, too. Should the headline rather read "100,000 previously unknown viruses now sequenced"?
    • by eric777 ( 613330 )
      *Novel* in this context doesn't mean *new* - it means *previously unreported*. The headline is accurate.
    • It would be much more fun it if was, "Scientists accidentally unleash 100,000 previously unknown viruses". You would get people screaming it's the end of the world, other's screaming it's a conspiracy and insisting on drinking their own urine to "become immune", while the people focuses on the real issue will be screaming that the plural of "virus" is "virii". ;)

    • Maybe this where Fauci got his Novel Wuhan virus from.
  • Now, they have a new -- and very long -- list of possible suspects to interrogate

    Unless deliberate steps are taking to adapt these viruses to humans, there is no danger. Just wash your hands, and cook your meat thoroughly...

    No, I'm not suggesting, COVID-19 virus was designed as a weapon or something like that. The charitable explanation [substack.com] is that it was adapted in order to study, how such an adaptation — should it ever happen naturally — can be treated.

    And then it "escaped" — without our tr

    • Now, they have a new -- and very long -- list of possible suspects to interrogate

      Unless deliberate steps are taking to adapt these viruses to humans, there is no danger. Just wash your hands, and cook your meat thoroughly...

      There is another [deniskitchen.com] step you have to take...

    • Re: (Score:3, Insightful)

      by Archtech ( 159117 )

      Yet again, a sober, well-informed comment - very likely close to the truth - gets moderated down.

      • Re: (Score:2, Informative)

        by waspleg ( 316038 )

        Yes, because it's against the CCP narrative and there are a bunch of wumaos here with sock puppet mod point accounts now that go through downvoting ALL posts from people who are critical of them.

        As a result, ~25 years of Excellent Karma and even some recent article submissions getting posted mean nothing at all - I haven't seen any mod points in months.

    • by hey! ( 33014 ) on Thursday January 27, 2022 @06:37PM (#62213733) Homepage Journal

      I know the linked PREEMPT study *sounds* nefarious -- inserting novel spike protein into SARS-COV-1 viruses -- but if SARS-COV-2 was genetically engineered chimeric SARS-COV-1 that would be obvious. The closest known relative to SARS-COV-2 has actually recently been found in Laos [nature.com], and it even has SARS-COV-2's heretofore unique receptor binding proteins.

      The problem all along with the "genetics experiment gone awry" conspiracy theory has been this: where did all the parts for this supposedly chimeric virus come from? You don't start with random viruses unknown to science; that leads your research to a universe of dead ends. At least some of the pieces you work with have to be known to science, otherwise you'd never get a result. PREEMPT may have been a terrible idea (or at least a now-controversial idea) that nobody wants to be associated with now, but it's not the origin of COVID-19.

      People are attracted to conspiracy theories about COVID origins because CTs are simple to understand and actually comforting. How did this happen? Someone *bad* made this happen. But epidemiologists have been expecting some emergent respiratory virus to go pandemic for years now. If the answer is that humanity is simply vulnerable to emerging pathogens, that's not a comforting conclusion at all.

    • by sg_oneill ( 159032 ) on Thursday January 27, 2022 @09:17PM (#62214003)

      Unless deliberate steps are taking to adapt these viruses to humans, there is no danger. Just wash your hands, and cook your meat thoroughly...

      No, I'm not suggesting, COVID-19 virus was designed as a weapon or something like that. The charitable explanation is that it was adapted in order to study, how such an adaptation â" should it ever happen naturally â" can be treated.

      And then it "escaped" â" without our trusted Science having a treatment... If we simply resolve to not do it again, we have no need to worry... We did resolve so before, but let it slide somehow...

      All of which is highly improbable.

      We can actualy rule out that it was "created in a lab". Gene splicing leaves large satelite repeat sequences as part of the process, and those dont exist on that genome.

      Further, we have no capacity to design that spike (Which, by the way is wholy novel to SARS-COV-2). We can barely work out how to predict protein folds on simple molecules, let alone an absolutely monsterous molecule like the sars-cov-2 spike. In fact the folding problem is so intractible, there are mathematians that think its unsolveable.

      So we can rule out the possibility that the spike was designed.

      Its also *highly* unlikely that some sort of "gain of function" (that favorite term of the "i didnt finish highschool biology" set) was involved here because it still cant explain such a huge molecule as the spike. Its a theory that posits scientists engaging in some sort of miraculous creation-ex-nihilo.

      Coupled with former western workers at the claimed lab saying that no, that is not what was being researched at the lab, and inspectors concluding that indeed the theory is highly unlikely, I think its safe to say that whole theory was a bust.

      • Do you really think such intellectual dishonesty is an effective way of countering conspiracy theorists? You're talking about custom designing sequences or copying them from another virus then manually splicing them in. Yes, that's essentially ruled out and a conspiracy theory. But then you pivot to using that particular method to rule out all gain of function research, when there's other methods: Serial passage doesn't involve splicing new sequences in; it doesn't involve manually editing the genome in a w
  • by Anonymous Coward

    FYI, viruses don't shut down economies. The world didn't shut down for spanish flu. The shut down part is optional. That doesn't make shut downs wrong, but the virus didn't do it.

    • Correct. Hundreds of previous epidemics caused barely a ripple in economies.

    • by pjt33 ( 739471 ) on Friday January 28, 2022 @05:13AM (#62214563)

      I think it's the word "cripple" that you should be taking issue with there. If no shutdowns had been applied anywhere in the world then the economy would still have suffered to some extent due to workers taking sick leave, some percentage of them dying, and more risk-averse people changing their behaviour (and hence their spending). I'm not an economic or health modeller, so I won't speculate as to whether the net effect on the global economy would have been more or less, but the effect described here as "crippling" was a reduction in global GDP by a whopping 2.77% according to this data [statista.com]. Contrast with the bubonic plague (not viral, I know), which caused a reduction in the European workforce of tens of percent, and even then didn't really cripple the economy so much as empower the serfs and eventually cause cultural change.

  • by WillAffleckUW ( 858324 ) on Thursday January 27, 2022 @05:34PM (#62213529) Homepage Journal

    While it is true that many segments of the human genome contain what we think of as uncoded or not useful sequences, many of which are the result of prior viral attacks, it is also true that using mRNA, miRNA, siRNA, circRNA and other sequences generated by our DNA in response to certain conditions, that we theoretically have defenses against many of them.

    They just haven't been expressed, because we don't need them.

    Just because you haven't seen a virus before does not mean our DNA lacks defenses against it.

    • by tlhIngan ( 30335 ) <slashdot@worf.ERDOSnet minus math_god> on Thursday January 27, 2022 @07:19PM (#62213809)

      The other thing is well, genetic transfers of information are a thing.

      Bacteria have been well known to send little snippets of DNA to other bacteria, or even things like viruses - the bacteria would capture a virus, then inject its ring of DNA into the virus. The next thing the virus infects gets that little ring of DNA as well.

      It works well for non-eukaryote cells (which lack a nucleus and basically have their genome in the cytoplasm), but the viral method is basically believed to be how we inherited a lot of our DNA. As we have eukaryote cells, random strands of DNA in the cell are basically destroyed because they shouldn't exist. Eukaryotes carry their DNA in the nucleus of the cell, and a copy to RNA is made in order to carry out cell processes. This is an excellent method of controlling infection.

      Of course, nature evolves and viruses evolved to be able to carry their DNA into the nucleus and take over the cell machinery in order to operate. Or viruses evolved to use RNA instead - but then had to copy the mechanism to copy the RNA along with them, because eukaryotes cannot duplicate RNA.

      The evolution of genetics at a high level is actually a fascinating subject to study

    • While it is true that many segments of the human genome contain what we think of as uncoded or not useful sequences, many of which are the result of prior viral attacks, it is also true that using mRNA, miRNA, siRNA, circRNA and other sequences generated by our DNA in response to certain conditions, that we theoretically have defenses against many of them.

      They just haven't been expressed, because we don't need them.

      Just because you haven't seen a virus before does not mean our DNA lacks defenses against it.

      Without active pressure from the viruses, though, mutations which inactivate those defenses will be evolutionarily neutral, meaning the defenses will degrade over time in the same way that the program to build working eyes degrades over time in cave fish. It'd be interesting to calculate how frequently a virus would have to show up in order to make deactivating mutations selectively negative enough to preserve the defenses in between attacks. (I guess the parallel would be calculating how often you'd have

      • Without active pressure from the viruses, though, mutations which inactivate those defenses will be evolutionarily neutral, meaning the defenses will degrade over time in the same way that the program to build working eyes degrades over time in cave fish. It'd be interesting to calculate how frequently a virus would have to show up in order to make deactivating mutations selectively negative enough to preserve the defenses in between attacks. (I guess the parallel would be calculating how often you'd have to light up the cave in order for the fish to not lose their sight.)

        True, but sometimes when we take medications which block the biochemical pathways, then we find some of the sequences from the secondary or tertiary pathways are activated, so they can, in fact, be expressed.

        Of course, if that resistance is that far back, it might think you're a fish and do things in a way that isn't good for you, either.

        • Fair point. I suppose it's possible that our bodies are routinely facing these attacks and shutting them down without us noticing?
          • That would make an interesting paper for a journal

            • Throw a bunch of random newly discovered viruses at some human cells in culture, see which noncoding RNAs get their expression boosted...
            • Science has had a couple of papers last year addressing this. Antigen tests of the population of central China found that indeed, they appear to have been immunologically challenged by multiple unknown coronaviruses. And a study modeling the initial emergence of the pandemic, using the transmission characteristics of the initial virus, indicated that successfully establishing a pandemic was a somewhat unlikely event - multiple similar cross-overs will occur for every one that takes off. This is easily see

  • Re: (Score:1, Troll)

    Comment removed based on user account deletion
  • Viruses are everywhere and pretty much every lifeform has parasitical viruses that target them but they are generally very specific to their hosts. And evolved in synchrony with them. There are way more than 100,000 viruses that we don't know anything about, but that doesn't mean they are a threat to us.

    Yes it is entirely possible that some small fraction could mutate sufficiently to cross over to humans. Many of those we do know about. It would cost a few billion to develop vaccines against known zoonotic

  • It seems like the novelty here is just the toolset they developed. It's hard to believe anyone was surprised by the results or previously unclassified viruses in the data. Scientists have been sampling and sequencing all sorts of viruses since the tech to do so has been available.

Biology is the only science in which multiplication means the same thing as division.

Working...