Slashdot is powered by your submissions, so send in your scoop

 



Forgot your password?
typodupeerror
×
China Medicine Science

Scientist Finds Early Virus Sequences That Had Been Mysteriously Deleted (seattletimes.com) 336

UPDATE (7/30): All the missing virus sequences have now been published, with their deletion being explained as just "an editorial oversight by a scientific journal," according to the New York Times.

In Slashdot's original report, an anonymous reader quoted another report from The New York Times: About a year ago, genetic sequences from more than 200 virus samples from early cases of Covid-19 in Wuhan disappeared from an online scientific database. Now, by rooting through files stored on Google Cloud, a researcher in Seattle reports that he has recovered 13 of those original sequences -- intriguing new information for discerning when and how the virus may have spilled over from a bat or another animal into humans. The new analysis, released on Tuesday, bolsters earlier suggestions that a variety of coronaviruses may have been circulating in Wuhan before the initial outbreaks linked to animal and seafood markets in December 2019. As the Biden administration investigates the contested origins of the virus, known as SARS-CoV-2, the study neither strengthens nor discounts the hypothesis that the pathogen leaked out of a famous Wuhan lab. But it does raise questions about why original sequences were deleted, and suggests that there may be more revelations to recover from the far corners of the internet.
UPDATE (6/25): The Washington Post notes the data wasn't exactly suppressed. "Processed forms of the same data were included in a preprint paper from Chinese scientists posted in March 2020 and, after peer review, published that June in the journal Small." And in addition: The NIH released a statement Wednesday saying that a researcher who originally published the genetic sequences asked for them to be removed from the NIH database so that they could be included in a different database. The agency said it is standard practice to remove data if requested to do so...

Bloom's paper acknowledges that there are benign reasons why researchers might want to delete data from a public database. The data cited by Bloom are not alone in being removed by the NIH during the pandemic. The agency, in response to an inquiry from The Post, said the National Library of Medicine has so far identified eight instances since the start of the pandemic when researchers had withdrawn submissions to the library.

"This one from China and the rest from submitters predominantly in the U.S.," the NIH said in its response. "All of those followed standard operating procedures."

The New York Times writes: The genetic sequences of viral samples hold crucial clues about how SARS-CoV-2 shifted to our species from another animal, most likely a bat. Most precious of all are sequences from early in the pandemic, because they take scientists closer to the original spillover event. As [Jesse Bloom, a virologist at the Fred Hutchinson Cancer Research Center who wrote the new report] was reviewing what genetic data had been published by various research groups, he came across a March 2020 study with a spreadsheet that included information on 241 genetic sequences collected by scientists at Wuhan University. The spreadsheet indicated that the scientists had uploaded the sequences to an online database called the Sequence Read Archive, managed by the U.S. government's National Library of Medicine. But when Dr. Bloom looked for the Wuhan sequences in the database earlier this month, his only result was "no item found." Puzzled, he went back to the spreadsheet for any further clues. It indicated that the 241 sequences had been collected by a scientist named Aisi Fu at Renmin Hospital in Wuhan. Searching medical literature, Dr. Bloom eventually found another study posted online in March 2020 by Dr. Fu and colleagues, describing a new experimental test for SARS-CoV-2. The Chinese scientists published it in a scientific journal three months later. In that study, the scientists wrote that they had looked at 45 samples from nasal swabs taken "from outpatients with suspected Covid-19 early in the epidemic." They then searched for a portion of SARS-CoV-2's genetic material in the swabs. The researchers did not publish the actual sequences of the genes they fished out of the samples. Instead, they only published some mutations in the viruses.

But a number of clues indicated to Dr. Bloom that the samples were the source of the 241 missing sequences. The papers included no explanation as to why the sequences had been uploaded to the Sequence Read Archive, only to disappear later. Perusing the archive, Dr. Bloom figured out that many of the sequences were stored as files on Google Cloud. Each sequence was contained in a file in the cloud, and the names of the files all shared the same basic format, he reported. Dr. Bloom swapped in the code for a missing sequence from Wuhan. Suddenly, he had the sequence. All told, he managed to recover 13 sequences from the cloud this way. With this new data, Dr. Bloom looked back once more at the early stages of the pandemic. He combined the 13 sequences with other published sequences of early coronaviruses, hoping to make progress on building the family tree of SARS-CoV-2. Working out all the steps by which SARS-CoV-2 evolved from a bat virus has been a challenge because scientists still have a limited number of samples to study. Some of the earliest samples come from the Huanan Seafood Wholesale Market in Wuhan, where an outbreak occurred in December 2019. But those market viruses actually have three extra mutations that are missing from SARS-CoV-2 samples collected weeks later. In other words, those later viruses look more like coronaviruses found in bats, supporting the idea that there was some early lineage of the virus that did not pass through the seafood market. Dr. Bloom found that the deleted sequences he recovered from the cloud also lack those extra mutations. "They're three steps more similar to the bat coronaviruses than the viruses from the Huanan fish market," Dr. Bloom said. This suggests, he said, that by the time SARS-CoV-2 reached the market, it had been circulating for awhile in Wuhan or beyond. The market viruses, he argued, aren't representative of full diversity of coronaviruses already loose in late 2019.

UPDATE (7/30): When republishing their sequences, the researchers indicated they actually came from January 30, 2020 (and not "late 2019").
This discussion has been archived. No new comments can be posted.

Scientist Finds Early Virus Sequences That Had Been Mysteriously Deleted

Comments Filter:
  • Good timing (Score:4, Funny)

    by Bad Ad ( 729117 ) on Thursday June 24, 2021 @08:05AM (#61516156)
    Good job this was done before September 23rd!
  • by kackle ( 910159 ) on Thursday June 24, 2021 @08:07AM (#61516162)
    This may sound silly, but is this something that might exist on The Wayback Machine [archive.org]?
    • This may sound silly, but is this something that might exist on The Wayback Machine [archive.org]?

      For a BSL-4 facility?

      This may sound silly, but that would be called required auditing.

      Of course, I'm also talking about US safety standards...

      • by hey! ( 33014 )

        Samples aren't always handled in a BSL-4 facility. The lab that analyzes your nasal swab isn't BSL-4, it just takes basic microbiological precautions. US research guidelines don't require BSL-4 facilities except for certain highly risky research protocols, like inoculating live animals. Culturing viruses in cells requires BSL-3.

        In any case, the auditing in a more secure facility wouldn't necessarily protect the data and analytical products. That kind of stuff is usually held pretty close to the research

    • Re:Wuhan Wayback (Score:5, Informative)

      by Samantha Wright ( 1324923 ) on Thursday June 24, 2021 @08:34AM (#61516234) Homepage Journal

      Adding the SRA to the Internet Archive would be an immense undertaking for several reasons. Not only is it huge (36 PB as of January; the IA is ~50 PB as of April) but a lot of the sequences are not publicly accessible because they can involve data from human subjects. There are two official mirrors (at DDBJ in Japan and EBI in the UK) but AFAIK they would mirror deletions, as well.

      Incidentally, to remove something from the SRA requires a written email request sent to the NCBI. It would be very interesting to see the reason given in that email.

      • by jd ( 1658 )

        Ideally, all genetic databases (be they for BLAST, the Genbank database, or any of the others you list) should be stored in some form of version control system where it is possible to synthesize any edition from the changes. Ideal is not always practical, and genetic data (through sheer volume) would be incredibly difficult to process. Anyone here familiar with BLAST (not sure if it still is, but it was certainly a free to search database) knows finding anything takes time and processor power.

        I'd still say

  • Confirmation bias (Score:5, Insightful)

    by RogueWarrior65 ( 678876 ) on Thursday June 24, 2021 @08:26AM (#61516214)

    Everyone will read this post and use it to confirm their own biases.

    • Comment removed based on user account deletion
      • If you remember back to december 2019 and january 2020, it was blatantly obvious china was covering up SOMETHING. too many scientists and reporters got 'dissapeared' they even live streamed their arrests, never to be seen again. Then 2 mos later there was that mysterious brand new hotel that collapsed like the twin towers. It was allegedly housing 50 covid patients. I have no idea why so many rush to the defense of China here. They arent much better than Iran, just more powerful.
        • Ah yes, of course it would be much easier for the Chinese government to do a controlled demolition of a whole building than to just put 50 people om a bus. Yes China obviously did what they could to hide that they had any problems what so ever early on since they wanted to save face, but extrapolating that to controlled demolition is to take it two million steps to far.
      • Re: (Score:2, Insightful)

        by Zak3056 ( 69287 )

        I think a bunch of conspiracy theorists will go for the "Proves LIEberals were LYING when they said it wasn't leaked from lab!!!" thing again, then linking to a bunch of cases where the media didn't take seriously the theory that the virus was created in a lab.

        Forget the "liberal" label, for a minute, but the above is just plain revisionist history [slashdot.org].

        It WAS quite fashionable to conflate "lab leak" with "lab created: in an attempt to discredit anyone raising the possibility, though, just like you're doing above.

        • by Darinbob ( 1142669 ) on Thursday June 24, 2021 @12:42PM (#61517076)

          The "lab leak" story seems to serve 2 purposes. First, it gives a way to blame a person, company, or country; having someone to blame is great for politics. Second, it gives a bit of false assurance that once you've figured out the origin that you need no longer fear pandemics in the future and thus do not need to take the necessary steps to try and reduce their likelihood.

          All a lab leak shows if proven is that people were sloppy.

      • Re: (Score:2, Insightful)

        It's not about created in a lab but escaped from a lab where gain of function research was taking place. Occam's Razor clearly points at the Wuhan Institute of Virology but this whole things has become so stupidly politicized that people have dug their heels in and could care less what the facts show.
    • by RightwingNutjob ( 1302813 ) on Thursday June 24, 2021 @08:38AM (#61516246)

      A year and a half of being lied to from all directions will do that to people.

      As far as I can tell, there was never any firm evidence for or against the lab leak or the seafood market. Both were plausible at the outset, but the nonsense from the media that

      "lab leak" =
      "engineered virus" =
        "bioweapon" =
        "illuminati" =
      "faster download speeds" =
        "racist conspiracy theory"

      got traction in the media...for transparently cynical political reasons...and tainted the nominally authoritative sources of information to such an absurd degree that I'd be surprised if large numbers of otherwise smart people didn't start latching onto noise and nonsense.

      If this were happening in some third world shithole country, we'd all chuckle at the rubes believing their silly superstitions and smugly declare that something like that couldn't happen in a literate, educated, scientific, liberal democracy like ours.

      But guess what: the social fabric is fragile. And introducing one element that makes the shithole countries so pungent (for example, putting propaganda up as press) tends to invite the other pathologies in as well.

      • by Darinbob ( 1142669 ) on Thursday June 24, 2021 @12:45PM (#61517100)

        It helps to have the "leader" of the free world espousing these stories. It's one thing for crazy uncle Larry to have some nutball ideas, they won't spread too far, but when the president is a nutcase in public with a fan base who believe he was personally annointed by God, those conspiracy theories will spread far and wide. Almost like a pandemic of misinformation.

      • Re: (Score:2, Insightful)

        True, but you know that everyone, including those that upvoted you, take your comment to mean "the other side's propaganda was dangerous", while remaining convinced that their side's was divine, scientific, and free of bias.

        They will offer you example after example of the other side's mistakes and silly claims, but when you ask them to point that critical eye are their own side, they say "well, what do you mean?"

        The ability to accept our side as flawed does not exist, and in rare cases in which it does, suc

    • Curious information but unfortunately inconclusive. Very odd would think such information would be better secured and reviewed.
    • by gweihir ( 88907 )

      Yep. Mine is that most people are stupid. Nicely confirmed.

    • Probably because there isn't enough to draw conclusions from yet. The real revelation with the Lab Leak theory so for is how susceptible scientific consensus was to political pressure.
  • scientists had uploaded the sequences to an online database called the Sequence Read Archive, managed by the U.S. government's National Library of Medicine

    Surely the data can be recovered from that system then, if it is controlled by the US. From a backup, or if it is still in the database and just flagged as "hidden" or "inactive", or from the server logs containing POSTed data, etc.

  • What China does know better than everyone else is that they are a large country that until a few decades ago was largely isolated from the rest of the world. They have diseases, many not yet identified or understood, that the rest of the world has little to no immunity. Its a difficult situation for them to handle because they don't want the rest of the world to mistrust them or their products.

    In a perfect world everyone would put their cards on the table and cooperate to address the issue. Unfortunately
    • There is no "now", they always have. Also, if someone tries to kill you, it's perfectly moral and ethical to kill them before they can. Even the Dali Lama thought so.
      • >There is no "now", they always have.
        I agree, but now we have given them a valid reason to be uncooperative

        >Also, if someone tries to kill you, it's perfectly moral and ethical to kill them before they can. Even the Dali Lama thought so.
        I assume you are commenting on my signature line. In my view a cannibal is insane, and likewise anybody who would take a life for any reason would also be insane. I doubt you and the Dali Lama are suggesting that if somebody tries to eat you, it is moral and ethica
  • by tekram ( 8023518 ) on Thursday June 24, 2021 @09:58AM (#61516472)
    Jesse Bloom found exactly nothing new that is not already part of the scientific literature, Garry wrote in an email. He called the Bloom paper inflammatory.
    Dr. Robert F. Garry of Tulane University is the virologist who co-wrote the earliest influential March 2020 paper on SARS-COV2.
    Another mistake in the OP is that Bloom is not a virologist but a computational biologist: Jesse Bloom, a computational biologist at the Fred Hutchinson Cancer Research Center in Seattle.

    University of California at San Diego evolutionary biologist Dr. Joel Wertheim, who has studied the emergence of the virus in Hubei province, said, I actually dont think this study adds much to the origins debate.
    The NIH released a statement Wednesday saying that a researcher who originally published the genetic sequences asked for them to be removed from the NIH database so that they could be included in a different database. The NIH said it is standard practice to remove data if requested to do so. The NIH statement did not identify the scientist who requested that the material be excised from the agencyâ(TM)s sequence read archive, known as the SRA.
  • by LordHighExecutioner ( 4245243 ) on Thursday June 24, 2021 @10:03AM (#61516490)
    ...installing an antivirus on top of Google Cloud.
  • Why is this even news? I've forgotten how many times people have "lost files" on a network share or cloud. Then later figured out it mistakenly got moved when a person accidentally renamed stuff. I've also lost count of how many companies lost valuable source code and couldn't find it again.
  • You tell me "a scientist named Aisi Fu", no way this is a coincidence...

  • Whatever sequence they come up to figure out where this virus came from will pale in comparison to the Delta variant ramping up in this country. Currently, one third of all variants in the Dakotas region [nd.gov], which includes Montana, Colorado, Wyoming and Utah, are the delta variant. What's neat about the delta variant is it more contagious, more lethal AND, even if you were previously infected, you may not be protected against the new variant.

    Currently, estimates put the delta variant at over 20% of all
  • by TheMiddleRoad ( 1153113 ) on Thursday June 24, 2021 @11:56AM (#61516888)

    The sequences were moved to other databases and requested to be deleted for version control. What a garbage dump /. is now.

  • by rsilvergun ( 571051 ) on Thursday June 24, 2021 @12:10PM (#61516940)
    for us to think it was a lab leak. That's because addressing the much more likely cause (that it was caused by deforestation putting humans and livestock in contact with bat populations that are both prone to coronaviruses and not particularly affected by them) would require a massive cutback in very profitable slash & burn economic policies that China is using to keep their economy growing at the breakneck pace needed to keep their population distracted from their complete lack of civil rights.

    So yeah, we already know China was covering things up. The question is why. As for me, I find it much more likely that the thing Epidemiologists have been warning us about for 30-40 years (that deforestation was going to lead to new viruses as wild animals were forced out of their habitats) happened like it has many times in the past. But hey, since when have we listened to experts?
  • by joe_frisch ( 1366229 ) on Thursday June 24, 2021 @01:12PM (#61517230)
    I think people are looking at the lab origin question the wrong way. If the virus was an accidental lab release, then its important to understand how that happened so we can prevent a repeat in the future. One claim is that Wuhan was doing "gain of function" experiments where a disease that can't infect humans was modified so that it could - a legitimate type of research. But - the claim is that this was done under P2 controls, as was required. If that is true, then international lab safety rules need to be changed. https://thebulletin.org/2021/0... [thebulletin.org] To be 100% absolutely clear, I am NOT saying the above linked information is correct, just that there is some belief that the safety protocols met international standards but that those standards are not sufficiently tight. OTOH if this was not a lab escape, then we need to look at other ways to reduce the risk of a repeat of the pandemic. The idea that China will compensate the world for trillions of dollars for this is absurd, and not warranted unless it was either intentional, or a gross violation of international safety standards. Changing international safety standards may be appropriate.

Every nonzero finite dimensional inner product space has an orthonormal basis. It makes sense, when you don't think about it.

Working...