Follow Slashdot stories on Twitter

 



Forgot your password?
typodupeerror
×
Biotech

Four New DNA Letters Double Life's Alphabet (nature.com) 67

Joe_NoOne (Slashdot reader #48,818) shares this update from Nature: The DNA of life on Earth naturally stores its information in just four key chemicals -- guanine, cytosine, adenine and thymine, commonly referred to as G, C, A and T, respectively. Now scientists have doubled this number of life's building blocks, creating for the first time a synthetic, eight-letter genetic language that seems to store and transcribe information just like natural DNA.

In a study published on 22 February in Science, a consortium of researchers led by Steven Benner, founder of the Foundation for Applied Molecular Evolution in Alachua, Florida, suggests that an expanded genetic alphabet could, in theory, also support life. "It's a real landmark," says Floyd Romesberg, a chemical biologist at the Scripps Research Institute in La Jolla, California. The study implies that there is nothing particularly "magic" or special about those four chemicals that evolved on Earth, says Romesberg. "That's a conceptual breakthrough," he adds... Benner says that the work shows that life could potentially be supported by DNA bases with different structures from the four that we know, which could be relevant in the search for signatures of life elsewhere in the Universe...

The researchers call the resulting eight-letter language 'hachimoji' after the Japanese words for 'eight' and 'letter'. The additional bases are each similar in shape to one of the natural four, but have variations in their bonding patterns. The researchers then conducted a series of experiments that showed that their synthetic sequences shares properties with natural DNA that are essential for supporting life... Benner's group previously showed that strands of DNA that included Z and P were better at binding to cancer cells than sequences with just the standard four bases. And Benner has set up a company which commercialises synthetic DNA for use in medical diagnostics.

This discussion has been archived. No new comments can be posted.

Four New DNA Letters Double Life's Alphabet

Comments Filter:
  • Good god man, don't tell Apple about this!
    • This is like going from ASCII to Unicode. But it isn't really "doubling". With four nucleotides, each base pair encodes 2 bits. With eight pairs, it is 3 bits each. So this is only a 50% improvement in information density.

      But I am not sure how much this helps. With four nucleotides we already have 64 triples, but there are only 20 amino acids. Add stop and start codons, and that is only 22. So there is already plenty of redundancy.

      At least for now, I am sticking with four nucleotides.

      • by Livius ( 318358 )

        With four nucleotides, each base pair encodes 2 bits. With eight pairs, it is 3 bits each. So this is only a 50% improvement in information density.

        3 bits is twice as much information as 2 bits.

  • 6 to 8 (Score:4, Informative)

    by Artem S. Tashkinov ( 764309 ) on Saturday March 02, 2019 @02:42PM (#58204664) Homepage
    Not meaning to downplay the significance of this breakthrough but early last year synthetic biologist Floyd E. Romesberg already announced the creation of two additional synthetic letters [ted.com].
    • Don't mistake an entertainment focused presentation like a TED talk for an original research presentation.

      FAME and Steve Benner have been the leaders in this field for a long time. Benner made the first expansion from 4 to 6 bases about 30 years ago. SRI and Floyd Romesberg are very good, but they are following Benner here (and Romesberg is a better public speaker).

      You're right, though, that this is from 6 to 8.

  • by oldgraybeard ( 2939809 ) on Saturday March 02, 2019 @02:45PM (#58204680)
    Life is 8 bit
    How soon before we move to 16 -> 32 -> 64?

    Just my 2 cents ;)
    • by Anonymous Coward

      You can only have one of any of the 8 occupying a given strand pair position, meaning that it is 2^3 or 3 bit resolution, up from 2 bit with the previous base pairs. The rest of what you say is correct however.

      • The rest of what you say is correct however.

        Right, when you double numbers they do get bigger, and the 2 cents is correct.

    • Life on Earth (DNA code) is still four-bit. Synthetic organisms can't yet produce synthetic base pairs.
    • by Red_Forman ( 5546482 ) on Saturday March 02, 2019 @03:34PM (#58204814)

      How soon before we move to 16 -> 32 -> 64?

      It depends if you rely on the marketing departments of NEC, SEGA or Intel.

    • Life is 8 bit

      It's not. A codon consists of 3 nucleotides, each of which has 4 different values (i.e., distinct chemical composition) equivalent to a two bit encoding. Total: six bits. The artificially extended version adds one bit to the nucleotide range, times 3 nucleotides, total 9 bits. So there is no way to interpret your joke as correct, sorry. It's not about lacking a sense of humour, it just that the numbers need to add up. For example, "life is braille". Which is actually close to the truth when you look at the

      • Hmm, correction, the ribosome actually doesn't need to change much if at all for their 3 bit code, mostly they just need some new tRNAs that incorporate their new nucleotides. The natural machinery should be able to handle it with minimal modification. Yikes.

      • by rgmoore ( 133276 )

        It's not just four distinct nucleotides, you also need a decoder

        You need more than just a decoder. You also need a supply of the nucleotides. If you're doing this as a proof of concept experiment in vitro, you can get away with synthesizing them chemically, but if you want it to work in vivo, you need would need a whole set of enzymes to synthesize, metabolize, and transport those nucleotides. That would probably require dozens of new enzymes, which is way beyond the current state of the art.

  • https://science.slashdot.org/s... [slashdot.org]

    At least it's not on the front page anymore...

  • ..to the zombie apocalypse.
  • by Solandri ( 704621 ) on Saturday March 02, 2019 @04:01PM (#58204914)
    Humans have created a variety of languages, from Chinese with thousands of different characters (one for each word), to English with 26 characters which are combined to make different words, to binary with just 2 characters which are combined to make words. When researchers started playing around with compression algorithms, they got to wondering - what's the optimal number of characters in an alphabet for maximizing compression? That is, minimizing the size of the words, while also minimizing the space taken up by each character. With binary, you minimize the space needed to encode each character, but it comes at the cost of lengthening the size of each word. With Chinese you minimize the size of each word, but it comes at the cost of increasing the space needed to encode each character. How many letters in an alphabet results in the most compact language?

    The answer turns out to be e. 2.718. An alphabet with e characters allows you to represent data the most efficiently and compactly. Obviously you can't have a non-integer number of characters, so the optimal number of characters for a compact language is 3.

    Which is probably why DNA only codes 4 different molecules. Since a double helix with conjugate pairs can't be coded with 3 letters, 4 end up being the next step. Likely, DNA/RNA with more base pairs have developed naturally before (probably several times), but were eventually selected out after having to compete with 4-base pair DNA. So as interesting as this is, it probably isn't the first time it's happened like TFA states.
    • by dfghjk ( 711126 )

      "Likely, DNA/RNA with more base pairs have developed naturally before (probably several times), but were eventually selected out after having to compete with 4-base pair DNA."

      And why would it be selected out? Are you claiming that it could not compete with 4-base pair due encoding efficiency? Please.

  • Benner says that the work shows that life could potentially be supported by DNA bases with different structures from the four that we know, which could be relevant in the search for signatures of life elsewhere in the Universe...

    Not sure why anyone would expect to find terrestrial DNA anywhere but Terra. Totally unrelated creatures would use a totally different system. If they did find something like ours, that would mean we were relatives.

    • Because if DNA is the only way to do life, then anywhere life exists, it will have DNA. They've shown that there's at least one potential alternative.
    • Might I suggest you look up "panspermia" ? In it's fullest theories, it's the idea that the most elementary building blocks of life were seeded on Earth from interstellar dust or asteroids, and that they will be seeded on other worlds the same way.

      Even if panspermia did not happen. It does not mean that significant amounts of biochemistry, perhaps including some form of DNA, might not exist elsewhere. The nature and extent of parallel evolution surprised Charles Darwin when he originally wrote about evoluti

Understanding is always the understanding of a smaller problem in relation to a bigger problem. -- P.D. Ouspensky

Working...