Ambiguous Stop Codons Expand the Genetic Code in Archaea

The beauty of the DNA code is that organisms interpret it unambiguously. Each three-letter nucleotide sequence, or codon, in a gene codes for a unique amino acid that's added to a chain of amino acids to make a protein.

But University of California, Berkeley, researchers have now shown that one microorganism can live with a bit of ambiguity in its genetic code, overturning a standard dogma of biology.

The organism, a methane-producing member of a group of microbes called Archaea, interprets one three-letter sequence - normally a stop codon that signals the end of a protein - in two different ways, synthesizing two different proteins seemingly at random, though biased by conditions in the environment. The microbe, Methanosarcina acetivorans, survives just fine with this loosey-goosey translation, proving that life can exist with a slightly imprecise genetic code.

The ambiguity may have arisen to allow the microbes to incorporate an uncommon amino acid, pyrrolysine, into an enzyme needed to digest a specific food - methylamine - that is common in the environment, including the human gut.

"Objectively, ambiguity in the genetic code should be deleterious; you end up generating a random pool of proteins," said Dipti Nayak, a UC Berkeley assistant professor of molecular and cell biology and senior author of a paper describing the findings published Nov. 6 in the journal Proceedings of the National Academy of Sciences. "But biological systems are more ambiguous than we give them credit to be and that ambiguity is actually a feature - it's not a bug."

Archaea that eat methylamines, and bacteria that may have acquired that ability too, play an important role in the human body. In the liver, metabolites released by red meat are turned into trimethylamine N-oxide, which is associated with cardiovascular disease. We rely upon these microbes to remove those methylamines before they reach the liver.

The findings have implications for future disease therapies. Some researchers have speculated that introducing some imprecision into the translation machinery might help treat diseases caused by a premature stop codon in important genes, which produces nonfunctional proteins. That includes about 10% of all genetic diseases, including cystic fibrosis and Duchenne muscular dystrophy. Making a stop codon a bit leaky could allow enough of the normal protein to be produced to alleviate symptoms.

The Genetic Cipher

The DNA in the genome is initially transcribed into RNA, and that genetic code is then read by cellular machinery to produce proteins. The nucleic acids that comprise RNA come in four varieties - adenine (A), cytosine (C), guanine (G) and uracil (U). In most organisms investigated to date, groups of three nucleic acids or codons are either assigned to a single amino acid or a so-called stop codon, which terminates synthesis of that protein. When the RNA gets translated into a string of amino acids, the machinery always abides by this one-to-one association.

Not all organisms decode RNA in the same way, however. Some assign a different amino acid to a given codon, some have more than the standard 20 amino acids per organism, and codons are redundant - several can code for the same amino acid. But uniformly across the tree of life, each codon has only one meaning - no ifs, ands or buts.

It's essentially like a cipher. You're taking something in one language and translating it into another, nucleotides to amino acids."

Dipti Nayak, UC Berkeley assistant professor of molecular and cell biology and senior author

Scientists have known for a long time that many members of the Archaea produce pyrrolysine, giving them 21 amino acid options instead of the usual 20 from which to make proteins. It has advantages, Nayak said.

"Now that you have a new amino acid, the world's your oyster," she said. "You can start playing around with the much larger code. It's like adding one more letter to the alphabet."

But these organisms were thought to have merely changed the interpretation of the UAG stop codon to code for pyrrolysine instead.

In the new study, Nayak and former graduate student Katie Shalvarjian surveyed the Archaea and found pyrrolysine production in many lineages.

"We found that the machinery required to create pyrrolysine is widespread in the Archaea, especially amongst these methanogenic archaea that consume methylated amines," said Shalvarjian, now a postdoctoral researcher at Lawrence Livermore National Laboratory.

She was curious, however, how having 21 instead of 20 amino acids affects these organisms and their physiology. While investigating the methanogen's genetic control of pyrrolysine production, she noticed that the UAG codon was not always interpreted as pyrrolysine (Pyl).

"The UAG codon is like a fork in the road, where it can be interpreted either as a stop codon or as a pyrrolysine residue," Shalvarjian said. "We think whether or not a protein exists primarily in its elongated or in its truncated form might form a regulatory cue for the cell."

Nayak and Shalvarjian looked for sequence or structure dependent cues that might affect interpretation of the UAG codon, but found none.

"The methanogens have not recoded UAG, nor have they added any new factors to make it deterministic," Nayak said. "They're flip-flopping back and forth between whether they should call this a stop or whether they should keep going by adding this new amino acid. They cannot decide. They just do both and they seem to be fine by making this random choice."

Preliminary evidence suggests that the supply of pyrrolysine in the cell may be the determining factor. If it's flooding the cell, it may bias the interpretation of UAG more toward incorporating the amino acid into a protein. There are between 200 and 300 genes in this organism that contain the UAG codon and thus have the ability to produce a protein containing pyrrolysine. With little of the amino acid around, however, UAG is interpreted as a stop codon, yielding a different protein that may or may not be functional, depending on the context.

"This really opens the door to finding interesting ways to control how cells interpret stop codons," Nayak said.

The work was funded by the Searle Scholars Program, a Rose Hills Innovator Grant, a Beckman Young Investigator Award, an Alfred P. Sloan Research Fellowship, a Simons Foundation Early Career Investigator in Marine Microbial Ecology and Evolution Award, and a Packard Fellowship in Science and Engineering. Nayak is also a Chan-Zuckerberg Biohub-San Francisco investigator.

Other co-authors are Grayson Chadwick and Paloma Pérez of UC Berkeley and Philip Woods and Victoria Orphan of the California Institute of Technology.

Source:
Journal reference:

Shalvarjian, K. E., et al. (2025). Methanogenic archaea encoding Pyrrolysine maintain ambiguous amber codon usage. Proceedings of the National Academy of Sciences. doi: 10.1073/pnas.2517473122. https://www.pnas.org/doi/10.1073/pnas.2517473122

Posted in: Genomics

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of AZoLifeSciences.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
RNA Imaging Predicts Early Cell Damage with High Sensitivity