Mapping the Genetic Code

Hilary Spencer

Wednesday, 29 Aug 2007 21:33 UTC

This month’s cover of Nature Genetics features a modified Venn diagram by Anthony Edwards1 which illustrates the space of possible base combinations.

An article describing the layout is available on Nature Precedings. Myles Axton has also written a brief note which includes a quote from Dr. Edwards explaining the color codes used in the diagram: “Codons in contiguous areas always differ by just one base. Coloring the areas according to their corresponding amino acids leads to the final diagram. We have selected colors that help to classify the amino acids as basic, acidic, polar and nonpolar. Gray is used for the three stop codons.” I think this is a great way to use color to signify similarity—the same color represents the same amino acid.

The space of codons is frequently described in a 3-dimensional chart, like the one found here. While the 3d table is useful for looking up the corresponding amino acid, Dr. Edwards’s representation makes it easier to see the changes due to a single point mutation (e.g. a point mutation would be represented by moving from the space of the original codon to any one of the adjacent spaces).

Venn diagrams, invented by John Venn in 1881, were a response in part to the Euler’s circles invented in 1786. Traditional Venn diagrams, which only use circles, are limited in their ability to scale to higher dimensions (see this AMS article) and, when used to represent logical statements, cannot represent all possible logical statements (see the article on Diagrams in the Stanford Encyclopedia of Philosophy. Pierce followed up with a proposed technique for representing statements that were inexpressible in Venn diagrams (such as disjunctive statements—A or B—and existential statements—there exists an A that is also a B) by replacing Venn’s shading with a system of X’s and O’s connected by lines. However, Pierce’s system was unintuitive and difficult to follow. in 1994, Shin proposed a new technique that makes Pierce’s proposed modifications easier to use. Shin and Pierce both use a connecting line between x’s to express disjunctive information (e.g. A or B).

Instead of using color to express codons that code for the same amino acid (such as UCU and UCC), Edwards could have used connecting lines to express what is effectively a disjunctive relation. I tried overlaying this information on the black and white Edwards-Venn diagram and ended up with what looked like a bit of a cobweb. I also think I probably looked a bit like Russell Crowe’s character in A Beautiful Mind while sitting at my desk with a ruler and pen, connecting letters on a diagram.

1 Edwards, A.W.F.. Picturing the genetic code. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2007.682.1 (2007)

Updated 24 Sep 2007 02:15 UTC

  • Replies

    Post a reply
    • Its a very clever and interesting way to represent that data. Although, I am not so sure about calling all basic amino acids “the same” (the same holds true for acidic, hydrophobic, polar, etc.). I know what you are trying to say, but at the same time, sometimes people overlook the biochemical significance of the differences amoung amino acids with the same gross properties. Nevertheless, displaying this information in a Venn diagram is a unique way of looking at it, which I find, helps to provide insight. Just my 2 cents.
      Also, did the “cobwebs” help at all or were they just more confusing than they were worth?

    • I agree that the Venn diagram is impressive. I was working on a related problem in March this year which might be of interest to you. I tried to find a graphical representation of the code that would allow all the codons for leucine, serine, arginine and stop to be brought together.

      I chose the circular depiction of the code, but placed the second base in the centre, followed by the first and third codon base, and allowed for some variation in the way the bases were listed. The results were amazing. You can either try it yourself, take a peak at some of the results I published on the net – or wait for my article to appear in the Journal of Theoretical Biology.

      It seems to me that all the different ways to represent the genetic code have their pros and cons. In a way, the Venn diagram is one step ahead of the 2-1-3 perspective, but may be slightly off-putting to people not used to the approach. Although I am not an expert I love it and hope that it will come in useful. Congratulations.

    • Roland—Have you seen The American Heritage Dictionary’s diagram of Indo-European Languages? Your diagram reminds me a bit of this. I don’t know what this type of representation is called (it’s not a circular tree map), but I think it can be used for any tree-like (taxonomic?) representations. The language diagram also seems to implicitly capture the evolutionary relationship between the languages. I know next to nothing about languages, but it appears that time is mapped on the radius (?)

      Congrats on the paper!

    Post a reply

Search forums Advanced search

Submit this topic to

web feed

Advertisement