×

Reversibility and efficiency in coding protein information. (English) Zbl 1414.92140

Summary: Why the genetic code has a fixed length? Protein information is transferred by coding each amino acid using codons whose length equals 3 for all amino acids. Hence the most probable and the least probable amino acid get a codeword with an equal length. Moreover, the distributions of amino acids found in nature are not uniform and therefore the efficiency of such codes is sub-optimal. The origins of these apparently non-efficient codes are yet unclear. In this paper we propose an a priori argument for the energy efficiency of such codes resulting from their reversibility, in contrast to their time inefficiency. Such codes are reversible in the sense that a primitive processor, reading three letters in each step, can always reverse its operation, undoing its process.
We examine the codes for the distributions of amino acids that exist in nature and show that they could not be both time efficient and reversible. We investigate a family of Zipf-type distributions and present their efficient (non-fixed length) prefix code, their graphs, and the condition for their reversibility. We prove that for a large family of such distributions, if the code is time efficient, it could not be reversible. In other words, if pre-biotic processes demand reversibility, the protein code could not be time efficient. The benefits of reversibility are clear: reversible processes are adiabatic, namely, they dissipate a very small amount of energy. Such processes must be done slowly enough; therefore time efficiency is non-important. It is reasonable to assume that early biochemical complexes were more prone towards energy efficiency, where forward and backward processes were almost symmetrical.

MSC:

92C40 Biochemistry, molecular biology
92D20 Protein sequences, DNA sequences
Full Text: DOI

References:

[1] Bennett, C., The thermodynamics of computation—a review, Int. J. Theor. Phys., 21, 905-940 (1982)
[2] Cover, T. M.; Thomas, J. A., Elements of Information Theory (2006), Wiley Interscience · Zbl 1140.94001
[3] Doig, A. J., Improving the efficiency of the genetic code by varying the codon length—the perfect genetic code, J. Theor. Biol., 188, 355-360 (1997)
[4] Farhi, E., Goldstone, J., Gutmann, S., Sipser, M., 2000. Quantum computation by adiabatic evolution, arXiv:quant-ph/000110.; Farhi, E., Goldstone, J., Gutmann, S., Sipser, M., 2000. Quantum computation by adiabatic evolution, arXiv:quant-ph/000110.
[5] Gong, L.; Bouaynaya, N.; Schonfeld, D., Information theoretic model of evolution over protein communication channel, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 99 (2009)
[6] Kim, J. K.; Yang, S. I.; Kwon, Y. H.; Lee, E. I., Codon and amino-acid distribution in D.N.A., Chaos, Solitons and Fractals, 23, 5, 1795-1807 (2005) · Zbl 1066.92023
[7] Landauer, R., Irreversibility and heat generation in the computing process, IBM Journal of Research and Development, 5, 183-191 (1961) · Zbl 1160.68305
[8] Leff, H.L., Rex, A.F., 2003. Maxwell’s Demon 2, Entropy, Classical and Quantum Information, Computing. Institute of Physics.; Leff, H.L., Rex, A.F., 2003. Maxwell’s Demon 2, Entropy, Classical and Quantum Information, Computing. Institute of Physics.
[9] Orgel, L. E., Journal of Molecular Biology, 38, 381-393 (1968)
[10] Planck, M., Treatise on Thermodynamics (1921), Dover
[11] Shannon, C. E.; Weaver, W., The Mathematical Theory of Communication (1949), University of Illinois Press: University of Illinois Press Urbana · Zbl 0041.25804
[12] Som, A.; Chattopadhyay, S.; Chakrabarti, J.; Bandyopadhyay, D., Codon distribution in DNA, Physical Review E, 63, 051908 (2001)
[13] Szilard, L., Uber die entropieverminderung in einem thermodynamischen system bei eingriffen intelligenter wesen, Zeitschrift für Physik, 53, 840-856 (1929) · JFM 55.0488.06
[14] Ycas, M., The Biological Code, Frontiers of Biology, vol. 12 (1969), North-Holland Publication
[15] Youngman, E. M.; Green, R., Ribosomal Translocation: LepA Does it Backwards Current Biology, vol. 17, R136-R139 (2007)
[16] Zipf, G. K., Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology (1949), Addison Wesley Press
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.