Abstract
The information capacity in frequency dictionaries of nucleotide sequences is estimated through the efficiency of reconstruction of a longer frequency dictionary from a short one. This reconstruction is performed by the maximum entropy method. Real nucleotide sequences are compared to random ones (with the same composition of nucleotides). Phages genes from NCBI bank were analyzed. The reliable difference of real genetic texts from random sequences is observed for the dictionary length q = 2, 5 and 6.
Similar content being viewed by others
Bibliography
H. P. Yockey, Information Theory and Molecular Biology, Cambridge Univ. Press, N.Y., 1992.
A. A. Alexandrov, et al.: Computer Analysis of Genetic Texts (in Russian), Nauka, Moscow, 1990.
S. Karlin and L. R. Cardon, Ann. Rev. Microbiol. 48, 619 (1994).
A. K. Konopka, in D. Smith, ed., Biocomputings: Informatics and Conome Projects, Acad. Press, San Diego, p. 119, 1995.
A. K. Konopka, in: R. A. Meyers, ed., Molecular Biology and Biotechnology, VCH Publishers, Weinheim, p. 888, 1995.
P. W. Garden, J. Theor. Biol. 82, 679 (1980).
V. Brcndel, J. S. Beckmann, and E. N. Trifonov, J. Biomol. Struct. Dyn. 4, 11 (1986).
P. A. Pevzner, M. Yu. Borodovski, and A. A. Mironov, J. Biomol. Struct. Dyn. 6, 1013 (1989).
M. A. Roytberg, in: S. Gindikin, ed., Biosystems, AMS, Providence, p. 103, 1992.
C. Martingale and A. K. Konopka, Computers and Chemistry 20, 45 (1996)
A. K. Konopka and C. Martingale, Science 268, 1789 (1992).
E. M. Mirkes, T. G. Popova, and M. G. Sadovsky, Adv. in Modelling and Analysis, ser. B 27, 1 (1993).
A. N. Kolmogorov, Dokl. AN SSSR 65, 793 (1949)
J. Kirkwood and E. Boggs, J. Chem. Phys. 10, 394 (1942).
R. Balescu, Equilibrium and Nonequilibrium Statistical Mechanics, John Wiley & Sons, New York-London-Sidney-Toronto, 1975.
N. N. Bugaenko, A. N. Gorban, and I. V. Karlin, Teoret. i mat fizika 88, 430 (1991); (english translation: Theoretical and Mathematical Physics, Plenum Publ. Corp., p. 977, 1992).
T. G. Popova and M. G. Sadovsky, Advances in Modelling & Analysis, ser. A 22, 13 (1994).
E. M. Mirkes, T. G. Popova, and M. G. Sadovsky, Advances in Modelling & Analysis, ser. B 27, 11 (1993).
T. G. Popova and M. G. Sadovsky, Modelling, Measurement & Control, ser. C 45, 27 (1994).
N. N. Bugaenko, A. N. Gorban, and M. G. Sadovsky, Molecular Biology 30, 313 (1996).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Bugaenko, N.N., Gorban, A.N. & Sadovsky, M.G. Maximum Entropy Method in Analysis of Genetic Text and Measurement of its Information Content. Open Systems & Information Dynamics 5, 265–278 (1998). https://doi.org/10.1023/A:1009637019316
Issue Date:
DOI: https://doi.org/10.1023/A:1009637019316