Entropy Analysis of Protein Sequences Reveals a Hierarchical Organization

Background: In earlier analyses of local sequence content in proteins, we found that amino acid residue frequencies differ at various distances between positions in the sequence, which suggests the existence of structural units. Methods: We used the informational entropy of protein sequences to show that the structural unit of proteins is a block of adjacent amino acid residues, termed an “information unit”. The ANIS (ANalysis of Informational Structure) method uses these information units to reveal hierarchically organized Elements of the Information Structure (ELIS) in amino acid sequences. Results: The developed mathematical apparatus gives stable results for the description of structural units even when the parameters vary significantly. The optimal length of the information unit is five residues, and the optimal number of allowed substitutions is one. Examples are given of applying the method to the design of protein molecules, the analysis of intermolecular interactions, and the study of the mechanisms by which protein molecular machines function. Conclusions: The ANIS method makes it possible not only to analyze native proteins but also to design artificial polypeptide chains with a given spatial organization and, possibly, function.
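
The abstract's two parameters (a block of five adjacent residues and one allowed substitution) can be illustrated with a minimal sketch. This is not the ANIS method itself, whose mathematical apparatus is not described in this record. It only shows, under stated assumptions, how a sequence can be cut into overlapping blocks, how the Shannon entropy of the observed blocks can be computed, and how two blocks can be compared with a substitution tolerance. All function names and the toy sequence are hypothetical.

```python
import math
from collections import Counter


def blocks(seq, k=5):
    """Return all overlapping blocks of k adjacent residues (k=5 per the abstract)."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def shannon_entropy(items):
    """Shannon entropy, in bits, of the empirical distribution of items."""
    counts = Counter(items)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def within_substitutions(a, b, max_subs=1):
    """True if equal-length blocks a and b differ at no more than max_subs positions."""
    return len(a) == len(b) and sum(x != y for x, y in zip(a, b)) <= max_subs


def block_entropy(seq, k=5):
    """Entropy of the length-k blocks observed in seq (an illustrative
    stand-in, not the entropy measure defined by the authors)."""
    return shannon_entropy(blocks(seq, k))


if __name__ == "__main__":
    toy = "MKTAYIAKQRQISFVKSHFSRQ"  # hypothetical toy sequence
    print(len(blocks(toy)), "blocks of length 5")
    print("block entropy (bits):", block_entropy(toy))
    print(within_substitutions("MKTAY", "MKTAF"))  # blocks differing at one position
```

The substitution check is kept separate from the entropy calculation because the record does not say how the two are combined in ANIS.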

Authors
Anashkina Anastasia A.1, Petrushanko Irina Yu.1, Ziganshin Rustam H.2, Orlov Yuriy L.3,4, Nekrasov Alexei N.2
Journal
Issue number
12
Language
English
Pages
1647
Status
Published
Department
Agrarian and Technological Institute
Volume
23
Year
2021
Organizations
  • 1 Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov St. 32, 119991 Moscow, Russia
  • 2 Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia
  • 3 The Digital Health Institute, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), Trubetskaya 8-2, 119991 Moscow, Russia
  • 4 Agrarian and Technological Institute, Peoples’ Friendship University of Russia (RUDN University), Miklukho-Maklaya Str. 6, 117198 Moscow, Russia
Keywords
protein structure; hierarchy; protein sequences; ANIS method; informational structure; protein design; foldon; peroxiredoxin; interleukin 13; hydrolases; oligopeptidase B; TNF; HSP70; carboxypeptidase; hem-containing proteins