物理化学学报 >> 2020, Vol. 36 >> Issue (1): 1907002.doi: 10.3866/PKU.WHXB201907002

所属专题: 庆祝唐有祺院士百岁华诞专刊

综述 上一篇    下一篇

蛋白质结构的“限域下最低能量结构片段”假说与蛋白质进化的“石器时代”

曹傲能*()   

  • 收稿日期:2019-07-01 录用日期:2019-08-22 发布日期:2019-09-02
  • 通讯作者: 曹傲能 E-mail:ancao@shu.edu.cn
  • 作者简介:曹傲能,1969年出生。1989年在浙江大学获学士学位,1995年在浙江大学获硕士学位,1998年在北京大学获博士学位。现上海大学教授。主要研究方向为蛋白质折叠、人工蛋白质设计、纳米生物医学
  • 基金资助:
    国家自然科学基金(31871007);国家自然科学基金(31571024);国家重点研发计划(2016YFA0201600)

"Confined Lowest Energy Structure Fragments (CLESFs)" Hypothesis for Protein Structure and the "Stone Age" of Protein Prebiotic Evolution

Aoneng Cao*()   

  • Received:2019-07-01 Accepted:2019-08-22 Published:2019-09-02
  • Contact: Aoneng Cao E-mail:ancao@shu.edu.cn
  • Supported by:
    the National Natural Science Foundation of China(31871007);the National Natural Science Foundation of China(31571024);the National Key Research and Development Plan of China(2016YFA0201600)

摘要:

蛋白质折叠问题被称为第二遗传密码,至今未破译;蛋白质序列的天书仍然是“句读之不知,惑之不解”。在最近工作的基础上,我们提出了蛋白质结构的“限域下最低能量结构片段”假说。这一假说指出,蛋白质中存在一些关键的长程强相互作用位点,这些位点相当于标点符号,将蛋白质序列的天书变成可读的句子(多肽片段)。这些片段的天然结构是在这些强长程相互作用位点限域下的能量最低状态。完整的蛋白质结构由这些“限域下最低能量结构片段”拼合而成,而蛋白质整体结构并不一定是全局性的能量最低状态。在蛋白质折叠过程中,局部片段的天然结构倾向性为强长程相互作用的形成提供主要基于焓效应的驱动力,而天然强长程相互作用的形成为局部片段的天然结构提供主要基于熵效应的稳定性。在蛋白质进化早期,可能存在一个“石器时代”,即依附不同界面(比如岩石)的限域作用而稳定的多肽片段先进化出来,后由这些片段逐步进化(包括拼合)而成蛋白质。

关键词: 蛋白质折叠, 多肽, 生命起源前, 杂合体, 人工蛋白质, 金抗体

Abstract:

The protein folding problem is regarded as the second genetic code which has yet to be deciphered. To date, Anfinsen's thermodynamic hypothesis, i.e., the native structure of a protein is its most stable state, is the only generally accepted theory for protein folding, although exceptions have been reported. However, this hypothesis is a simple overall statement, with no information regarding where or how a protein is folded. The mechanism underlying protein folding has not yet been elucidated, and it is still not clear how the overall sequence (context) determines the structure of a protein. Based on our recent study, we propose a "Confined Lowest Energy Structure Fragments" (CLESFs) hypothesis. This hypothesis states that proteins are CLESFs joined together by a small number of strong constraints (key long-range interactions). Although the native structure of a protein contains various long-range interactions between amino acids that are far apart in the sequence, only a few strong interactions, such as disulfide bonds, hydrophobic packing, structural ion coordination as in zinc fingers, and hydrogen-bonding networks within beta-sheets, are critical. These key long-range interactions serve as a form of punctuation in the "language" of protein sequence and divide the protein sequence into different "sentences, " i.e., fragments (CLESFs). The local native structures of these CLESFs are the lowest energy structures under the confinements of those key long-range interactions, but the overall protein structure is not necessarily the global minimum as Anfinsen hypothesized. The same fragment may adopt different native structures in different proteins. Each native structure of the same fragment in a different protein is a local minimum for the free fragment and the "global minimum" for the fragment under the specific confinement in the specific protein. Essentially, the native local structures of the CLESFs have an enthalpic advantage (local minimum) which serves as a driving force to form the key long-range interactions; the key long-range interactions stabilize the native local structures with entropy effects by excluding enormous amount of random conformations possible for the fragments. Our CLESFs hypothesis suggests that the protein folding code is not as mysterious as previously thought. Only a few critical long-range interactions have principal influence on the local structures of protein fragments. This is why protein fragments can be grafted onto different proteins, and even more notably, can be grafted onto gold nanoparticles to form a "goldbody". Given that short peptides are generally flexible, and flexible peptides are usually unstable and inactive, it is still a mystery how proteins, i.e., peptides that are long enough to fold into unique structures, evolved in the first place. The CLESFs hypothesis implies that prior to the appearance of the first protein that was long enough to fold into a unique stable structure, there might have been a "Stone Age" during prebiotic protein evolution. At that time, short peptides that could not fold by themselves might have been able to adopt active conformations with a few strong anchors to the surface of "stones", such as rocks, solid particles, or vesicles in the primitive soup, forming CLESFs and gaining an evolutionary advantage against degradation. Later, multiple CLESFs on the same "stone" might have assembled in certain ways to perform more complicated functions, and finally, the first protein might have emerged when individual CLESFs joined together and left the "stone".

Key words: Protein folding, Peptide, Prebiotic, Hybrid, Artificial protein, Goldbody