Structural Bioinformatics and Computer Aided Design of Novel Drugs and Functional Proteins. Lin Li and Dongqing Wei Department of Bioinformatics College of Life Sciences and Biotechnology Room 4211, Life Science Building Shanghai Jiaotong University 800 Dongchuan Road, Minhang District, Shanghai, 200240, China email:
[email protected] Genome projects[1] are yielding protein sequences for which there is no knowledge about function or conformation. The draft human genome has now been available and the exploitation of this unique source of knowledge is a major challenge for biology. In parallel with these sequencing projects, there are structural genomics initiatives involving the determination of the conformations of uncharacterized proteins in the genomes with the particular aim of determining function. In addition, gene expression arrays are providing a mass of data that require analysis to relate protein sequences to activity under different conditions and in different cellular locations. Knowledge of the structure and function of the relevant proteins, revealed by crystallography and NMR, is central to the interpretation and exploitation of this pool of biological information. Threedimensional structure can guide further experiments to probe activity and direct the systematic design of therapeutic agents for diseases. The experimental determination of protein structure remains difficult. There are large disparities between the numbers of protein sequences. Protein modeling provides a valuable approach to maximize the biological knowledge that can be obtained given these disparities. Accordingly, the task of the Structural Bioinformatics is the development of protein modeling algorithms and the application of the technology to systems of interests. Methodologies based on the known protein structures from experiments have been developed to predict 3D structures from a 1D sequence information, which is known as the homology modeling. It has become a reliable tool as long as the homologue with experimental 3D structure of significant similarity(usually greater than 35%) could be found. Other methods, for example, ab initio molecular dynamics, is still too slow to predict structure of a real protein. Often the structures of many drug targets are not available. The structural bioinformatics tools are needed to generate them readily to initiate drug design processes. We have predicted many protein structures using homology modeling to facilitate the computer aided drug design, which include database screening, pharmacophore search, docking and molecular dynamics conformation search. To join the worldwide efforts[17] against H5N1 viruses which experience rapid mutation and become increasingly drugresistant. A homology model of the H5N1NA[8] from the highly pathogenic chicken H5N1 A viruses isolated during the 2003– 2004 influenza outbreaks in Japan was built based on the crystal structure of N9NA complexed with DANA (PDB code: 1F8B). It was found that the traditional constituent
residues around the active site of NA family are highly conserved in the H5N1NA. However, a partially lipophilic pocket composed by Ala248 and Thr249 in N9NA becomes a hydrophilic pocket because the two residues in the H5N1NA are replaced by hydrophilic residues Ser227 and Asn228, respectively. On the other hand, two hydrophilic residues Asn347 and Asn348 in the N9NA are replaced by two lipophilic residues Ala323 and Tyr324 in the H5N1NA, respectively, leading to the formation of a new lipophilic pocket. This kind of subtle variation not only destroys the original lipophilic environment but also changes the complement interaction between the H5N1NA and DANA. Such a finding might provide insights into the secret why some of H5N1 strains bear high resistance for existing NA inhibitors, and stimulate new strategies for designing new drugs against these viruses. Cytochrome P450 2C19 (CYP2C19)[9] is a member of the cytochrome P450 enzyme superfamily and plays an important role in the metabolism of drugs. In order to gain insights for developing personalized drugs, the 3D (dimensional) structure of CYP2C19 has been developed based on the crystal structure of CYP2C9 (PDB code 1R90), and its structure– activity relationship with the ligands of CEC, Fluvoxamine, Lescol, and Ticlopidine investigated through the structure–activity relationship approach. By means of a series of docking studies, the binding pockets of CYP2C19 for the four compounds are explicitly defined that will be very useful for conducting mutagenesis studies, providing insights into personalization of drug treatments and stimulating novel strategies for finding desired personalized drugs. NAD(P)Hdependent Dxylose reductase[10] is a homodimeric oxidoreductase that belongs to the aldo–keto reductase superfamily. The enzyme has the special function to catalyze the first step in the assimilation of xylose into yeast metabolic pathways. Performing this function via reducing the open chain xylose to xylitol, the xylose reductase of Pichia stipitis is one of the most important enzymes that can be used to construct recombinant Saccharomyces cerevisiae strain for utilizing xylose and producing alcohol. To investigate into the interaction mechanism of the enzyme with its ligand NAD and NADP, the 3D structure was developed for the NAD(P)Hdependent Dxylose reductase from P. stipitis. With the 3D structure, the molecular docking operations were conducted to find the most stable bindings of the enzyme with NAD and NADP, respectively. Based on these results, the binding pockets of the enzyme for NAD and NADP have been explicitly defined. It has been found that the residues in forming the binding pockets for both NAD and NADP are almost the same and mainly hydrophilic. These findings may be used to guide mutagenesis studies, providing useful clues to modify the enzyme to improve the utilization of xylose for producing alcohol. Also, because human aldose reductases have the function to reduce the open chain form of glucose to sorbitol, a process physiologically significant for diabetic patients at the time that their blood glucose levels are elevated, the information gained through this study may also stimulate the development of new strategies for therapeutic treatment of diabetes. Eventually quantum chemical tools, such as, QM/MM studies will be extremely useful to study details of biological catalysis involved. Good efforts have been made by Prof. Hong Guo’s group[1014] in elucidating the catalysis mechanism of many important enzymatic reactions using QM/MM approach. It is expected that it will become an accurate and predictive tool in the process of designing novel drugs and functional proteins.
References 1.
Chou, K.C. (2004). Structural bioinformatics and its impact to biomedical science. Current Medicinal Chemistry, 11, 21052134.
2.
Du QS, Wang SQ, Zhu Y, Wei DQ, Guo H, Sirois S and Chou KC4(2004), “Polyprotein cleavage mechanism of SARS CoC Mpro and chemical modification of the octapeptide”,. Peptides 25: 1857186. Chou, K. C., Wei, D. Q. & Zhong, W. Z. (2003). Binding mechanism of coronavirus main proteinase with ligands and its implication to drug design against SARS. (Erratum: ibid., 2003, Vol.310, 675). Biochem Biophys Res Comm 308, 148151. Sirois, S., Hatzakis, G. E., Wei, D. Q., Du, Q. S. & Chou, K. C. (2005). Assessment of chemical libraries for their druggability. Computational Biology & Chemistry 29, 5567. Wei, D. Q., Sirois, S., Du, Q. S., Arias, H. R. & Chou, K. C. (2005). Theoretical studies of Alzheimer's disease drug candidate [(2,4dimethoxy) benzylidene]anabaseine dihydrochloride (GTS21) and its derivatives. Biochem. Biophys. Res. Commun. 338, 10591064. Sirois, S., Wei, D. Q., Du, Q. S. & Chou, K. C. (2004). Virtual Screening for SARSCoV Protease Based on KZ7088 Pharmacophore Points. J. Chem. Inf. Comput. Sci. 44, 1111 1122. Wei, D. Q., S. Sirois, et al. (2005). "Theoretical studies of Alzheimer's disease drug candiate [(2,4dimethoxyl)benzylidene]anabaseine dihydrochloride (GTS21) and its derivatives." Biochem. Biophys. Res. Commun. 338: 10591064. Wei, D. Q., Q. S. Du, et al. (2006). "Insights from modeling the 3D structure of H5N1 influenza virus neuraminidase and its binding interactions with ligands." Biochem Biophys Res Commun 344: 10481055. Wang, J. F., D. Q. Wei, et al. (2007). "3D structure modeling of cytochrome P450 2C19 and its implication for personalized drug design." Biochem. Biophys. Res. Commun. 355: 513519. Wang, J. F., D. Q. Wei, et al. (2007) “Insights from modeling the 3D structure of NAD(P)Hdependent Dxylose reductase of Pichia stipitis and its binding interactions with NAD and NADP”, Biochemical and Biophysical Research Communications, 359, 323329. Guo HB. and Guo H.(2007), “Mechanism of histone methylation catalyzed by protein lysine methyltransferase SET7/9”, PNAS, 104, 87978802. Guo H, Beahm R and Guo H (2004) “Stabilization and Destabilization of the CdH∙∙O=C Hydrogen Bonds Involving Proline Residues in Helices”, J. Phys. Chem. B 108: 18053– 18064. Xu Q and Guo H(2004), “Quantum Mechanical/Molecular Mechanical Molecular Dynamics Simulations of Cytidine Deaminase: from Stabilization of TransitionState Analogs to Catalytic Mechanisms”,. J. Phys. Chem. B 108: 24772483. Guo H, Cui Q, Lipscomb WN and Karplus M(2003), “Understanding the Role of Active Site Residues in Chorismate Mutase Catalysis from Molecular Dynamic Simulations”, Angew. Chem. Int. Ed. 42: 15081511.
3.
4. 5.
6.
7.
8.
9.
10.
11. 12.
13.
14.