|
My research at the EMBL and the Pasteur Institute was focussed on structural bioinformatics and structural genomics comprising methods from sequence analysis, modelling and molecular dynamics. Structural bioinformatics is concerned with computational approaches to predict and analyse the spatial structure of proteins and nucleic acids and to derive their function in the cell. Structural genomics is the systematic effort to gain a complete structural description of a defined set of molecules, ultimately for an organism’s entire proteome. Structural genomics projects apply X-ray crystallography and NMR spectroscopy in a high-throughput manner. In Michael Nilges' group, I was working on the ARIA software for automated NOE assignment to speed up structure determination from NMR data. The most important experimental parameter for the determination of biomolecular structures is the nuclear Overhauser effect (NOE). Although NOEs can only be measured between nuclei not farther apart than approximately 0.5 nm, they provide a wealth of interproton distances within and in between residues. Unfortunately, the interpretation of the NOE spectra is complicated by the high number of possible assignments, noise and heavy overlap. A substantial amount of time is therefore spent for interpretating and assigning NOE spectra. With increasingly bigger systems, a completely manual approach for the assignment of the NOEs is too cumbersome and time-consuming. Thus, automation of the NOE assignment and structure calculation process has become an important issue. ARIA leads to a substantial speed-up of structure calculation by automation of the NOE assignment process. Compared with a manual approach where initial structures are calculated based on a small fraction of the NOEs, the automated approach uses much more data to direct the calculation from the start. ARIA comprises a relaxation matrix refinement algorithm that corrects the target distances for spin diffusion in each round of automated NOE assignment. Spin diffusion corrected target distances allow the choice of smaller error bounds. Eventually, this leads to a clearer distinction between signal and noise. I was studying different non-bonded parameters of force fields for NMR structure calculation and their influence on the quality of the resulting NMR solution structures. The results clearly indicate that larger vdW radii than commonly employed in virtually all NMR structure calculation programs significantly improve the quality of the structures, if appropriate corrections for 1-4 interactions are employed. A hybrid PROLSQ/CSDX force field yields high quality structures. A refinement of biomolecular structures in a thin layer of explicit solvent improves the quality of biomolecular structures. To illustrate the refinement procedure, please have a look at a trajectory of an HRDC domain in water as mpg movie (6.3 MB) calculated with a hybrid CSDX/OPLS force field. The increased speed of structure determination necessary for the structural genomics projects make an independent validation of the structures particularly important, e.g., by comparison to expected properties. Structure validation helps to correct obvious errors and leads to a more standardized representation of structural data, e.g., by agreeing on a common atom name nomenclature. The knowledge of the structure quality is a prerequisite for further use of the structure in molecular modelling or drug design. As part of an effort to improve data representation and data exchange, I was working on XML representation of NMR data. StarDOM is a software package to transform data provided in the Self Defining Text Archival and Retrieval (STAR) format into XML. This opens new possibilities for visual editing, archiving, parsing and structured queries of structural biology data. This work eventually lead to a collaboration with the CCPN project which aims to establish a common standard for NMR data, spectrum and structure files. For further reading
please have a look at the following publications:
|
|||||
|
[
last changed:
01.06.2007
] [imprint]
|
|||||