January 4, 2005
Similar papers 2
October 24, 2011
The similarity in the three-dimensional structures of homologous proteins imposes strong constraints on their sequence variability. It has long been suggested that the resulting correlations among amino acid compositions at different sequence positions can be exploited to infer spatial contacts within the tertiary protein structure. Crucial to this inference is the ability to disentangle direct and indirect correlations, as accomplished by the recently introduced Direct Coupl...
March 3, 2014
Correlation patterns in multiple sequence alignments of homologous proteins can be exploited to infer information on the three-dimensional structure of their members. The typical pipeline to address this task, which we in this paper refer to as the three dimensions of contact prediction, is to: (i) filter and align the raw sequence data representing the evolutionarily related proteins; (ii) choose a predictive model to describe a sequence alignment; (iii) infer the model para...
April 13, 2004
We show that the contact map of the native structure of globular proteins can be reconstructed starting from the sole knowledge of the contact map's principal eigenvector, and present an exact algorithm for this purpose. Our algorithm yields a unique contact map for all 221 globular structures of PDBselect25 of length $N \le 120$. We also show that the reconstructed contact maps allow in turn for the accurate reconstruction of the three-dimensional structure. These results in...
June 11, 2019
Given native 2D contact map, protein 3D structure could be reconstructed with accuracy of 2A or better, and such reconstruction is a feasible computational approach for protein folding problem. The prediction accuracy from traditional methods is generally too poor to useful, but the recent deep learning model has significantly improved the accuracy. In this study, we proposed a neural network model comprising a bi-directional recurrent neural network and artificial neural net...
May 28, 2019
Native contacts between residues could be predicted from the amino acid sequence of proteins, and the predicted contact information could assist the de novo protein structure prediction. Here, we present a novel pipeline of a residue contact predictor AmoebaContact and a contact-assisted folder GDFold for rapid protein structure prediction. Unlike mainstream contact predictors that utilize human-designed neural networks, AmoebaContact adopts a set of network architectures tha...
October 4, 2013
Mapping between sequence and structure is currently an open problem in structural biology. Despite many experimental and computational efforts it is not clear yet how the structure is encoded in the sequence. Answering this question may pave the way for predicting a protein fold given its sequence. My doctoral studies have focused on a particular phenomenon relevant to the protein sequence-structure relationship. It has been observed that many proteins having apparently dis...
November 25, 2015
Despite the significant increase in computational power, molecular modeling of protein structure using classical all-atom approaches remains inefficient, at least for most of the protein targets in the focus of biomedical research. Perhaps the most successful strategy to overcome the inefficiency problem is multiscale modeling to merge all-atom and coarse-grained models. This chapter describes a well-established CABS coarse-grained protein model. The CABS (C-Alpha, C-Beta and...
June 19, 2023
Recently, we presented a framework for understanding protein structure based on the idea that simple constructs of holding hands or touching of objects can be used to rationalize the common characteristics of globular proteins. We developed a consistent approach for understanding the formation of the two key common building blocks of helices and sheets as well as the compatible assembly of secondary structures into the tertiary structure through the notion of poking pairwise ...
December 1, 2017
This chapter gives a graceful introduction to problem of protein three- dimensional structure prediction, and focuses on how to make structural sense out of a single input sequence with unknown structure, the 'query' or 'target' sequence. We give an overview of the different classes of modelling techniques, notably template-based and template free. We also discuss the way in which structural predictions are validated within the global com- munity, and elaborate on the extent ...
June 1, 2004
With the aim to study the relationship between protein sequences and their native structures, we adopt vectorial representations for both sequence and structure. The structural representation is based on the Principal Eigenvector of the fold's contact matrix (PE). As recently shown, the latter encodes sufficient information for reconstructing the whole contact matrix. The sequence is represented through a Hydrophobicity Profile (HP), using a generalized hydrophobicity scale t...