Prediction of TF target sites based on atomistic models of protein-DNA complexes
BMC bioinformatics, 2008•Springer
Background The specific recognition of genomic cis-regulatory elements by transcription
factors (TFs) plays an essential role in the regulation of coordinated gene expression.
Studying the mechanisms determining binding specificity in protein-DNA interactions is thus
an important goal. Most current approaches for modeling TF specific recognition rely on the
knowledge of large sets of cognate target sites and consider only the information contained
in their primary sequence. Results Here we describe a structure-based methodology for …
factors (TFs) plays an essential role in the regulation of coordinated gene expression.
Studying the mechanisms determining binding specificity in protein-DNA interactions is thus
an important goal. Most current approaches for modeling TF specific recognition rely on the
knowledge of large sets of cognate target sites and consider only the information contained
in their primary sequence. Results Here we describe a structure-based methodology for …
Background
The specific recognition of genomic cis-regulatory elements by transcription factors (TFs) plays an essential role in the regulation of coordinated gene expression. Studying the mechanisms determining binding specificity in protein-DNA interactions is thus an important goal. Most current approaches for modeling TF specific recognition rely on the knowledge of large sets of cognate target sites and consider only the information contained in their primary sequence.
Results
Here we describe a structure-based methodology for predicting sequence motifs starting from the coordinates of a TF-DNA complex. Our algorithm combines information regarding the direct and indirect readout of DNA into an atomistic statistical model, which is used to estimate the interaction potential. We first measure the ability of our method to correctly estimate the binding specificities of eight prokaryotic and eukaryotic TFs that belong to different structural superfamilies. Secondly, the method is applied to two homology models, finding that sampling of interface side-chain rotamers remarkably improves the results. Thirdly, the algorithm is compared with a reference structural method based on contact counts, obtaining comparable predictions for the experimental complexes and more accurate sequence motifs for the homology models.
Conclusion
Our results demonstrate that atomic-detail structural information can be feasibly used to predict TF binding sites. The computational method presented here is universal and might be applied to other systems involving protein-DNA recognition.
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果