Learning inverse folding from millions of predicted structures

C Hsu, R Verkuil, J Liu, Z Lin, B Hie… - International …, 2022 - proceedings.mlr.press
We consider the problem of predicting a protein sequence from its backbone atom
coordinates. Machine learning approaches to this problem to date have been limited by the
number of available experimentally determined protein structures. We augment training data
by nearly three orders of magnitude by predicting structures for 12M protein sequences
using AlphaFold2. Trained with this additional data, a sequence-to-sequence transformer
with invariant geometric input processing layers achieves 51% native sequence recovery on …

[CITATION][C] Learning inverse folding from millions of predicted structures. bioRxiv (2022)

C Hsu, R Verkuil, J Liu, Z Lin, B Hie, T Sercu, A Lerer… - preprint, 2022