Learning the Language of Protein Structure

B Gaujac, J Donà, L Copoiu, T Atkinson… - arXiv preprint arXiv …, 2024 - arxiv.org
Representation learning and\emph {de novo} generation of proteins are pivotal
computational biology tasks. Whilst natural language processing (NLP) techniques have …

Balance of Number of Embedding and their Dimensions in Vector Quantization

H Chen, SS Reddy, Z Chen, D Liu - arXiv preprint arXiv:2407.04939, 2024 - arxiv.org
The dimensionality of the embedding and the number of available embeddings (also called
codebook size) are critical factors influencing the performance of Vector Quantization (VQ), a …