Section 01
[Introduction] Yeti: A Compact and Efficient Multimodal Protein Structure Tokenizer
Yeti is a protein structure tokenizer based on Lookup-Free Quantization (LFQ), achieving reconstruction accuracy comparable to ESM3 with only 1/10 the number of parameters, and demonstrating strong generative capabilities in a from-scratch trained multimodal model. It aims to address the core challenge of structural representation in multimodal protein AI, providing an efficient foundational component for protein design to move from prediction to creation.