Section 01
NEO Series: Introduction to Building Native Vision-Language Models from First Principles
The NEO series project launched by EvolvingLMMs-Lab explores building native vision-language models from first principles. Unlike traditional 'post-added' VLM architectures, it aims to fundamentally integrate visual perception and language understanding, providing a brand-new technical path for multimodal AI research. The project is open-source and has significant research and application value.