Section 01
Introduction: Multimodal and LLM Paper List — A Researcher's Daily arXiv Reading Tracking
This article introduces the open-source GitHub repository Multimodal-AND-Large-Language-Models maintained by Yangyi-Chen, which aims to address the reading dilemma caused by the explosion of papers in the AI field. By recording the author's daily reading trajectory on arXiv, it systematically tracks cutting-edge research in the intersection of multimodal and large language models. Its core value lies in its personalized and real-time characteristics, providing efficient literature screening and learning references for researchers, engineers, and beginners.