Section 01
POINTS-Seeker: Training a Multimodal Agent Search Model from Scratch (Introduction)
This article introduces POINTS-Seeker-8B, a multimodal agent search model trained from scratch. By establishing the foundation of agent behavior through the Agentic Seeding phase and combining V-Fold history compression technology to solve the bottleneck of long-range interaction, it achieves breakthroughs in long-range knowledge-intensive visual reasoning and attains state-of-the-art performance in six benchmark tests.