Section 01
Introduction: Activation Replay, a Training-Free Method for Enhancing the Reasoning Capabilities of Multimodal Large Models
A team from the National University of Singapore proposed Activation Replay, a technique that manipulates visual tokens at test time to replay low-entropy activations from the base model into the model trained with reinforcement learning with verifiable rewards (RLVR). It achieves significant improvements on tasks such as mathematical reasoning, visual agents, and video reasoning without any additional policy optimization training. The method opens up a new path for enhancing the reasoning capabilities of multimodal large models.