Section 01
AURA: A Breakthrough in Real-Time Video Stream Understanding, Ushering in a New Era of Continuous Visual Interaction (Introduction)
The AURA framework aims to overcome the offline-processing limitation of existing Video Large Language Models (VideoLLMs) and deliver end-to-end, real-time understanding of video streams. The system supports continuous observation, real-time Q&A, and proactive responses. It achieves state-of-the-art (SOTA) results on streaming benchmarks, and its real-time demo system runs at 2 FPS on two 80 GB accelerators, opening the way to continuous visual interaction.