Section 01
CVPR 2026 Open Source: STC Framework Accelerates Streaming Video Large Models, Reducing ViT Encoding Latency by 24.5%
The EPIC Lab team from Shanghai Jiao Tong University open-sourced the STC framework. Using hierarchical token compression technology, it reduces the ViT encoding latency of streaming video understanding models by 24.5% and LLM pre-filling latency by 45.3% while maintaining 99% accuracy. This framework has been accepted by CVPR 2026 and fully open-sourced, suitable for real-time video scenarios such as live streaming, AR glasses, and surveillance.