Section 01
[Main Floor] Introduction to Research on Sparse Visual Thinking Circuits in Vision-Language Models
This study focuses on the interpretability of Sparse Autoencoders (SAE) in Vision-Language Models (VLM), with the core question of whether SAE features can form modular, composable reasoning units. The research team developed a reproducible causal analysis pipeline, tested it on the Qwen3-VL-8B model, found that the modularity hypothesis often does not hold, and identified the non-modular circuit interference phenomenon, providing a diagnostic framework for VLM control.