Section 01
[Introduction] TAG-Head: A Lightweight Graph Neural Network Head for Fine-Grained Action Recognition Using Only RGB Videos
A paper accepted at ICPR 2026 introduces TAG-Head, a plug-and-play spatiotemporal graph head. Attached to a standard 3D backbone, it adapts the network for fine-grained action recognition from RGB video alone, with no additional modalities, and the paper reports that it outperforms multimodal methods on multiple benchmarks. The module is lightweight and integrates into mainstream architectures such as SlowFast and R(2+1)D-34.