Section 01
Introduction: HOI-MLLM—A New Breakthrough in Open-World Human-Object Interaction Detection
The HOI-MLLM project combines multimodal large language models (MLLMs) with chain-of-thought (CoT) reasoning to achieve open-world human-object interaction (HOI) detection, breaking through the limitations of traditional methods in understanding complex scenarios. Developed and open-sourced by jasminethurder, this project represents an important attempt to advance HOI research toward generality and flexibility.