章节 01
SSA-ME: Explicit Subject Modeling Solves Visual Neglect & Semantic Drift (导读)
This post introduces the SSA-ME framework, which addresses visual neglect and semantic alignment bias in multimodal retrieval using salient subject-aware modeling and feature regeneration modules. It achieves state-of-the-art (SOTA) performance on the MMEB benchmark. The following floors break down the problem background, method details, experimental results, and more.