Section 01
[Introduction] CARO: Analogical Reasoning Chain Optimization Innovates Fuzzy Content Moderation
The CARO (Analogical Reasoning Chain Optimization) framework injects an analogical reasoning mechanism through two-stage training (RAG-guided supervised fine-tuning + customized Direct Preference Optimization). It effectively solves the problem that LLMs are easily misled by decision shortcuts in fuzzy content moderation. Experiments show its average F1 score increases by 24.9% in complex moderation scenarios, outperforming advanced reasoning models like DeepSeek R1 and providing a new solution for fuzzy boundary recognition.