Section 01
[Introduction] Multimodal Fusion + LLM Empowerment: A New Intelligent Medical Solution for Depression Detection
This project combines facial expression features with the text-processing capabilities of large language models (LLMs) to build a bimodal depression detection system. By fusing visual and language information, it assesses depression severity on the E-DAIC dataset more accurately than unimodal methods do, pointing to a new direction for AI-assisted medical diagnosis.
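The introduction does not say how the two modalities are fused, so the following is only a rough sketch of one common option: feature-level fusion by concatenation, followed by a linear regression head. The function names (`fuse_features`, `predict_severity`), the toy feature dimensions, and the weights are all hypothetical and are not taken from the project.

```python
import numpy as np

def fuse_features(visual: np.ndarray, text: np.ndarray) -> np.ndarray:
    """Feature-level fusion (assumed, not the project's confirmed method):
    L2-normalize each modality so neither dominates, then concatenate."""
    eps = 1e-8  # guards against division by zero for all-zero vectors
    v = visual / (np.linalg.norm(visual) + eps)
    t = text / (np.linalg.norm(text) + eps)
    return np.concatenate([v, t])

def predict_severity(fused: np.ndarray, weights: np.ndarray, bias: float) -> float:
    """Hypothetical linear head mapping the fused vector to a severity score."""
    return float(fused @ weights + bias)

# Toy stand-ins: a 2-d facial expression vector and a 3-d LLM text embedding.
visual_feat = np.array([3.0, 4.0])
text_feat = np.array([0.0, 2.0, 0.0])

fused = fuse_features(visual_feat, text_feat)
score = predict_severity(fused, weights=np.ones(fused.shape[0]), bias=0.0)
```

In a real system the visual vector would come from a facial expression encoder, the text vector from an LLM, and the head would be trained on labeled severity scores. Decision-level fusion, where each modality's own prediction is averaged or weighted, is an alternative design.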