Section 01
AVGen-Bench: First Task-Driven Evaluation Benchmark for Text-to-Audio-Visual Generation Released
Microsoft Research Team has released AVGen-Bench, the first comprehensive evaluation benchmark for text-to-audio-visual (T2AV) generation tasks. This benchmark addresses the fragmentation issue in existing evaluations, reveals common semantic controllability flaws in current T2AV models through a multi-granularity framework, and has open-sourced the code and dataset (link: http://aka.ms/avgenbench).