Section 01
[Introduction] MMT-Bench: A Comprehensive Evaluation Benchmark for Multi-Task AGI Vision-Language Models
MMT-Bench is a large-scale benchmark for evaluating vision-language models, accepted at ICML 2024. Aimed at multi-task artificial general intelligence (AGI), it assesses model capabilities across multi-task scenarios such as cross-modal understanding, reasoning, and generation. Its goals are to address the limitations of existing evaluation benchmarks and to advance AGI research.