Section 01
RespondeoQA: The First Latin-English Bilingual Question Answering Benchmark Dataset Released (Introduction)
RespondeoQA is the first question answering benchmark dataset focused on Latin, containing approximately 7800 Latin-English bilingual question-answer pairs covering various types such as knowledge-based, skill-based, multi-hop reasoning, and translation-constrained questions. The research team evaluated LLaMa 3, Qwen QwQ, and o3-mini and found that current large models perform poorly on Latin skill-based questions, providing a crucial resource for model capability assessment in this domain.