Section 01
[Introduction] LLM-Eval-Suite: An Open-Source Comprehensive Evaluation Framework for Large Language Models
This article introduces the open-source project LLM-Eval-Suite, a comprehensive evaluation framework for large language models that supports multiple prompt strategies and RAG retrieval strategies. It aims to address the problem of incomplete coverage of existing evaluation tools and provides a complete evaluation solution from traditional NLP metrics to LLM-based judgment. The project is maintained by Otniel Gomes, and its repository is at https://github.com/OtnielGomes/LLM-Eval-Suite.