Section 01
ALUE Framework: A Standardized Solution for LLM Evaluation in the Aerospace Domain
MITRE's ALUE (Aerospace Language Understanding Evaluation) framework provides a standardized solution for evaluating large language models (LLMs) in the aerospace domain. This framework fills the gap in vertical domain model evaluation, supporting local GPU inference, remote API calls (such as TGI and OpenAI-compatible endpoints), custom datasets, and quantitative metrics to facilitate scientific evaluation and selection of models in the domain.