Section 01
Introduction: oh-my-knowledge—A Scientific Evaluation Framework for LLM Knowledge Input
oh-my-knowledge is an open-source framework focused on evaluating the knowledge input of Large Language Models (LLMs). It provides systematic evaluation methods for prompts, RAG corpora, skills, and agent workflows, with built-in tools for statistical rigor (e.g., Bootstrap, Krippendorff Alpha) and debiasing mechanisms. Its core philosophy is to fix the model and vary the input to accurately measure the causal impact of knowledge input on model performance.