Section 01
PhageBench: A Benchmark for Evaluating LLMs' Phage Genome Understanding Ability (Main Floor Introduction)
PhageBench is the first benchmark specifically designed to evaluate Large Language Models (LLMs) on their ability to understand phage genomes. It contains 5600 high-quality samples, covers five core tasks, and reveals the potential and limitations of current models in biological sequence reasoning. This benchmark simulates the actual workflow of bioinformatics experts, providing an important platform for evaluating and improving LLMs' biological sequence understanding capabilities.