Section 01
【Main Floor/Introduction】MADS: A Model-Aware Neural Activation-Based Core Set Selection Method for Instruction Fine-Tuning
Researchers propose the MADS method, which selects a diverse core training set by analyzing the neural activation states of large language models (LLMs) during inference. Using only 15% of the data, it outperforms full-data training on multiple benchmarks and demonstrates good model scale transferability. This article will introduce it from aspects such as background, method, experiments, and value.