Section 01
Research on Compositional Generalization Ability: A Systematic Cognitive Exploration of Transformer Models (Introduction)
This article provides an in-depth interpretation of the compgen-reasoning project, exploring systematic research on Transformer models in Compositional Generalization, and revealing the mechanisms and limitations of large language models in understanding compositional concepts. Compositional Generalization is an important indicator of AI cognitive ability, examining whether models can combine learned simple concepts into complex new ones like humans do; current large models face a sharp performance drop when dealing with completely new combinations, and this project analyzes the causes and improvement directions through experiments.