Section 01
[Introduction] Theory of In-Context Continual Learning: Revealing Task Interference and Forgetting Mechanisms in Transformers
Original Authors and Source
- Original Author/Maintainer: arXiv Author Team
- Source Platform: arXiv
- Original Title: Understanding Generalization and Forgetting in In-Context Continual Learning
- Original Link: http://arxiv.org/abs/2605.28705v1
- Source Publication/Update Time: 2026-05-27
Core Insights
This paper proposes the first theoretical framework for in-context continual learning. By analyzing linear attention, it shows that standard attention mechanisms aggregate historical context uniformly, which causes interference between tasks. It also introduces a bias-variance-interference error decomposition and uses it to explain sequence sensitivity and performance degradation in long prompts.
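To make the interference claim concrete, here is a minimal toy sketch, not the paper's construction. It assumes in-context linear regression with isotropic Gaussian inputs, and it stands in for linear attention with the uniform average (1/n) * sum_i y_i * x_i over all context pairs. The function names `uniform_linear_attention_weights` and `interference_demo`, the dimension, and the sample counts are all hypothetical choices for illustration. Under these assumptions, a context containing only the new task recovers roughly the new task's weights, while a context that also contains an old task is pulled toward the average of the two weight vectors.

```python
import numpy as np


def uniform_linear_attention_weights(X, y):
    """Return (1/n) * sum_i y_i * x_i, a uniform average over all context pairs.

    Assumption: this stands in for a one-layer linear attention readout on
    linear-regression prompts; it is a toy model, not the paper's exact setup.
    """
    return (X * y[:, None]).mean(axis=0)


def interference_demo(seed=0, d=8, n_per_task=5000):
    """Compare a new-task-only context with a context that mixes two tasks."""
    rng = np.random.default_rng(seed)

    # Two hypothetical tasks, each defined by its own weight vector.
    w_old = rng.normal(size=d)
    w_new = rng.normal(size=d)

    # Isotropic Gaussian inputs, noiseless labels (assumed for simplicity).
    X_old = rng.normal(size=(n_per_task, d))
    X_new = rng.normal(size=(n_per_task, d))
    y_old = X_old @ w_old
    y_new = X_new @ w_new

    # Context holding only the new task.
    w_hat_new_only = uniform_linear_attention_weights(X_new, y_new)

    # Context holding the old task followed by the new task; the uniform
    # average weights every pair equally, regardless of which task it is from.
    X_all = np.vstack([X_old, X_new])
    y_all = np.concatenate([y_old, y_new])
    w_hat_mixed = uniform_linear_attention_weights(X_all, y_all)

    return {
        "err_new_only": float(np.linalg.norm(w_hat_new_only - w_new)),
        "err_mixed": float(np.linalg.norm(w_hat_mixed - w_new)),
        "gap_to_average": float(
            np.linalg.norm(w_hat_mixed - 0.5 * (w_old + w_new))
        ),
    }


if __name__ == "__main__":
    for name, value in interference_demo().items():
        print(f"{name}: {value:.3f}")
```

In this sketch, `err_mixed` stays large no matter how many examples are added, because the uniform average cannot separate the two tasks. The gap is an interference term rather than a variance term, which is the kind of distinction the bias-variance-interference decomposition is meant to capture.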