Section 01
RiM: Unlocking LLM Working Memory for Efficient Implicit Reasoning
Reasoning in Memory (RiM) is a novel approach that replaces autoregressive Chain of Thought (CoT) steps with fixed memory blocks, enabling LLMs to perform implicit reasoning like humans using working memory. It addresses key inefficiencies of CoT: high computational cost, coupling of internal reasoning and external communication, context length constraints, and training-inference inconsistency.