Zing Forum


Reviser: A New Paradigm for Non-Autoregressive Text Generation via Cursor Editing Actions

Reviser proposes a revolutionary text generation method that abandons traditional left-to-right token generation, instead building text by having a cursor perform insert, move, and stop actions on an editable canvas. At the 300M-parameter scale, this edit-action generation paradigm significantly outperforms diffusion baselines such as SEDD and MDLM.

Tags: non-autoregressive generation · text editing · cursor actions · diffusion models · language models · text generation · iterative revision
Published 2026/05/06 08:41 · Last activity 2026/05/06 10:11 · Estimated reading time: 4 minutes
Section 01

Reviser: A New Paradigm for Non-Autoregressive Text Generation via Cursor Editing Actions

Reviser proposes a revolutionary text generation method that abandons traditional left-to-right token generation. Instead, it uses cursor actions (insert, move, stop) on an editable canvas to build text. This approach outperforms diffusion baselines like SEDD and MDLM at the 300M parameter scale, addressing the error accumulation issue of autoregressive models while balancing quality and flexibility.

Section 02

Background: Limitations of Traditional Text Generation Approaches

Mainstream large language models (e.g., GPT, Claude) use autoregressive, left-to-right generation, in which early errors cannot be revised and therefore accumulate over long texts. Non-autoregressive methods such as diffusion models and masked language models allow multiple revisions but struggle to balance quality and efficiency.
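To make the error-accumulation point concrete, here is a minimal sketch (not any model's actual API) of why autoregressive decoding cannot recover from an early mistake: each token is conditioned only on the prefix and is never revised once emitted.

```python
# Toy autoregressive decoding loop. `next_token` stands in for a
# language model's sampling step; the names here are illustrative.

def autoregressive_generate(next_token, max_len):
    seq = []
    for _ in range(max_len):
        tok = next_token(seq)  # conditioned only on the prefix so far
        if tok is None:        # model signals end of sequence
            break
        seq.append(tok)        # committed: no later action can revise it
    return seq

# A toy "model" that echoes a fixed continuation, token by token.
fixed = ["once", "upon", "a", "time"]
out = autoregressive_generate(
    lambda s: fixed[len(s)] if len(s) < len(fixed) else None, 10)
print(out)  # -> ['once', 'upon', 'a', 'time']
```

If the first token were sampled wrongly, every later step would condition on that wrong prefix; Reviser's cursor actions, by contrast, can return to earlier positions.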

Section 03

Core Innovation: Cursor-Based Editing Actions

Reviser defines three basic edit actions: Insert (add tokens at cursor), Move (shift cursor position), Stop (end editing). Though the final text is non-autoregressive, the edit actions are generated autoregressively—combining the stability of autoregressive models with the flexibility of non-autoregressive ones.

Section 04

Technical Implementation of Reviser

Reviser uses a decoder-only Transformer architecture. The input encodes the current canvas state and cursor position; the output is the next edit action. Training derives reasonable edit trajectories from target texts (via shortest edit distance) and maximizes the likelihood of these trajectories.
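A sketch of how such training trajectories might be derived, using Python's `difflib` alignment as a stand-in for the paper's shortest-edit-distance computation (the function name and action encoding are assumptions, not the authors' code):

```python
# Derive a MOVE/INSERT/STOP trajectory that turns `canvas` into
# `target`, by aligning the two and filling each gap in order.
import difflib

def derive_trajectory(canvas, target):
    actions = []
    offset = 0  # earlier inserts shift later canvas positions right
    sm = difflib.SequenceMatcher(a=canvas, b=target, autojunk=False)
    for tag, i1, i2, j1, j2 in sm.get_opcodes():
        if tag == "insert":  # target tokens missing from the canvas
            actions.append(("MOVE", i1 + offset))
            actions.append(("INSERT", target[j1:j2]))
            offset += j2 - j1
    actions.append(("STOP", None))
    return actions

traj = derive_trajectory(["the", "sat"],
                         ["the", "cat", "sat", "down"])
print(traj)
# -> [('MOVE', 1), ('INSERT', ['cat']),
#     ('MOVE', 3), ('INSERT', ['down']), ('STOP', None)]
```

Training then treats each `(canvas, cursor) -> action` step in such a trajectory as a supervised target for the decoder-only Transformer.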

Section 05

Experimental Results: Performance Comparison

At 300M parameters, Reviser outperforms the diffusion baselines SEDD and MDLM. It is competitive with same-size autoregressive models on C4 continuation (prefix completion). Evaluation metrics include EvalPPL (perplexity under an evaluator model), MAUVE (distribution matching), and trajectory statistics (length, modification frequency).
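For readers unfamiliar with EvalPPL-style metrics, a toy illustration: perplexity is the exponential of the mean per-token negative log-likelihood that an external evaluator LM assigns to the generated text. The probabilities below are made up for illustration.

```python
# Perplexity = exp(mean negative log-likelihood per token).
import math

def eval_ppl(token_probs):
    """Perplexity of a sequence given per-token probabilities."""
    nll = [-math.log(p) for p in token_probs]
    return math.exp(sum(nll) / len(nll))

# Equivalently, the inverse geometric mean of the probabilities.
print(eval_ppl([0.5, 0.25, 0.125]))  # -> 4.0
```

Lower EvalPPL means the evaluator model finds the generated text more fluent; MAUVE instead compares the distribution of generations against human text.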

Section 06

Applications and Open Source Resources

Reviser is suitable for interactive writing assistants, code generation/refactoring, and multi-round dialogue. Open resources: code structure (paper, src, configs, scripts, results, visualizations), 100M/300M pre-trained models on Hugging Face (sean-diab/reviser-checkpoints), and quick start commands for inference.

Section 07

Limitations and Future Directions

Current limitations: complex edit trajectories may reduce efficiency, training is more complex than traditional tasks, and models are small (max 300M). Future directions: learn optimal edit strategies, mix with autoregressive models, and extend to multi-modal tasks.