Zing Forum


RePAIR: Interactive Machine Unlearning, Empowering Users to Control the Knowledge Boundaries of Large Models

This article introduces the RePAIR framework, which implements a new paradigm of Interactive Machine Unlearning (IMU). Users can instruct the model to forget specific knowledge during inference via natural language commands. The core STAMP method guides MLP activations to a rejection subspace through pseudoinverse updates, enabling efficient, on-device knowledge deletion without retraining.

Tags: RePAIR, machine unlearning, interactive unlearning, user control, STAMP, privacy protection, model repair, on-device computation
Published 2026-04-14 22:44 · Recent activity 2026-04-15 09:55 · Estimated read 5 min

Section 01

Introduction

This article introduces the RePAIR framework and its new paradigm of Interactive Machine Unlearning (IMU): users instruct the model to forget specific knowledge during inference via natural language commands. The core STAMP method guides MLP activations into a rejection subspace through pseudoinverse updates, enabling efficient, on-device knowledge deletion without retraining. This addresses the selective-unlearning challenge for large models and returns control over data to users.


Section 02

Background: Memory Dilemmas of Large Models and Limitations of Existing Methods

Large models absorb massive amounts of data during training and can pick up harmful knowledge (e.g., instructions for making dangerous items), misinformation (pseudoscientific advice), and personal private information, yet they lack a selective unlearning mechanism. Existing machine unlearning methods are provider-centric, requiring retraining or complex post-processing; ordinary users cannot independently control whether their data is forgotten, which raises privacy and ethical issues.


Section 03

Methodology: Interactive Machine Unlearning Paradigm and System Architecture

RePAIR proposes the Interactive Machine Unlearning (IMU) paradigm, where users trigger unlearning in real time via natural language commands. The system consists of three components: a Watchdog model to detect unlearning intent, a Surgeon model to generate repair procedures (identify content to forget, plan steps, generate parameter modification instructions), and a Patient model to execute parameter updates, achieving separation of responsibilities.


Section 04

Core Technology: Principles and Advantages of the STAMP Method

STAMP (Steering Through Activation Manipulation with PseudoInverse) is the core technology of RePAIR: it requires no retraining, operates on single samples, and is highly efficient. It builds on the observation that model knowledge is encoded in MLP activation patterns. By guiding activations into a rejection subspace via pseudoinverse updates, it makes the model refuse to answer the corresponding inputs. A low-rank variant reduces computational complexity, completes the operation in milliseconds, and supports on-device execution.
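The pseudoinverse update can be illustrated with a minimal linear-algebra sketch, under the simplest reading of the method: a single MLP projection weight `W`, forget-prompt activations `K`, and a refusal target `V_target`. All names, shapes, and the specific update form are assumptions for illustration, not the paper's exact formulation. The minimal-norm update mapping every forget activation onto the rejection target is `dW = (V_target - W K) K^+`, and its rank is at most the number of forget samples, which is one way a low-rank variant arises.

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, n = 64, 32, 4                # hidden width, output width, forget samples

W = rng.standard_normal((d_out, d_in))    # original MLP projection weight (assumed)
K = rng.standard_normal((d_in, n))        # MLP activations for the forget prompts
r = rng.standard_normal((d_out, 1))       # a "refusal" direction in output space
V_target = np.repeat(r, n, axis=1)        # steer every forget prompt to refusal

# Minimal-norm update such that (W + dW) @ K == V_target:
dW = (V_target - W @ K) @ np.linalg.pinv(K)
W_new = W + dW

# Forget prompts now land on the rejection target.
assert np.allclose(W_new @ K, V_target)

# Low-rank storage: dW factors into two thin matrices of rank <= n,
# which keeps the on-device patch small.
U, P = V_target - W @ K, np.linalg.pinv(K)
assert np.allclose(dW, U @ P)
```

Because `dW` is a product of a `(d_out, n)` and an `(n, d_in)` matrix, storing the two factors instead of the dense update is what makes a millisecond-scale, on-device patch plausible.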


Section 05

Experimental Validation: Results and Baseline Comparison

RePAIR was tested in three scenarios:
1. Harmful knowledge suppression: the forgetting score approaches 0 while 84.47% of task performance is retained.
2. Misinformation correction: the F-RL metric is 0.00, i.e., the misinformation is completely forgotten.
3. Personal data erasure: the R-RL metric is 0.88, accurately erasing the target data while preserving unrelated knowledge.
Compared with 6 baselines, RePAIR performs best on unlearning completeness, model utility, efficiency, and user control.


Section 06

Technical Highlights and Application Scenarios

Technical highlights:
1. User autonomy without relying on providers.
2. No retraining; millisecond-level unlearning.
3. On-device execution for privacy protection.
4. Extensible to multimodal models.
Application scenarios: personal privacy protection (GDPR compliance), enterprise data security, real-time fact-checking, and safety compliance.


Section 07

Limitations and Future Research Directions

Limitations: there is no complete theoretical guarantee of unlearning, and forgotten knowledge may be indirectly recovered; side effects are hard to control (over- or under-unlearning); the method may be vulnerable to adversarial attacks; and interpretability needs improvement. Future directions: multimodal unlearning, progressive unlearning, reversible unlearning, and federated unlearning.