Section 01
STRIDE: Spatial Data Attribution Tool with 13x Speed Improvement
Core Insights
STRIDE is a training data attribution tool that increases attribution speed by 13x using spatial modeling activation and sparse recovery techniques, providing efficient solutions for LLM scenarios such as data selection and contamination detection.
Source Information
- Paper Title: STRIDE: Training Data Attribution via Sparse Recovery from Subset Perturbations
- Publication Platform: arXiv
- Publication Date: June 3, 2026
- Original Link: http://arxiv.org/abs/2606.05165v1