Section 01
DIFFHEADS Project Guide: Eliminating LLM Bias via Differential Analysis and Inference-Time Masking
DIFFHEADS is an open-source research project by the GeniusHTX team. Its core is to identify 'bias heads' that drive LLMs to produce unfair outputs through differential analysis, and mask these heads during inference to achieve lightweight, reversible debiasing without retraining the model. The project includes an automated evaluation tool and a multi-turn dialogue experimental framework, providing a new path for LLM fairness assurance.