Reading

ThaiFACTUAL: Eliminating Large Model Bias in Thai Political Stance Detection via Counterfactual Calibration

This article introduces the ThaiFACTUAL framework, a lightweight, model-agnostic calibration method designed to address systemic bias in large language models (LLMs) for political stance detection in low-resource languages, significantly improving fairness and accuracy without fine-tuning.

大语言模型偏见消除政治立场检测泰语NLP反事实校准AI公平性低资源语言EMNLP 2025

Published 2026-06-02 14:44Recent activity 2026-06-02 14:49Estimated read 8 min

ThaiFACTUAL: Eliminating Large Model Bias in Thai Political Stance Detection via Counterfactual Calibration

Section 01

Introduction: ThaiFACTUAL Framework – A Counterfactual Calibration De-biasing Solution for Thai Political Stance Detection

This article presents the ThaiFACTUAL framework developed by Teerapong Panboonyuen from Chulalongkorn University and MARSAIL. It is a lightweight, model-agnostic calibration method aimed at resolving systemic bias in large language models (LLMs) for political stance detection in low-resource languages such as Thai. The framework significantly enhances fairness and accuracy without requiring fine-tuning of the base model. Related research was published at the EMNLP 2025 Widening NLP (WiNLP) Workshop (Suzhou, China), and the source code is available on GitHub: https://github.com/kaopanboonyuen/ThaiFACTUAL (updated on 2026-06-02). Its core innovation lies in using counterfactual reasoning to separate stance signals from emotional noise, compatible with mainstream models like GPT-4 and LLaMA-3.

Section 02

Background: Bias Challenges in Thai Political Stance Detection

Political stance detection is a key NLP task that identifies the support/opposition/neutral attitude of text toward political entities (individuals, parties, policies). However, when LLMs are applied to low-resource languages like Thai, systemic bias is prominent:

Thai Language Features: Rich indirect expressions, entanglement of emotion and stance, and numerous polarized figures;
Typical Biases: Emotional leakage (equating positive emotion with supportive stance), entity preference (incorrectly associating specific political figures with fixed stances). These issues cause model predictions to deviate from the true stance.

Section 03

Overview of ThaiFACTUAL Framework: Lightweight Model-Agnostic Post-Processing Calibration

The core design idea of ThaiFACTUAL is post-processing calibration, which does not require expensive model retraining or large amounts of labeled data and is compatible with any black-box/white-box LLM. It is based on the principle of counterfactual reasoning: by systematically swapping political entities in text and re-scoring, it separates real stance signals from emotional noise, thereby reducing bias.

Section 04

Detailed Technical Principles: Counterfactual Samples and Calibration Process

Counterfactual Sample Construction

Given a text X containing political entity E:

Replace E with another entity E' to generate counterfactual text X';
Keep the emotional polarity of X' consistent with X;
Obtain the model's stance prediction probabilities for X and X'.

RStd Bias Metric

Calculate the standard deviation of recall rates across different stance categories: RStd = sqrt( sum( (Recall_i - mean(Recall))^2 ) / N ) A higher RStd indicates more severe model bias.

Calibration Flow

Evaluate the original model's bias using RStd and Bias-SSC;
Generate multiple counterfactual versions for each sample;
Adjust the original probabilities based on the counterfactual prediction distribution;
Verify entity-level fairness.

Section 05

Experimental Results: Bias Reduction and Performance Improvement

Evaluation results on the Thai political stance dataset (compared to baseline methods):

Method	Bias-SSC ↓	RStd ↓	F1 ↑	OOD ↑
GPT-4 (Original)	21.7	15.2	70.8	56.4
GPT-4 (De-biased Prompt)	18.3	12.6	71.9	57.0
LLaMA-3 (CoT Prompt)	16.5	11.8	68.1	59.7
ThaiFACTUAL	9.8	6.4	73.5	65.2

Key Findings:

Bias-SSC decreased by over 55%;
RStd was significantly reduced, improving model prediction consistency;
F1 score increased (without sacrificing accuracy);
OOD generalization ability was enhanced.

Section 06

Practical Application Value: From Public Opinion Monitoring to Academic Research

Social Media Public Opinion Monitoring

Reduce content misjudgment caused by model bias;
Fairly present diverse political views;
Improve consistency in cross-entity content moderation.

Academic Research Tools

Analyze the distribution of political discourse;
Track the evolution of public attitudes;
Conduct cross-cultural/language stance comparison studies.

Model Evaluation Benchmark

Bias audit before model release;
Compare de-biasing effects of different architectures;
Monitor bias drift in production models.

Section 07

Limitations and Future Directions

Limitations:

Language Scope: Currently tailored for Thai;迁移 to other low-resource languages requires linguistic adaptation;
Entity Coverage: Relies on a predefined list of political entities; emerging entities need dynamic updates;
Computational Overhead: Counterfactual generation increases inference costs.

Future Directions:

Automated counterfactual sample generation;
Extend to more low-resource languages and cultural scenarios;
Combine fine-tuning and post-processing calibration to further reduce bias.

Section 08

Conclusion: A Practical Solution for AI Fairness in Low-Resource Languages

ThaiFACTUAL provides a practical de-biasing solution for political stance detection in low-resource languages. Its core contribution is proving that without modifying model parameters, post-processing calibration can significantly reduce LLM systemic bias. This approach is not only applicable to Thai but also provides a reference methodology for other low-resource languages and cultural environments. For researchers and developers focusing on AI fairness, low-resource NLP, or political text analysis, ThaiFACTUAL is a plug-and-play tool that helps build more fair and reliable detection systems.