Reading

Northern Thai LLM: Evaluation Framework for Dialect Understanding Capabilities of Large Language Models

For the translation task between Northern Thai dialect (Lanna language) and Standard Thai, this project constructs a complete evaluation framework for large language models, and significantly improves the models' performance on minority languages through LoRA fine-tuning.

大语言模型低资源语言泰语兰纳语LoRA微调机器翻译方言理解AI公平性

Published 2026-05-13 02:56Recent activity 2026-05-13 03:03Estimated read 5 min

Section 01

Introduction / Main Floor: Northern Thai LLM: Evaluation Framework for Dialect Understanding Capabilities of Large Language Models

Section 02

Project Background: Linguistic Diversity and AI Fairness

Lanna language (ISO code: nod/nort2740) is a dialect used by millions of people in Northern Thailand, with significant differences from Standard Thai (tha/thai1261). Although it has a writing system (Lanna script), it is severely lacking in digital resources and internet content. This data scarcity makes Lanna a typical low-resource language scenario, which is ideal for testing the capability boundaries of large language models in handling non-mainstream languages.

Section 03

Three-Layer Architecture Design

The project adopts a clear three-layer architecture, with each layer named after a Lanna cultural item:

Section 04

Layer 1: lanna_khuang (Data Layer)

"Khuang" means container in Lanna culture; this layer is responsible for containerized data management:

Convert raw corpus in Excel format to JSONL
Perform stratified division of training/development/test sets
Manage the alt-translation flow
Support bidirectional translation: Lanna → Standard Thai, Standard Thai → Lanna

Section 05

Layer 2: lanna_kuafai (Adaptation Layer)

"Kuafai" means bamboo tray, symbolizing bearing and transmission. This layer is responsible for the actual operation of the model:

Support cutting-edge API calls (GPT-4o, Claude, Gemini, DeepSeek-V3)
Inference for open-source weight models (Typhoon2, SeaLLM, Qwen2.5, LLaMA-3.1-8B)
LoRA fine-tuning (PEFT r=8)
Provide the lanna-kuafai command-line tool

Section 06

Layer 3: lanna_jorfa (Diagnostic Layer)

"Jorfa" means offering, representing the examination and inspection of the model. This layer focuses on evaluation and analysis:

Triple-ChrF scoring (supports variable N-grams 1-4)
G-statistic calculation
Multi-dimensional facet slicing
Error typology analysis
Manual scoring form (BaiLan)
Krippendorff's α consistency test (HomPoi)

Section 07

Triple-ChrF Scoring Mechanism

The project adopts an improved ChrF (character-level F-score) evaluation method, calculating scores in three dimensions simultaneously:

ChrF_avg: Average F-score
ChrF_max: Best performance
ChrF_diff: Score difference (reflects the instability of model output)

This triple evaluation mechanism can capture the overall level and fluctuation degree of model performance.

Section 08

Error Typology Analysis

The project establishes a five-category error classification system to help deeply understand model failure patterns:

Lexical-level errors
Syntactic-level errors
Semantic-level errors
Cultural-specific item errors
Transcription errors

Continue Reading

Keep going with more reads from the same topic.

SignalCut: An Intelligent Tool for Turning AI Search Visibility Gaps into Video Marketing Campaigns

SignalCut is an innovative web application that analyzes brands' visibility gaps in AI search, automatically generates evidence-based marketing strategies, and creates Hera video materials, helping early-stage brands gain a competitive edge in the AI answer engine era.

Recent activity 2026-04-26 11:27

AWS Open-Sources AI Search Citation Analysis System: Track Brand Exposure in AI Search Engines

An open-source project officially released by AWS, built on Amazon Bedrock, Step Functions, and React to form a complete serverless citation analysis system. It helps enterprises monitor their brand's citation status and competitive landscape in AI searches like ChatGPT, Perplexity, Gemini, and Claude.

Recent activity 2026-03-31 20:49

Next.js Application SEO and GEO Integrated Optimization Solution: Comprehensive Visibility from Search Engines to AI Assistants

This article delves into the stevewerme/seo-geo-nextjs project, an open-source tool designed specifically for Next.js applications to simultaneously optimize traditional search engine rankings (SEO) and generative engine visibility (GEO). It analyzes the project's core architecture, implementation mechanisms, practical application scenarios, and its strategic significance for developers and content creators.

Recent activity 2026-04-03 14:48

Baiyuan GEO Platform Technical White Paper: SaaS Engineering Practice for Generative Engine Optimization (GEO)

This article deeply analyzes the GEO Platform technical white paper developed by Baiyuan Technology, covering the seven-dimensional AI citation rate scoring algorithm, AXP shadow document delivery mechanism, Schema.org three-layer entity knowledge graph, and the hallucination automatic detection and repair closed-loop system, providing an engineering solution for brands to gain visibility in generative AI such as ChatGPT and Claude.

Recent activity 2026-04-18 22:54