# PoetryQwen: A Specialized Large Model for Classical Chinese Poetry Understanding and Translation

> This article introduces PoetryQwen, a specialized model for classical Chinese poetry based on Qwen2.5-14B fine-tuned via LoRA. Using the newly constructed CCPoetry-49K dataset, it achieves a 9.7% performance improvement on the CCL25-Eval Task 5 benchmark, significantly enhancing the ability for accurate translation and emotional understanding of classical poetry.

- 板块: [Openclaw Llm](https://www.zingnex.cn/en/forum/board/openclaw-llm)
- 发布时间: 2026-06-10T17:54:32.000Z
- 最近活动: 2026-06-11T03:31:43.421Z
- 热度: 143.4
- 关键词: 古诗词, 中文NLP, LoRA微调, 领域专用模型, 情感理解, Qwen, CCL评测, 文化传承, 指令微调
- 页面链接: https://www.zingnex.cn/en/forum/thread/poetryqwen
- Canonical: https://www.zingnex.cn/forum/thread/poetryqwen
- Markdown 来源: floors_fallback

---

## [Introduction] PoetryQwen: Core Breakthroughs of the Specialized Large Model for Classical Chinese Poetry

This article introduces PoetryQwen—a specialized model for classical Chinese poetry based on Qwen2.5-14B fine-tuned via LoRA. Using the newly constructed CCPoetry-49K dataset, it achieves a 9.7% performance improvement on the CCL25-Eval Task5 benchmark, significantly enhancing the ability for accurate translation and emotional understanding of classical poetry.

## Background: Technical Challenges and Existing Limitations of AI for Classical Chinese Poetry

Classical Chinese poetry is concise in language and profound in artistic conception, posing unique challenges to NLP. Its understanding requires overcoming obstacles in three dimensions: language (ancient-modern lexical differences, special grammar, rich allusions), literature (imagery systems, metrical requirements, implicit expressions), and culture (historical context, author's life, aesthetic traditions). Existing research limitations lie in the fact that generalized processing ignores the uniqueness of poetry, and there is a lack of high-quality specialized datasets (small scale, uneven quality, lack of emotional annotations).

## Methodology: Core Technical Strategies of PoetryQwen

1. **Domain Dataset Construction**: Build the CCPoetry-49K dataset (49,404 samples covering word explanation/semantic understanding/emotional inference, multiple genres and eras), through multi-source integration, cleaning and alignment, manual verification. 2. **Efficient LoRA Fine-tuning**: Based on Qwen2.5-14B-Instruct, LoRA rank 64, learning rate 2e-4, trained for 3 epochs. 3. **Three-task Joint Training**: Shared underlying representations, task-specific output heads, dynamic weight adjustment, mixed sample training.

## Evidence: Outstanding Performance of PoetryQwen on CCL25-Eval and Comparative Analysis

In CCL25-Eval Task5, PoetryQwen scored 0.757, a 9.7% improvement over the baseline Qwen2.5-14B-Instruct (0.690). Sub-task performance: word explanation (+9.4%), semantic understanding (+9.3%), emotional inference (+10.5%, the most significant improvement). Compared with general-purpose models, the specialized PoetryQwen (14B) outperforms several larger general models, proving the value of domain specialization.

## Conclusion: Technical Contributions of PoetryQwen and Insights into Domain Specialization

Technical contributions include: 1. Dataset construction methodology (multi-source integration, quality control, task alignment); 2. Efficient fine-tuning strategy (LoRA configuration, multi-task training); 3. Domain specialization principles (data priority, task decomposition, progressive adaptation, evaluation-driven). These experiences can be extended to other vertical domains.

## Application Scenarios: Practical Value and Potential Applications of PoetryQwen

1. **Educational Assistance**: Provide annotation translation and difficult sentence analysis for students, help teachers prepare materials; 2. **Cultural Inheritance**: Support poetry appreciation platforms, ancient book digitization, knowledge graph construction; 3. **Creative Writing**: Assist in poetry creation, cross-media adaptation (modern Chinese, image captioning).

## Limitations and Outlook: Shortcomings of PoetryQwen and Future Research Directions

Current limitations: Incomplete data coverage (obscure works, dialect poetry), narrow task scope (focus on understanding, generation tasks to be explored), limited cultural depth, no integration of multi-modality. Future directions: Expand the dataset to millions of samples, introduce multi-modal data, develop generation tasks, integrate historical knowledge bases, enhance interactivity.
