Section 01
[Introduction] Study on Failure Modes of Small Language Models in Intelligent RAG Workflows
This paper conducts a systematic evaluation of four small language models (SLMs) on financial document reasoning tasks, revealing the dominant failure modes in intelligent RAG workflows and proposing a reusable error taxonomy and dual-review protocol.
Original Authors: Muhammad Ahmed Mufti, Usman Haroon (FAST National University) Source: GitHub Project GenAI_Project Link: https://github.com/UsmanHaroon1177/GenAI_Project Release Time: 2026-05-12
The core research objects include four SLMs: Qwen3-1.7B, SmolLM3-3B, Phi-4-mini, and Llama-3.1-8B, with GPT-OSS-120B used as a capability upper bound for comparison.