Large Language Models (LLMs) exhibit amazing capabilities in text generation, but they are inherently stateless. Each call is independent; the model does not automatically remember previous conversation content nor follow preset business processes. This 'freedom' may be an advantage in open-ended creation, but it becomes an insurmountable obstacle in practical application scenarios that require multi-turn interactions, state tracking, and process control.
Imagine customer service robots, order processing systems, or medical consultation assistants — these scenarios all require AI to remember information provided by users, advance the conversation according to specific processes, and make consistent decisions at key nodes. Simple LLM calls are difficult to meet these needs, while traditional rule engines lack the flexibility of language understanding.
The FSM-LLM framework was born to solve this contradiction.