Three-Act Adventure Structure
Corresponding to the three stages of LLM development: pre-training (basic architecture cards), fine-tuning (task-specific optimization), and release (addressing technical + real-world challenges). Each act has 6 stages, with a boss evaluation at the end of each stage.
Technical Card System
28 core technical cards:
- Architecture optimization: MoE (sparse activation for cost reduction), FlashAttention (memory-efficient attention)
- Alignment technologies: RLHF (reinforcement learning from human feedback), DPO (direct preference optimization)
- Reasoning capabilities: CoT (chain of thought), Long-CoT, Self-Play
Synergy Effects and Event System
- 16 synergy effects: Specific card combinations trigger additional bonuses, simulating the effects of technical combinations
- 20 random events: Simulate real-world challenges like insufficient computing power, data quality issues, and investor visits
Researcher Recruitment
8 researchers provide skill bonuses (algorithm, data, alignment, system engineering, etc.), simulating team talent strategies.