Section 01
LLM-Vocabulary-Insight: A Guide to the In-Depth Analysis of Greek Tokenization in 50 Large Language Models
This project analyzes how well 50 mainstream large language models (LLMs) tokenize Greek text. It reveals significant differences in multilingual support across models and provides a data-driven reference for choosing an LLM for Greek language processing. The project was developed by constLiakos and released on GitHub on June 5, 2026.
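To make the idea of vocabulary-level Greek support concrete, here is a minimal sketch of one way such a measurement could be made: counting how many entries in a tokenizer vocabulary contain Greek characters. This is an illustration only, not the project's actual method. The function names, the toy vocabulary, and the choice of Unicode ranges are assumptions introduced for this example.

```python
# Hypothetical helper names and toy data; not taken from the project.

def is_greek_char(ch: str) -> bool:
    """True if ch is in the Greek and Coptic or Greek Extended Unicode blocks."""
    cp = ord(ch)
    return 0x0370 <= cp <= 0x03FF or 0x1F00 <= cp <= 0x1FFF


def count_greek_tokens(vocab: list[str]) -> tuple[int, float]:
    """Return (number of tokens containing a Greek character, share of vocab)."""
    if not vocab:
        return 0, 0.0
    greek = [tok for tok in vocab if any(is_greek_char(c) for c in tok)]
    return len(greek), len(greek) / len(vocab)


# Toy vocabulary standing in for a real model's token list (assumption).
TOY_VOCAB = ["the", "ing", "και", "της", "α", "λόγος", "##ed", "2026"]

count, share = count_greek_tokens(TOY_VOCAB)
print(f"Greek tokens: {count} of {len(TOY_VOCAB)} ({share:.0%})")
```

Real vocabularies often store tokens in a byte-level encoding, so Greek characters may not appear literally in the token strings. In that case the tokens would need to be decoded to text before a check like this is meaningful.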