Section 01
CAFUNE Model Guide: A Locally Trained Brazilian Portuguese Discrete Masked Diffusion Large Model
CAFUNE is a bidirectional Transformer model, trained entirely on local hardware and optimized for Brazilian Portuguese, that generates text with LLaDA-style discrete masked diffusion. The project demonstrates that a model of approximately 5 million parameters can be built with no external API costs and no data privacy risk, and it comes with a complete RLAIF teacher system and an ethics monitoring mechanism.
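To make "discrete masked diffusion" concrete, here is a minimal sketch in plain Python of the two halves of an LLaDA-style process: a forward step that masks each token independently with probability t, and a reverse loop that fills masked positions over several rounds. This is not CAFUNE's code. The names MASK, forward_mask, reverse_unmask, and the predict callback are hypothetical, and the reverse loop reveals randomly chosen positions rather than using any confidence-based remasking a real implementation might apply.

```python
import random

MASK = "[MASK]"  # hypothetical mask token name, chosen for illustration


def forward_mask(tokens, t, rng):
    """Forward process: mask each token independently with probability t."""
    return [MASK if rng.random() < t else tok for tok in tokens]


def reverse_unmask(tokens, predict, steps, rng):
    """Reverse process: fill masked positions over at most `steps` rounds.

    `predict(seq, i)` stands in for the bidirectional Transformer and
    returns a token for position i given the partially masked sequence.
    """
    seq = list(tokens)
    for step in range(steps, 0, -1):
        masked = [i for i, tok in enumerate(seq) if tok == MASK]
        if not masked:
            break
        # Reveal about 1/step of what is left, so the final round
        # (step == 1) fills every remaining masked position.
        k = max(1, len(masked) // step)
        for i in rng.sample(masked, k):
            seq[i] = predict(seq, i)
    return seq


if __name__ == "__main__":
    rng = random.Random(0)
    sentence = "eu gosto de cafuné".split()
    noisy = forward_mask(sentence, 0.5, rng)
    # Toy predictor that simply looks up the original token.
    restored = reverse_unmask(noisy, lambda seq, i: sentence[i], 3, rng)
    print(noisy)
    print(restored)
```

Because the predictor sees the whole sequence at every round, tokens on both sides of a masked position can inform the prediction, which is why a bidirectional Transformer fits this generation scheme.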