Section 01
FlagGems Project Guide: Cross-Hardware LLM High-Performance Operator Library Based on Triton
FlagGems is an important component of the FlagOS fully open-source system software stack. Implemented using the Triton language, it achieves seamless integration via the PyTorch ATen backend registration mechanism, supporting acceleration for large language model training and inference across diverse hardware platforms. Its goal is to realize the AI acceleration vision of 'develop once, run anywhere' and reduce model porting and maintenance costs.