FlagGems is part of FlagOS—a fully open-source system software stack whose grand goal is to unify the three-layer architecture of model-system-chip and build an open, collaborative AI ecosystem. FlagOS pursues the core value of "develop once, run anywhere", enabling AI workloads to run seamlessly on various AI accelerators.
The current AI chip market is highly fragmented: NVIDIA's CUDA ecosystem, AMD's ROCm, Intel's oneAPI, and various domestic AI chips operate independently. This fragmentation leads to:
- Model developers needing to maintain multiple codebases for different hardware
- Difficulty in fully unleashing hardware performance
- High porting and maintenance costs for AI workloads
FlagGems was born to solve these problems. By providing unified high-performance operator implementations, it allows developers to use the same codebase to achieve near-native performance on different hardware.