Section 01
Artifex-Assistantv5 Overview: A Breakthrough in Running 90B-Parameter Large Models Locally in the Browser
Artifex-Assistantv5 is a browser-based AI inference engine built on WebGPU/WGSL. It supports running 90-billion-parameter large models in an environment with 8GB of VRAM, integrates cutting-edge optimization technologies like TurboQuant KV cache compression and GPTQ INT4 quantization, enables local data processing, protects user privacy, and lowers the barrier to using AI.