The modern GPU API for the web
High-performance graphics and compute. The foundation for the next generation of browser-based AI, games, and scientific computing.
WebGPU is the next-generation graphics and compute API for the web. It gives web developers low-level, high-performance access to the GPU — similar to Vulkan, Metal, and Direct3D 12 on native platforms.
Unlike its predecessor WebGL (based on OpenGL ES), WebGPU was designed from the ground up for modern GPUs. It offers:
From early experiments in 2016 to universal browser support in 2026
First public discussion about replacing WebGL with a modern, low-level API at the WebGL working group.
Apple proposes WebGPU (initially inspired by Metal). Google, Mozilla, and Apple begin collaboration. Khronos hosts “WebGL Next” meeting.
WebGPU Shading Language (WGSL) standardized. First public Working Draft published.
First stable browser implementations. Major milestone for real-world adoption.
Apple and Mozilla bring full support. WebGPU reaches critical mass.
W3C Candidate Recommendation. WebGPU becomes production-ready for graphics and AI workloads.
WebGPU’s real power for most developers lies in its compute capabilities. It enables high-performance machine learning inference directly in the browser using the user’s GPU.
True client-side inference
Run powerful AI models entirely on the user’s device using WebGPU. Perfect for privacy-sensitive applications, offline tools, and reducing cloud costs.
Production-ready libraries and demos using WebGPU for AI
High-performance in-browser LLM inference engine. Runs Llama, Mistral, Phi and more entirely in the browser with WebGPU acceleration. OpenAI-compatible API.
Run thousands of Hugging Face models in the browser. Native WebGPU support via ONNX Runtime. Up to 100× faster than WASM for many tasks.
Official collection of interactive demos showcasing graphics, compute shaders, and advanced techniques. Excellent starting point for learning the API.
Browse All Samples →