Built-in AI APIs & WebNN: AI right in your browser, local and offline-capable

Built-in AI APIs & WebNN AI right in your browser,
local and offline-capable Christian Liebel @christianliebel Consultant

Hello, it’s me. Built-in AI APIs & WebNN Christian Liebel
W3C WebML WG & CG TAG Associate X: @christianliebel Bluesky: @christianliebel.com Angular, PWA & Generative AI Microsoft MVP & Google GDE (Angular, Web) AI right in your browser, local and offline-capable

Examples Built-in AI APIs & WebNN Generative AI Cloud Providers
AI right in your browser, local and offline-capable

Drawbacks Built-in AI APIs & WebNN Generative AI Cloud Providers
Require a (stable) internet connection Subject to network latency and server availability Data is transferred to the cloud service Require a subscription AI right in your browser, local and offline-capable

Can we run GenAI models locally? Built-in AI APIs &
WebNN AI right in your browser, local and offline-capable

Bring Your Own AI (BYOAI) – Libraries – WebLLM –
Transfomers.js – Frameworks – ONNX Runtime – TensorFlow.js – APIs – WebNN – Cross-Origin Storage Built-in AI (BIAI) – Writing Assistance APIs – Summarizer API – Writer API – Rewriter API – Translator & Language Detector APIs – Prompt API Built-in AI APIs & WebNN Local AI Inference AI right in your browser, local and offline-capable

https://webllm.mlc.ai/ Built-in AI APIs & WebNN WebLLM DEMO AI right
in your browser, local and offline-capable

On NPM Built-in AI APIs & WebNN WebLLM AI right

Storing model files locally Built-in AI APIs & WebNN WebLLM
Internet Website HTML/JS Cache with model files Hugging Face Note: Due to the Same-Origin Policy, models cannot be shared across origins. AI right in your browser, local and offline-capable

Model Size Comparison Model:Parameters Size phi3:3b 2.2 GB mistral:7b 4.1
GB llama3:8b 4.7 GB gemma2:9b 5.4 GB gemma2:27b 16 GB llama3:70b 40 GB Built-in AI APIs & WebNN WebLLM AI right in your browser, local and offline-capable

https://huggingface.co/docs/transformers.js/index Built-in AI APIs & WebNN Transformers.js DEMO AI right

– Grants web apps access to the device’s CPU, GPU
and Neural Processing Unit (NPU) – In specification by the WebML Working Group at W3C – Implementation in progress in Chromium (behind a flag) – Better performance for specific workloads Built-in AI APIs & WebNN WebNN Source: https://webmachinelearning.github.io/webnn-intro/ DEMO AI right in your browser, local and offline-capable

Built-in AI APIs & WebNN Why should you care? DEMO

about://flags Enables WebNN API à Enabled Enables experimental WebNN API
features à Enabled Built-in AI APIs & WebNN WebNN AI right in your browser, local and offline-capable

Drawbacks Built-in AI APIs & WebNN WebNN Models can’t be
shared across origins Inference is fast, but doesn’t reach full native speed AI right in your browser, local and offline-capable

https://github.com/explainers-by-googlers/cross-origin-storage Built-in AI APIs & WebNN Cross-Origin Storage AI right

– Initiative by Google Chrome – Exploratory APIs for local
experiments and use case determination – Downloads AI models into Google Chrome – Models are shared across origins – Uses native APIs directly (full performance) Built-in AI APIs & WebNN Built-in AI https://developer.chrome.com/docs/ai/built-in AI right in your browser, local and offline-capable

Incubated by the WebML CG Built-in AI APIs & WebNN
Built-in AI APIs https://webmachinelearning.github.io/incubations/ DEMO AI right in your browser, local and offline-capable

Built-in AI APIs & WebNN Multimodal Models AI right in
your browser, local and offline-capable

Built-in AI APIs & WebNN Built-in AI APIs Operating System
Website HTML/JS Browser Internet Apple Intelligence Gemini Nano AI right in your browser, local and offline-capable

about://on-device-internals https://www.google.com/chrome/canary/ about://flags Enables optimization guide on device à EnabledBypassPerfRequirement
(API) for Gemini Nano à Enabled Built-in AI APIs & WebNN Built-in AI APIs AI right in your browser, local and offline-capable

TypeScript Definitions Built-in AI APIs & WebNN Built-in AI APIs

Built-in AI APIs & WebNN Chatbots DEMO AI right in

Built-in AI APIs & WebNN Categorization DEMO AI right in

Built-in AI APIs & WebNN Realtime Models DEMO AI right

(Cloud only!) Built-in AI APIs & WebNN Multimodal Realtime Models
DEMO AI right in your browser, local and offline-capable

Pros & Cons + Data does not leave the browser
(privacy) + High availability (offline support) + Low latency + Stability (no external API changes) + Low cost – Lower response quality – Less capable – High system (RAM, GPU) and bandwidth requirements – Large model size, models cannot always be shared – Model initialization and inference are relatively slow – APIs are experimental Built-in AI APIs & WebNN On-device AI Models AI right in your browser, local and offline-capable

Thank you for your kind attention! Christian Liebel @christianliebel [email protected]

Built-in AI APIs & WebNN: AI right in your brow...

Built-in AI APIs & WebNN: AI right in your browser, local and offline-capable

Christian Liebel PRO

More Decks by Christian Liebel

Other Decks in Programming

Featured

Transcript