Slide 1

Slide 1 text

What’s New in Web AI? Christian Liebel @christianliebel Consultant

Slide 2

Slide 2 text

Hello, it’s me. What’s New in Web AI? Christian Liebel W3C WebML WG & CG TAG Associate christian.liebel @thinktecture.com Angular, PWA & Generative AI Microsoft MVP & Google GDE (Angular, Web)

Slide 3

Slide 3 text

It’s all about the new possibilities. What’s New in Web AI? Why should you care about AI? Stolen from Matthieu’s keynote

Slide 4

Slide 4 text

Examples What’s New in Web AI? Generative AI Cloud Providers

Slide 5

Slide 5 text

Drawbacks What’s New in Web AI? Generative AI Cloud Providers Require a (stable) internet connection Subject to network latency and server availability Data is transferred to the cloud service Require a subscription

Slide 6

Slide 6 text

Bring Your Own AI (BYOAI) – Libraries – WebLLM – Transfomers.js – Frameworks – ONNX Runtime – TensorFlow.js – APIs – WebGPU, WebNN – Cross-Origin Storage NEW! Built-in AI (BIAI) – Writing Assistance APIs – Summarizer API – Writer API – Rewriter API – Proofreader API NEW! – Translator & Language Detector APIs – Prompt API NEW! Multimodal – WebMCP NEW! What’s New in Web AI? Web AI Landscape

Slide 7

Slide 7 text

https://webllm.mlc.ai/ What’s New in Web AI? WebLLM DEMO

Slide 8

Slide 8 text

On NPM What’s New in Web AI? WebLLM

Slide 9

Slide 9 text

Storing model files locally What’s New in Web AI? WebLLM Internet Website HTML/JS Cache with model files Hugging Face Note: Due to the Same-Origin Policy, models cannot be shared across origins.

Slide 10

Slide 10 text

Model Size Comparison Model:Parameters Size phi3:3.8b 2.2 GB mistral:7b 4.1 GB deepseek-r1:8b 5.2 GB gemma3n:e4b 7.5 GB gemma3:12b 8.1 GB llama4:16x17b 67 GB What’s New in Web AI? WebLLM

Slide 11

Slide 11 text

Drawbacks What’s New in Web AI? WebNN Models can’t be shared across origins Inference is fast, but doesn’t reach full native speed

Slide 12

Slide 12 text

https://github.com/explainers-by-googlers/cross-origin-storage What’s New in Web AI? Cross-Origin Storage

Slide 13

Slide 13 text

https://huggingface.co/docs/transformers.js/index What’s New in Web AI? Transformers.js DEMO

Slide 14

Slide 14 text

Cloud only… What’s New in Web AI? Realtime Models DEMO

Slide 15

Slide 15 text

…or is it? NEW! What’s New in Web AI? Realtime Models DEMO

Slide 16

Slide 16 text

– Grants web apps access to the device’s CPU, GPU and Neural Processing Unit (NPU) – In specification by the WebML Working Group at W3C – Implementation in progress in Chromium (behind a flag) – Better performance for specific workloads What’s New in Web AI? WebNN Source: https://webmachinelearning.github.io/webnn-intro/

Slide 17

Slide 17 text

about://flags Enables WebNN API à Enabled Enables experimental WebNN API features à Enabled What’s New in Web AI? WebNN

Slide 18

Slide 18 text

– Initiative by Google Chrome – Exploratory APIs for local experiments and use case determination – Downloads AI models into Google Chrome – Models are shared across origins – Uses native APIs directly (full performance) What’s New in Web AI? Built-in AI https://developer.chrome.com/docs/ai/built-in

Slide 19

Slide 19 text

Incubated by the WebML CG What’s New in Web AI? Built-in AI APIs https://webmachinelearning.github.io/incubations/ DEMO

Slide 20

Slide 20 text

What’s New in Web AI? Multimodal Models

Slide 21

Slide 21 text

What’s New in Web AI? Multimodal Models DEMO

Slide 22

Slide 22 text

What’s New in Web AI? Structured Output DEMO https://bandarra.me/apps/flyby/

Slide 23

Slide 23 text

What’s New in Web AI? Built-in AI APIs Operating System Website HTML/JS Browser Internet Apple Intelligence Gemini Nano

Slide 24

Slide 24 text

about://on-device-internals https://www.google.com/chrome/canary/ about://flags Enables optimization guide on device à EnabledBypassPerfRequirement (API) for Gemini Nano à Enabled What’s New in Web AI? Built-in AI APIs

Slide 25

Slide 25 text

TypeScript Definitions What’s New in Web AI? Built-in AI APIs

Slide 26

Slide 26 text

What’s New in Web AI? WebMCP – Allows websites to expose tools to browser or external agents – Joint effort by Microsoft and Google https://github.com/webmachinelearning/webmcp

Slide 27

Slide 27 text

What’s New in Web AI? WebMCP https://www.youtube.com/watch?v=gbu9kyY2B60

Slide 28

Slide 28 text

Pros & Cons + Data does not leave the browser (privacy) + High availability (offline support) + Low latency + Stability (no external API changes) + Low cost – Lower response quality – Less capable – High system (RAM, GPU) and bandwidth requirements – Large model size, models cannot always be shared – Model initialization and inference are relatively slow – APIs are experimental What’s New in Web AI? Web AI

Slide 29

Slide 29 text

Thank you for your kind attention! Christian Liebel @christianliebel [email protected]