model config # python from ai_edge_torch.generative.examples.gemma3 import gemma3 // rust use candle_transformers::models::gemma::{Config as Config1, Model as Model1};
format Works for classical ML and neural networks For LLM: need to construct input tensors + decode output tensors Not realistic for LLM inference in-browser due to model size 300M LLM model is 1.2 GB https://onnxruntime.ai/docs/tutorials/web/deploy.html
for customization Face detection, image classification, etc. Can customize the models - only the input data HuggingFace provides pre-built LLM models Can also convert the models yourself LLM is packaged as a .task file Includes LiteRT files, components, metadata .task format already includes tokenizer + model config https://ai.google.dev/edge/mediapipe/solutions/guide
construct input tokenizer + tensors Needs to use both Rust + JS Focuses on NN + LLM https://github.com/huggingface/candle/tree/main/candle-wasm-examples