Slide 21
Slide 21 text
Comparison
22,98 33,96
19,08
38,75
564,63
0
100
200
300
400
500
600
WebLLM (Mistral-7b, M1) WebLLM (Mistral-7b, M3) OpenAI (GPT-4) Azure OpenAI (GPT-4) Groq (Mixtral-8x7b)
Tokens/sec
Generative AI power on the web
Making web apps smarter with WebGPU and WebNN
Performance
WebLLM/Groq: Own tests (23.03.2024), OpenAI/Azure OpenAI: https://mcplusa.com/comparing-performance-of-openai-gpt-4-and-microsoft-azure-gpt-4/ (31.08.2023)