Cloud-Native Generative AI with Fermyon Serverless AI

Cloud-Native Generative AI with Fermyon Serverless AI
Thorsten Hans (@ThorstenHans)

Cloud-Native Business Applications Day
Time | Title | Speaker
09:00 – 10:00 | Cloud-Native-all-the-Things: Definition, Practices, and Patterns | Christian Weyer, Thorsten Hans
10:30 – 11:30 | Container-Based Development for .NET Developers | Tobias Fenster
12:00 – 13:00 | Cloud-Native Generative AI with Fermyon Serverless AI | Thorsten Hans
15:15 – 16:15 | Cloud-Native Microservices, On-Premises or in the Cloud, with Dapr | Christian Weyer
16:45 – 17:45 | Mega Mergers: Cloud-Native Architectures with Containers and WebAssembly | Thorsten Hans

Thorsten Hans
Consultant @ Thinktecture
Microsoft MVP | Docker Captain
#Azure #Containers #CloudNative #Wasm
@ThorstenHans, [email protected], thinktecture.com, thorsten-hans.com

Agenda
• Intro
• What is Fermyon Spin
• Serverless AI with Fermyon Cloud
• Conclusion

WebAssembly will change the way we architect distributed systems in the future

Intro: Why will Wasm have such a big impact?
• Cloud-vendor interest:
  • They can put more apps on a compute resource than today
  • Wasm and WASI give them a strict security and isolation model
  • Wasm workloads are much smaller than the alternatives
  • They can scale to zero thanks to super-fast bootstrapping (< 1 ms)

Intro: Why will Wasm have such a big impact?
• Developer interest:
  • They can use any language that compiles to wasm32-wasi
  • They can ship just the app
  • They can reduce cloud spending
  • The same workloads will be cheaper because they consume fewer resources and execute faster

Wasm on the server relates to containers in the same
way containers related to virtual machines 10+ years ago

Intro: Let’s get everybody on track! 🦀
Fermyon Spin is:
• A serverless runtime built using Wasm, WASI, and the WebAssembly Component Model (leveraging wasmtime internally)
• A collection of SDKs for many popular languages
• Highly focused developer tooling

Demo: Dive into Fermyon Spin

Serverless AI with Fermyon Cloud
Give application developers sophisticated generative AI capabilities with no ops overhead while maintaining developer productivity

Serverless AI with Fermyon Cloud
• Fermyon Serverless AI empowers developers to use AI inferencing in their apps with no additional setup
• Inferencing capabilities are encapsulated in two methods
• Local developer story (independent of your hardware / operating system)
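To illustrate what "two methods" can mean for application code, here is a minimal sketch. It assumes the two operations are text inferencing and embedding generation, which is what the following slide describes. The `ServerlessAI` protocol, the `FakeBackend` class, and the `summarize` helper are hypothetical names made up for this example. They are not the Spin SDK API, and the model name is only an illustrative string.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Protocol


@dataclass
class InferencingResult:
    text: str  # the generated completion


class ServerlessAI(Protocol):
    """Hypothetical two-method surface; NOT the actual Spin SDK API."""

    def infer(self, model: str, prompt: str) -> InferencingResult: ...

    def generate_embeddings(self, model: str, texts: list[str]) -> list[list[float]]: ...


class FakeBackend:
    """Deterministic stand-in so the sketch runs without a model or GPU."""

    def infer(self, model: str, prompt: str) -> InferencingResult:
        # Echo the prompt instead of calling a real LLM.
        return InferencingResult(text=f"[{model}] echo: {prompt}")

    def generate_embeddings(self, model: str, texts: list[str]) -> list[list[float]]:
        # Toy 2-dimensional "embedding": character count and word count.
        return [[float(len(t)), float(len(t.split()))] for t in texts]


def summarize(ai: ServerlessAI, article: str) -> str:
    """Application code depends only on the two methods, not on the backend."""
    return ai.infer("llama2-chat", f"Summarize: {article}").text
```

Because `summarize` only sees the two-method surface, the same code could run against a local backend during development and a hosted one after deployment, which is the idea behind the "local developer story" bullet.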

Serverless AI with Fermyon Cloud
• Frictionless AI offering by Fermyon
• Execute inferencing against LLMs (currently Llama 2 and CodeLlama with 13B-parameter variants) with no-ops
• Generate sentence embeddings (all-minilm-l6-v2) for your data using a no-ops vector database
• Support for additional models coming soon
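Sentence embeddings are typically used for similarity search: texts are stored alongside their vectors, and a query vector is matched against the closest ones. The sketch below shows that lookup with cosine similarity. The three-dimensional vectors are hand-written toy values standing in for real model output, and `cosine_similarity`, `nearest`, `store`, and `query` are names invented for this example. A real vector database does the same ranking at scale.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest(query, store, k=1):
    """Return the k stored texts whose embeddings are most similar to the query."""
    ranked = sorted(
        store,
        key=lambda item: cosine_similarity(query, item[1]),
        reverse=True,
    )
    return [text for text, _ in ranked[:k]]


# Toy (text, embedding) pairs; real embeddings would come from the model.
store = [
    ("Spin runs Wasm components", [0.9, 0.1, 0.0]),
    ("Llama 2 generates text", [0.1, 0.9, 0.2]),
    ("Containers package apps", [0.7, 0.0, 0.6]),
]
query = [0.8, 0.2, 0.1]

print(nearest(query, store, k=2))
```

The brute-force scan here is linear in the number of stored items, which is fine for a demo. Larger collections usually rely on an index inside the vector store.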

Demo: Hello Serverless AI

Demo: Deploy the Spin application to Fermyon Cloud to speed it up 🚀 and use GPU power

Conclusion
• With Spin, Fermyon demonstrates how WebAssembly will change the way we build software for the next wave of cloud computing
• Serverless AI allows application developers to add generative AI capabilities to their apps in no time
• Although we have access to Llama 2 and CodeLlama now, we will see more models in Fermyon Cloud soon

Thanks for your attention @ThorstenHans @Thinktecture
