Techorama NL 2024: ‘Talk to your systems’ - Integrating Gen AI into your architectures with structured LLM output

Slide 1

Slide 1 text

‘Talk to your systems’ Integrating Gen AI into your architectures with structured LLM output Christian Weyer | Co-Founder & CTO | Thinktecture AG | [email protected]

Slide 2

Slide 2 text

§ Technology catalyst § AI-powered solutions § Pragmatic end-to-end architectures § Microsoft Regional Director § Microsoft MVP for AI § Google GDE for Web Technologies [email protected] @christianweyer https://www.thinktecture.com 'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output Christian Weyer Co-Founder & CTO @ Thinktecture AG 2

Slide 3

Slide 3 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 3 Talk to your systems Why? What? How?

Slide 4

Slide 4 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output TALK TO YOUR SYSTEMS WHY? 4

Slide 5

Slide 5 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 5 Human language rocks Extending access to software

Slide 6

Slide 6 text

One possible UX pattern 'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output A language-enabled “UI” 6

Slide 7

Slide 7 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 7 LLMs Use wisely

Slide 8

Slide 8 text

§ LLMs are always part of end-to-end architectures § Client apps (Web, desktop, mobile) § Services with APIs § Databases § etc. § An LLM is ‘just’ an additional asset in your architecture § Enabling human language understanding & generation § It is not the Holy Grail for everything § Enable human language as a ﬁrst-class citizen 'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output End-to-end architectures with LLMs 8 Clients Services LLMs Desktop Web Mobile Service A Service B Service C API Gateway Monitoring LLM 1 LLM 2

Slide 9

Slide 9 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output It’s just HTTP APIs Inference, FTW 9

Slide 10

Slide 10 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 10 Your LLM OpenAI & beyond

Slide 11

Slide 11 text

§ Llama, Mistral, Qwen families show big potential § Success factors § Use case § Parameter size § Quantization § Processing power needed § CPU optimization on its way § Local inference runtimes with APIs § E.g. llama.cpp, ollama, llamaﬁle, vLLM 'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output Open-source LLMs thrive 11 § Local UIs § E.g. Open WebUI

Slide 12

Slide 12 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output TALK TO YOUR SYSTEMS HOW? 12

Slide 13

Slide 13 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output Most convenient platform for developers today to work with Gen AI 13

Slide 14

Slide 14 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 14 Prompting Talk to me!

Slide 15

Slide 15 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 15 JSON Mode Give it structure!

Slide 16

Slide 16 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 16 ‘Function’ Calling Give it schema!

Slide 17

Slide 17 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 17 Strict Mode Make it robust!

Slide 18

Slide 18 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 18 Pydantic & Instructor Make my life easy!

Slide 19

Slide 19 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 19 Talking to systems No auto-magic

Slide 20

Slide 20 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output TALK TO YOUR SYSTEMS WHAT? 20

Slide 21

Slide 21 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output 21 End-to-End examples Talking to your applications

Slide 22

Slide 22 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output Talk to Thinktecture 22 Angular PWA Speech-to-Text Internal Gateway (Python FastAPI) LLM / SLM Text-to-Speech Transcribe spoken text Transcribed text Check for experts availability with text Extract { experts, booking times } from text Structured JSON data (Tool calling) Generate response with availability Response Response with experts availability 🗣 🔉 Speech-to-text for response Response audio Internal Business API (node.js – veeeery old) Query Availability API Availability When is CL…? CL will be…

Slide 23

Slide 23 text

Filling Angular forms with human language input protected readonly formGroup = this.fb.group({ firstName: [’’], lastName: [’’], addressLine1: [’’], addressLine2: [’’], city: [’’], state: [’’], zip: [’’], country: [’’] }); 'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output Smart Form Filler OK, nice – so here is my address then:: Peter Schmitt, Rheinstr. 7 in Schkeuditz – postcode is 04435, BTW. 23 Smart Form Filler (TS code & LLM)

Slide 24

Slide 24 text

'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output TALK TO YOUR SYSTEMS RECAP 24

Slide 25

Slide 25 text

§ Human language enables new powerful use cases & access to our software § Always use structured output § Structured output is the secret sauce for integrating LLMs into your application architectures § Consider applying the Maybe pattern § Brings more robustness § Function Calling can be ﬂaky § Especially with smaller models § Do not use frameworks that ‘auto-magically’ map Function Calling results to local code § Always validate return data! § Instructor is a helpful library to boost LLM use cases § Implements lots of best practices § Supports any LLM / SLM § Integrates with FastAPI 'Talk to your systems' Integrating Gen AI into your architectures with structured LLM output Recap & Recommendations 25

Slide 26

Slide 26 text

Thank you! Christian Weyer https://thinktecture.com/christian-weyer 26 https://github.com/thinktecture-labs/talk-to-your-systems https://github.com/thinktecture-labs/smart-form-ﬁller