programmier.con 2025: Semantic AI: A Language Model & an Embedding Model walk into a bar...

Christian Weyer | Co-Founder & CTO | Thinktecture AG | [email protected]

Our journey
§ Models for our software
§ Lightweight RAG
§ Semantic Routing
§ Structured Output / Tool Calling

MODELS FOR OUR SOFTWARE

Semantic AI 🫱 🫲 Generative AI
§ Language Models understand and generate semantically rich human language, transforming it into text or structured data for both humans and machines.
  ⚠ Non-deterministic: same input can lead to different outputs.
§ Embedding Models capture semantic meaning by encoding human language into numerical vector representations, facilitating understanding, comparison, and retrieval for both humans and machines.
  ✅ Deterministic: same input always results in the same embedding.
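The "comparison" that embedding vectors enable is commonly done with cosine similarity. The following is a minimal sketch using only the standard library; the four-dimensional vectors and their names are made up for illustration (a real embedding model emits hundreds of dimensions), and the slides do not state which similarity measure is used.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # define similarity with a zero vector as 0
    return dot / (norm_a * norm_b)


# Hypothetical embeddings, invented for this example only
workshop = [0.9, 0.1, 0.0, 0.2]
training = [0.8, 0.2, 0.1, 0.3]
invoice = [0.0, 0.9, 0.7, 0.1]

similar = cosine_similarity(workshop, training)
dissimilar = cosine_similarity(workshop, invoice)
```

With these toy values, `similar` comes out clearly higher than `dissimilar`, which is the property retrieval and routing rely on.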

API-based AI model integrations
§ Language & embedding models part of end-to-end architectures
§ Embedding models can be run locally
  § Optimized for CPU
§ Language models still hard to run locally
  § High GPU power
  § High VRAM
  § High memory bandwidth

Classical applications & UIs
§ API-based data
§ Document-based data

Language-enabled “UIs” – Talk-to-TT

C4 system context diagram
§ Various tech stacks
§ Docker-based distributed system

PATTERN STRUCTURED OUTPUT

Talking to APIs (Function / Tool calling)
§ Tools integration (and more) is being standardized with MCP
Diagram: “When is CW available for a two-days workshop?” → System Prompt (+ employee data) + Schema (for structured output) → Web API → Availability business logic

Technical implementation – Structured Output
§ Frameworks
  § Pydantic
  § Instructor
§ Methodology
  § Schema with JSON Mode (not Function Calling)
§ SLM/LLM
  § Llama 3.3 70B on Cerebras (very fast)
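To illustrate the "schema with JSON mode" idea: the model is asked to reply with JSON matching a schema, and the application validates that reply before passing it to business logic. The sketch below uses a standard-library dataclass as a stand-in for the Pydantic model that the listed frameworks would normally validate; the field names `employee` and `duration_days` and the simulated reply are assumptions, not taken from the talk.

```python
import json
from dataclasses import dataclass


@dataclass
class AvailabilityRequest:
    """Hypothetical schema for the workshop availability question."""
    employee: str
    duration_days: int


def parse_availability_request(raw_reply: str) -> AvailabilityRequest:
    """Validate a JSON-mode model reply against the schema above."""
    data = json.loads(raw_reply)  # malformed JSON raises a ValueError subclass
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    employee = data.get("employee")
    duration = data.get("duration_days")
    if not isinstance(employee, str) or not employee:
        raise ValueError("'employee' must be a non-empty string")
    # bool is a subclass of int in Python, so exclude it explicitly
    if isinstance(duration, bool) or not isinstance(duration, int) or duration < 1:
        raise ValueError("'duration_days' must be a positive integer")
    return AvailabilityRequest(employee=employee, duration_days=duration)


# Simulated model reply; a real one would come from the LLM API in JSON mode
reply = '{"employee": "CW", "duration_days": 2}'
request = parse_availability_request(reply)
```

Because language models are non-deterministic, a reply can violate the schema; catching the `ValueError` and re-prompting is one common way to handle that.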

PATTERN LIGHTWEIGHT RAG

Talking to documents (Retrieval-augmented generation)
§ 💡 Indexing / Embedding: .md, .docx, .pdf etc. → Cleanup & Split → Text → Embedding Model → Embedding → Save → Vector DB
§ 💡 Question Answering: Question (“Lorem ipsum…?”) → Text → Embedding Model → Embedding → Query → Vector DB → Relevant Results + Question → LLM → Answer w/ sources

Technical implementation – Lightweight RAG
§ Frameworks
  § LangChain
  § FastEmbed
    § Lightweight & efficient for generating text embeddings
§ Embedding model
  § jinaai/jina-embeddings-v2-base-de (local) – 768 dims
§ Vector store
  § PostgreSQL (pgvector)
§ LLM/SLM
  § Llama 3.3 70B on Cerebras (very fast)
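The two RAG phases (indexing and question answering) can be sketched in a few lines. This is an illustration only: `embed` is a toy bag-of-words stand-in for a real embedding model, the in-memory list stands in for the vector store, and the documents, file names, and prompt wording are invented. It does not use LangChain, FastEmbed, or pgvector.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: bag-of-words counts."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def split(document, max_words=20):
    """Split a document into chunks of at most max_words words."""
    words = document.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def build_index(documents):
    """Indexing: split each document, embed each chunk, keep the source."""
    return [(source, chunk, embed(chunk))
            for source, text in documents.items()
            for chunk in split(text)]


def retrieve(index, question, top_k=2):
    """Question answering, step 1: find the chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(index, key=lambda entry: cosine(q, entry[2]), reverse=True)
    return [(source, chunk) for source, chunk, _ in ranked[:top_k]]


def build_prompt(question, hits):
    """Question answering, step 2: assemble the prompt that would go to the LLM."""
    context = "\n".join(f"[{source}] {chunk}" for source, chunk in hits)
    return f"Answer using only the context and cite sources.\n\nContext:\n{context}\n\nQuestion: {question}"


# Hypothetical documents, invented for this example
documents = {
    "workshops.md": "Workshops on generative AI run for two days and are held on site.",
    "travel.md": "Travel expenses are reimbursed after the invoice has been approved.",
}
index = build_index(documents)
question = "How long do workshops run?"
hits = retrieve(index, question, top_k=1)
prompt = build_prompt(question, hits)
```

Keeping the source name next to each chunk is what makes an "answer with sources" possible: the LLM sees the `[workshops.md]` tag in its context and can cite it.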

PATTERN SEMANTIC ROUTING

Semantics-based decisions for user interactions
§ Input: “Lorem ipsum…?”
§ Guarding (e.g. prompt injection): Fine-tuned NLP Model
§ Routing (selecting correct target): Embedding Model
§ Targets: RAG, API Call, … something else …

Technical implementation – Semantic Guarding & Routing
Guarding
§ Frameworks
  § llm-guard
  § HuggingFace Transformers
§ NLP model
  § deepset/deberta-v3-base-injection (local)
Routing
§ Frameworks
  § semantic-routing
  § FastEmbed
§ Embedding model
  § intfloat/multilingual-e5-large (local) – 1024 dims
§ Vector store
  § PostgreSQL (pgvector)
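The routing half of this pattern can be sketched as a nearest-neighbour lookup: each target is described by a few example utterances, and an incoming question goes to the target whose examples are most similar. The sketch below is an assumption-laden illustration: `embed` is a toy bag-of-words stand-in for a real embedding model, and the route names, example utterances, and threshold are invented. Guarding with a fine-tuned classifier is not shown.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: bag-of-words counts."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# Hypothetical routes, each described by a few example utterances
ROUTES = {
    "rag": ["what does the document say about workshops",
            "summarize the travel policy document"],
    "api_call": ["when is CW available for a workshop",
                 "is someone available next week"],
}
# Embed the examples once up front; a vector store would hold these in practice
ROUTE_VECTORS = {name: [embed(u) for u in utterances]
                 for name, utterances in ROUTES.items()}


def route(question, threshold=0.3):
    """Return the best-matching route name, or None if nothing is similar enough."""
    q = embed(question)
    best_name, best_score = None, 0.0
    for name, vectors in ROUTE_VECTORS.items():
        score = max(cosine(q, v) for v in vectors)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None


target = route("When is CW available for a two-days workshop?")
```

The threshold gives the router a way to decline: a question that resembles none of the examples returns `None` and can be sent to a fallback target instead of being forced into the closest route.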

Typical Semantic AI patterns & solutions – in end-to-end software engineering
§ Lightweight RAG
§ Structured Output
§ Semantic Guarding & Routing
§ Insightful Observability

AI-based solutions are ≅10% AI and 100% software engineering.

Thank you!
Christian Weyer
[email protected]
https://thinktecture.com/christian-weyer
