Building and deploying AI infused apps with Elasticsearch using Podman and OpenShift AI

Slide 1

Slide 1 text

No content

Slide 2

Slide 2 text

Building and deploying AI infused apps with Elasticsearch using Podman and OpenShift AI Syed M Shaaf Sr. Principal Developer Advocate Red Hat

Slide 3

Slide 3 text

3 Building and deploying AI Infused applications ● Java developer, advocate, architect, engineer… ● Open source enthusiast, contributor ● InfoQ Java Technical Editor ● Ask me about #Java, backends, architecture, containers.. fosstodon.org/@shaaf sshaaf https://www.linkedin.com/in/shaaf/ shaaf.dev https://bsky.app/proﬁle/shaaf.dev

Slide 4

Slide 4 text

@shaaf.dev ● Systems do not speak Natural language, can’t translate and lack context outside of system boundaries. (e.g. sentiment) ● Generating content is costly and sometimes hard. ● Rapid data growth ● Rising Expectations: Customers demand instant, personalized solutions. ● Inefﬁciency: Manual processes increase costs and slow operations. ● Skill Gaps: Limited expertise in AI adoption. Systems, Data, Networks and a Solution?

Slide 5

Slide 5 text

@shaaf.dev Foundation models Learning without labels, adapt, tune, massive data appetite ● Tasks ○ Translation, Summarization, Writing, Q&A ● “Attention is All you need”, Transformer architecture ● Recognize, Predict, and Generate text ● Trained on a Billions of words ● Can also be tuned further A LLM predicts the next token based on its training data and statistical deduction Large Language Models

Slide 6

Slide 6 text

@shaaf.dev Tokens Tokenization: breaking down text into tokens. e.g., Byte Pair Encoding (BPE) or WordPiece); handle diverse languages and manage vocabulary size efficiently. [12488, 6391, 4014, 316, 1001, 6602, 11, 889, 1236, 4128, 25, 3862, 181386, 364, 61064, 9862, 1299, 166700, 1340, 413, 12648, 1511, 1991, 20290, 15683, 290, 27899, 11643, 25, 93643, 248, 52622, 122, 279, 168191, 328, 9862, 22378, 2491, 2613, 316, 2454, 1273, 1340, 413, 73263, 4717, 25, 220, 7633, 19354, 29338, 15] https://platform.openai.com/tokenizer "Running", “unpredictability” (word-based tokenization). Or: "run" " ning" ; “un” “predict” “ability” (subword-based tokenization, used by many LLMs). “Building Large Language Models from scratch” - Sebastian Raschka

Slide 7

Slide 7 text

@shaaf.dev Amazing things Stupid mistakes “..Do not mix accuracy with truth..”

Slide 8

Slide 8 text

@shaaf.dev Demo Overview

Slide 9

Slide 9 text

DEMO

Slide 10

Slide 10 text

@shaaf.dev Langchain4j

Slide 11

Slide 11 text

1 1 Thank you! Source for the demo https://github.com/sshaaf/gpt-java-chatbot Syed M Shaaf Developer Advocate Red Hat fosstodon.org/@shaaf sshaaf https://www.linkedin.com/in/shaaf/ shaaf.dev https://bsky.app/proﬁle/shaaf.dev