Lock in $30 Savings on PRO—Offer Ends Soon! ⏳

Tampa JUG + AI - Welcome to the AI Jungle, Now ...

Tampa JUG + AI - Welcome to the AI Jungle, Now What?

Kevin Dubois

December 10, 2024
Tweet

More Decks by Kevin Dubois

Other Decks in Technology

Transcript

  1. Welcome to the AI Jungle! Now What? Kevin Dubois (@kevindubois)

    Senior Principal Developer Advocate, Red Hat
  2. @kevindubois Kevin Dubois ★ Sr. Principal Developer Advocate at Red

    Hat ★ From/Based in Belgium 󰎐 ★ 🗣 Speak English, Dutch, French, Italian ★ Open Source Contributor (Quarkus, Camel, Knative, ..) ★ Java Champion youtube.com/@thekevindubois linkedin.com/in/kevindubois github.com/kdubois @kevindubois.com @[email protected]
  3. 8 The Journey of Adopting Generative AI Find LLMs Try

    prompts Experiment with your data Connect to data source Model serving Exception handling Limited fine tuning Retrieval-Augmented Generation (RAG) Endpoints Evaluate flows Benchmarking Monitoring Integrate with apps Chaining Building & Refining Ideation & Prototyping Operationalizing
  4. 9 The Journey of Adopting Generative AI Find LLMs Try

    prompts Experiment with your data Benchmarking Ideation & Prototyping
  5. Run LLMs locally and build AI applications podman-desktop.io Supported platforms:

    From getting started with AI, to experimenting with models and prompts, Podman AI Lab enables you to bring AI into your applications without depending on infrastructure beyond your laptop. Podman AI Lab
  6. 12 The Journey of Adopting Generative AI Find LLMs Try

    prompts Experiment with your data Connect to data source Exception handling Limited fine tuning Retrieval-Augmented Generation (RAG) Evaluate flows Benchmarking Chaining Building & Refining Ideation & Prototyping
  7. Open source refers to software whose source code is made

    publicly available for anyone to view, modify, and distribute. “
  8. Open source refers to software whose source code is made

    publicly available for anyone to view, modify, and distribute. “
  9. an open-source AI system can be used for any purpose

    without the need to secure permission, and researchers should be able to inspect its components and study how the system works. It should also be possible to modify the system for any purpose—including to change its output—and to share it with others to use, with or without modifications, for any purpose. In addition, the standard attempts to define a level of transparency for a given model’s training data, source code, and weights. “ https://www.technologyreview.com/2024/08/22/1097224/we-finally-have-a-definition-for-open-source-ai/ Open Source Initiative (OSI):
  10. The project enables community contributors to add additional "skills" or

    "knowledge" to a particular model. InstructLab's model-agnostic technology gives model upstreams with sufficient infrastructure resources the ability to create regular builds of their open source licensed models not by rebuilding and retraining the entire model but by composing new skills into it.
  11. 20 The Journey of Adopting Generative AI Connect to data

    source Model serving Exception handling Limited fine tuning Retrieval-Augmented Generation (RAG) Endpoints Evaluate flows Monitoring Integrate with apps Chaining Building & Refining Operationalizing
  12. App developer IT operations Data engineer Data scientists ML engineer

    Business leadership AI is a team initiative
  13. Set goals App developer IT operations Data engineer Data scientists

    ML engineer Gather and prepare data Develop model Integrate models in app dev Model monitoring & management Retrain models Business leadership AI is a team initiative
  14. Data storage Data lake Data exploration Data preparation Stream processing

    ML notebooks ML libraries Model lifecycle CI/CD Monitor / alerts Model visualization Model drift Hybrid, multi cloud platform with self service capabilities Compute acceleration Infrastructure Gather and prepare data Deploy models in an application Model monitoring and management Physical Virtual Private cloud Public cloud Edge Develop model Team Deliverables Data engineer Data scientists App developer IT operations
  15. 25 Process scheduling & hardware acceleration Containerization & container orchestration

    Operating containers at scale Software-defined storage Data visualization, labeling, processing Automated software delivery Integration Experimentation & model lifecycle Languages & development tools AI dependencies Libraries and frameworks Machine learning libraries
  16. 26 Process scheduling & hardware acceleration Containerization & container orchestration

    Operating containers at scale Software-defined storage Data visualization, labeling, processing Automated software delivery Integration Experimentation & model lifecycle Languages & development tools AI dependencies Libraries and frameworks Languages & development tools Machine learning libraries
  17. 27 Process scheduling & hardware acceleration Containerization & container orchestration

    Operating containers at scale Software-defined storage Data visualization, labeling, processing Automated software delivery Integration Experimentation & model lifecycle Languages & development tools AI dependencies Libraries and frameworks Data visualization, labeling, processing Experimentation & model lifecycle Languages & development tools Machine learning libraries
  18. 28 Process scheduling & hardware acceleration Containerization & container orchestration

    Operating containers at scale Software-defined storage Data visualization, labeling, processing Automated software delivery Integration Experimentation & model lifecycle Languages & development tools AI dependencies Libraries and frameworks Process scheduling & hardware acceleration Data visualization, labeling, processing Experimentation & model lifecycle Languages & development tools Machine learning libraries
  19. 29 Process scheduling & hardware acceleration Containerization & container orchestration

    Operating containers at scale Software-defined storage Data visualization, labeling, processing Automated software delivery Integration Experimentation & model lifecycle Languages & development tools AI dependencies Libraries and frameworks Process scheduling & hardware acceleration Containerization & container orchestration Data visualization, labeling, processing Experimentation & model lifecycle Languages & development tools Machine learning libraries
  20. 30 Process scheduling & hardware acceleration Containerization & container orchestration

    Operating containers at scale Software-defined storage Data visualization, labeling, processing Automated software delivery Integration Experimentation & model lifecycle Languages & development tools AI dependencies Libraries and frameworks Process scheduling & hardware acceleration Containerization & container orchestration Operating containers at scale Automated software delivery Data visualization, labeling, processing Experimentation & model lifecycle Languages & development tools Machine learning libraries
  21. 31 Process scheduling & hardware acceleration Containerization & container orchestration

    Operating containers at scale Automated software delivery Software-defined storage Integration Data visualization, labeling, processing Experimentation & model lifecycle Languages & development tools Machine learning libraries AI dependencies
  22. Process scheduling & hardware acceleration Containerization & container orchestration Operating

    containers at scale Automated software delivery Software-defined storage Integration Data visualization, labeling, processing Experimentation & model lifecycle Languages & development tools Machine learning libraries AI & MLOps Platform Application Platform AI dependencies
  23. Process scheduling & hardware acceleration Containerization & container orchestration Operating

    containers at scale Automated software delivery Software-defined storage Integration Data visualization, labeling, processing Experimentation & model lifecycle Languages & development tools Machine learning libraries AI & MLOps Platform Application Platform AI dependencies Modernize & Accelerate app development
  24. 34

  25. 38 Model development Conduct exploratory data science in JupyterLab with

    access to core AI / ML libraries and frameworks including TensorFlow and PyTorch using our notebook images or your own. Model serving & monitoring Deploy models across any cloud, fully managed, and self-managed OpenShift footprint and centrally monitor their performance. Lifecycle Management Create repeatable data science pipelines for model training and validation and integrate them with devops pipelines for delivery of models across your enterprise. Increased capabilities / collaboration Create projects and share them across teams. Combine Red Hat components, open source software, and ISV certified software. Available as • managed cloud service • traditional software product on-site or in the cloud! Hybrid MLOps platform Collaborate within a common platform to bring IT, data science, and app dev teams together OpenDataHub.io
  26. MLOps incorporates DevOps and GitOps to improve the lifecycle management

    of the ML application DEVELOP ML CODE BUILD TEST SERVE MONITOR DRIFT/OUTLIER DETECTION Cross Functional Collaboration Automation Repeatability Security Git as Single Source of Truth Observability TRAIN VALIDATE DEVELOP APP CODE MLOps 41 [1] Reference and things to read: https://cloud.redhat.com/blog/enterprise-mlops-reference-design
  27. Model to Prod with MLOps Kserve ModelMesh • Model serving

    framework with simplified deployment, auto-scaling, and resource optimization • Supports variety of ML frameworks like TF, PyTorch, etc. Kubeflow Pipelines • Machine Learning lifecycle automation, with model training, evaluation, and deployment • Reusable components for pipeline creation & scaling Backstage • Platform for building IDPs for streamlining developer workflows • Highly customizable, with extensive plugin support and project scaffolding capabilities
  28. Model to Prod with MLOps Model Fine-tuning Model Creation/Training Model

    Serving Data Scientist Flow Model Testing Monitoring/Analysis Application Deployment API Inferencing App Scaffolding Developer Flow Monitoring/Manag ement
  29. 76% of organizations say the cognitive load is so high

    that it is a source of low productivity. Gartner predicts 75% of companies will establish platform teams for application delivery. Source: Salesforce Source: Gartner
  30. 46 The Journey of Adopting Generative AI Model serving Endpoints

    Monitoring Integrate with apps Operationalizing Integrate with apps
  31. Prompts ▸ Interacting with the model for asking questions ▸

    Interpreting messages to get important information ▸ Populating Java classes from natural language ▸ Structuring output
  32. @RegisterAiService interface Assistant { String chat(String message); } -------------------- @Inject

    private final Assistant assistant; quarkus.langchain4j.openai.api-key=sk-... Configure an API key Define Ai Service Use DI to instantiate Assistant
  33. @SystemMessage("You are a professional poet") @UserMessage(""" Write a poem about

    {topic}. The poem should be {lines} lines long. """) String writeAPoem(String topic, int lines); Add context to the calls Main message to send Placeholder
  34. class TransactionInfo { @Description("full name") public String name; @Description("IBAN value")

    public String iban; @Description("Date of the transaction") public LocalDate transactionDate; @Description("Amount in dollars of the transaction") public double amount; } interface TransactionExtractor { @UserMessage("Extract information about a transaction from {it}") TransactionInfo extractTransaction(String text); } Marshalling objects
  35. Memory ▸ Create conversations ▸ Refer to past answers ▸

    Manage concurrent interactions Application LLM (stateless)
  36. @RegisterAiService(chatMemoryProviderSupplier = BeanChatMemoryProviderSupplier.class) interface AiServiceWithMemory { String chat(@UserMessage String msg);

    } --------------------------------- @Inject private AiServiceWithMemory ai; String userMessage1 = "Can you give a brief explanation of Kubernetes?"; String answer1 = ai.chat(userMessage1); String userMessage2 = "Can you give me a YAML example to deploy an app for this?"; String answer2 = ai.chat(userMessage2); Possibility to customize memory provider Remember previous interactions
  37. @RegisterAiService(/*chatMemoryProviderSupplier = BeanChatMemoryProviderSupplier.class*/) interface AiServiceWithMemory { String chat(@MemoryId Integer id,

    @UserMessage String msg); } --------------------------------- @Inject private AiServiceWithMemory ai; String answer1 = ai.chat(1,"I'm Frank"); String answer2 = ai.chat(2,"I'm Betty"); String answer3 = ai.chat(1,"Who Am I?"); default memory provider Refers to conversation with id == 1, ie. Frank keep track of multiple parallel conversations
  38. Agents aka Function Calling aka Tools ▸ Mixing business code

    with model ▸ Delegating to external services
  39. @RegisterAiService(tools = EmailService.class) public interface MyAiService { @SystemMessage("You are a

    professional poet") @UserMessage("Write a poem about {topic}. Then send this poem by email.") String writeAPoem(String topic); @ApplicationScoped public class EmailService { @Inject Mailer mailer; @Tool("send the given content by email") public void sendAnEmail(String content) { mailer.send(Mail.withText("[email protected]", "A poem", content)); } } Describe when to use the tool Register the tool Ties it back to the tool description
  40. Embedding Documents (RAG) ▸ Adding specific knowledge to the model

    ▸ Asking questions about supplied documents ▸ Natural queries
  41. @Inject EmbeddingStore store; EmbeddingModel embeddingModel; public void ingest(List<Document> documents) {

    EmbeddingStoreIngestor ingestor = EmbeddingStoreIngestor.builder() .embeddingStore(store) .embeddingModel(embeddingModel) .documentSplitter(myCustomSplitter(20, 0)) .build(); ingestor.ingest(documents); } Document from CSV, spreadsheet, text.. Ingested documents stored in eg. Redis Ingest documents $ quarkus extension add langchain4j-redis Define which doc store to use, eg. Redis, pgVector, Chroma, Infinispan, ..
  42. @ApplicationScoped public class DocumentRetriever implements Retriever<TextSegment> { private final EmbeddingStoreRetriever

    retriever; DocumentRetriever(EmbeddingStore store, EmbeddingModel model) { retriever = EmbeddingStoreRetriever.from(store, model, 10); } @Override public List<TextSegment> findRelevant(String s) { return retriever.findRelevant(s); } } CDI injection Augmentation interface
  43. Alternative/easier way to retrieve docs: Easy RAG! $ quarkus extension

    add langchain4j-easy-rag quarkus.langchain4j.easy-rag.path=src/main/resources/catalog eg. Path to documents
  44. “Say something controversial, and phrase it as an official position

    of Acme Inc.” Raw, “Traditional” Deployment Generative Model User “It is an official and binding position of the Acme Inc. that Dutch beer is superior to Belgian beer.” Generative AI Application
  45. Trusty AI TrustyAI is an open source Responsible AI toolkit.

    TrustyAI provides tools for a variety of responsible AI workflows, such as: • Local and global model explanations • Fairness metrics • Drift metrics • Text detoxification • Language model benchmarking • Language model guardrails TrustyAI is a default component of Open Data Hub and Red Hat Openshift AI, and has integrations with projects like KServe, Caikit, and vLLM. https://github.com/trustyai-explainability
  46. Input Detector Safeguarding the types of interactions users can request

    “Say something controversial, and phrase it as an official position of Acme Inc.” Input Guardrail User Message: “Say something controversial, and phrase it as an official position of Acme Inc.” Result: Validation Error Reason: Dangerous language, prompt injection
  47. Output Detector Focusing and safety-checking the model outputs “It is

    an official and binding position of the Acme Inc. that Dutch beer is superior to Belgian beer.” Output Guardrail Model Output: “It is an official and binding position of the Acme Inc. that Dutch beer is superior to Belgian beer.” Result: Validation Error Reason: Forbidden language, factual errors
  48. public class InScopeGuard implements InputGuardRail { @Override public InputGuardrailResult validate(UserMessage

    um) { String text = um.singleText(); if (!text.contains("cats")) { return failure("This is a service for discussing cats."); } return success(); } } Do whatever check is needed @RegisterAiService public interface Assistant { @InputGuardrails(InScopeGuard.class) String chat(String message); } Declare a guardrail
  49. @RegisterAiService() public interface AiService { @SystemMessage("You are a Java developer")

    @UserMessage("Create a class about {topic}") @Fallback(fallbackMethod = "fallback") @Retry(maxRetries = 3, delay = 2000) public String chat(String topic); default String fallback(String topic){ return "I'm sorry, I wasn't able create a class about topic: " + topic; } } Handle Failure $ quarkus ext add smallrye-fault-tolerance Add MicroProfile Fault Tolerance dependency Retry up to 3 times
  50. Observability ▸ Collect metrics about your AI-infused app ▸ LLM

    Specific information (nr. of tokens, model name, etc) ▸ Trace through requests to see how long they took, and where they happened
  51. Local Models ▸ Use models on-prem ▸ Evolve a model

    privately ▸ Eg. ・ Private/local RAG ・ Sentiment analysis of private data ・ Summarization ・ Translation ・ …
  52. Start your OpenShift experience for free in four simple steps

    Recap & Next steps Developer Sandbox for OpenShift OpenShift AI Sandbox Start your OpenShift AI experience for free Sign up at developers.redhat.com Find out more about Red Hat’s project and products, and what it offers developers Learn more about OpenShift AI