Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Infuse Intelligence into your Apps with Foundry...

Infuse Intelligence into your Apps with Foundry Local

Slide deck used during the Azure Singapore user group session on how to infuse intelligence into Apps using Foundry Local
https://www.meetup.com/mssgug/events/311370544/

Avatar for Nilesh Gule

Nilesh Gule

November 13, 2025
Tweet

More Decks by Nilesh Gule

Other Decks in Technology

Transcript

  1. $whoami { “name” : “Nilesh Gule”, “role” : “Senior Cloud

    Solutions Architect at Avanade” “website” : “https://www.HandsOnArchitect.com", “github” : “https://GitHub.com/NileshGule" “twitter” : “@nileshgule”, “linkedin” : “https://www.linkedin.com/in/nileshgule”, “YouTube” : “https://www.YouTube.com/@nilesh-gule” “likes” : “Technical Evangelism, Cricket”, }
  2. What is Foundry Local? High Performance ONNX Runtime Windows: Integrated

    and optimized with hardware vendors Mac: GPU Acceleration on Apple Silicon Foundry Local Management Service Download and run models at runtime Foundry CLI & SDK CLI: Manage models, tools & agents SDK: Integrate and interact with model management and local inference Local AI Agents using MCP Call local tools for smart automations
  3. Foundry Local .NET SDK .Net App Logic for interacting with

    the model C# SDK Manages Foundry model OpenAI client Connects to actual model and performs chat completion operations Foundry Local Service Programmatic access to Model cache and catalog
  4. Foundry Local Key features Seamless Integration Connect with your applications

    through SDK, API endpoints or CLI. On Device Inference Run models locally on your own hardware, reducing costs while keeping all your data on your device. Model Customization Use preset models or your own models to meet specific requirements. Cost Efficiency Make AI more accessible by eliminating cloud service costs.
  5. Foundry Local Use Cases Data Protection Keep sensitive data on

    your device. Limited or no internet connectivity Reduce Cloud inference costs Low latency AI responses for real time applications Experimentation Experiment with AI models before deploying to cloud environments!
  6. Resources • MS Learn - What is AI Foundry Local

    • MS Learn - Foundry Local Architecture • Unlock instant on device AI with Foundry Local • Foundry Local SDK • Foundry Local Documentation
  7. Source Code & Slide deck https://speakerdeck.com/nileshgule/ https://www.slideshare.net/nileshgule/ How Would They

    React? Aka Celebrity Impersonator https://github.com/NileshGule/how-would-they-react
  8. Nilesh Gule CLOUD SOLUTIONS ARCHITECT | EX MICROSOFT MVP “Code

    with Passion and Strive for Excellence” nileshgule @nileshgule Nilesh Gule NileshGule www.handsonarchitect.com https://www.youtube.com/@nilesh-gule
  9. Q&A