Nilesh Gule @nileshgule
Infuse Intelligence
into Apps with
Foundry Local
$whoami
{
“name” : “Nilesh Gule”,
“website” : “https://www.HandsOnArchitect.com",
“github” : “https://GitHub.com/NileshGule"
“twitter” : “@nileshgule”,
“linkedin” : “https://www.linkedin.com/in/nileshgule”,
“YouTube” : “https://www.YouTube.com/@nilesh-gule”
“likes” : “Technical Evangelism, Cricket”,
}
What is Foundry Local? High Performance ONNX Runtime
Windows: Integrated and optimized with
hardware vendors
Mac: GPU Acceleration on Apple Silicon
Foundry Local Management
Service
Download and run models at runtime
Foundry CLI & SDK
CLI: Manage models, tools & agents
SDK: Integrate and interact with model
management and local inference
Local AI Agents using MCP
Call local tools for smart automations
/demo
Foundry Local CLI
Foundry Local .NET SDK .Net App
Logic for interacting with the model
C# SDK
Manages Foundry model
OpenAI client
Connects to actual model and performs chat
completion operations
Foundry Local Service
Programmatic access to Model cache and
catalog
/demo
Celebrity Impersonator
Foundry Local Key features
Seamless
Integration
Connect with
your applications
through SDK, API
endpoints or CLI.
On Device
Inference
Run models
locally on your
own hardware,
reducing costs
while keeping all
your data on your
device.
Model
Customization
Use preset
models or your
own models to
meet specific
requirements.
Cost Efficiency
Make AI more
accessible by
eliminating cloud
service costs.
Foundry Local Use Cases
Data Protection
Keep sensitive
data on your
device.
Limited or no
internet
connectivity
Reduce Cloud
inference costs
Low latency
AI responses for
real time
applications
Experimentation
Experiment with
AI models before
deploying to
cloud
environments!
MS Build 2025 videos
Building cutting edge on-device experiences Local AI dev with .NET Aspire
Resources
• MS Learn - What is AI Foundry Local
• MS Learn - Foundry Local Architecture
• Unlock instant on device AI with Foundry Local
• Foundry Local SDK
• Foundry Local Documentation
Source Code & Slide deck
https://speakerdeck.com/nileshgule/
https://www.slideshare.net/nileshgule/
How Would They React? Aka Celebrity Impersonator
https://github.com/NileshGule/how-would-they-react
Nilesh Gule
ARCHITECT | MICROSOFT MVP
“Code with Passion and
Strive for Excellence”
nileshgule
@nileshgule Nilesh Gule
NileshGule
www.handsonarchitect.com
https://www.youtube.com/@nilesh-gule
Q&A

Infuse Intelligence Into your App with Foundry Local.pdf

  • 1.
    Nilesh Gule @nileshgule InfuseIntelligence into Apps with Foundry Local
  • 2.
    $whoami { “name” : “NileshGule”, “website” : “https://www.HandsOnArchitect.com", “github” : “https://GitHub.com/NileshGule" “twitter” : “@nileshgule”, “linkedin” : “https://www.linkedin.com/in/nileshgule”, “YouTube” : “https://www.YouTube.com/@nilesh-gule” “likes” : “Technical Evangelism, Cricket”, }
  • 3.
    What is FoundryLocal? High Performance ONNX Runtime Windows: Integrated and optimized with hardware vendors Mac: GPU Acceleration on Apple Silicon Foundry Local Management Service Download and run models at runtime Foundry CLI & SDK CLI: Manage models, tools & agents SDK: Integrate and interact with model management and local inference Local AI Agents using MCP Call local tools for smart automations
  • 4.
  • 5.
    Foundry Local .NETSDK .Net App Logic for interacting with the model C# SDK Manages Foundry model OpenAI client Connects to actual model and performs chat completion operations Foundry Local Service Programmatic access to Model cache and catalog
  • 6.
  • 7.
    Foundry Local Keyfeatures Seamless Integration Connect with your applications through SDK, API endpoints or CLI. On Device Inference Run models locally on your own hardware, reducing costs while keeping all your data on your device. Model Customization Use preset models or your own models to meet specific requirements. Cost Efficiency Make AI more accessible by eliminating cloud service costs.
  • 8.
    Foundry Local UseCases Data Protection Keep sensitive data on your device. Limited or no internet connectivity Reduce Cloud inference costs Low latency AI responses for real time applications Experimentation Experiment with AI models before deploying to cloud environments!
  • 9.
    MS Build 2025videos Building cutting edge on-device experiences Local AI dev with .NET Aspire
  • 10.
    Resources • MS Learn- What is AI Foundry Local • MS Learn - Foundry Local Architecture • Unlock instant on device AI with Foundry Local • Foundry Local SDK • Foundry Local Documentation
  • 11.
    Source Code &Slide deck https://speakerdeck.com/nileshgule/ https://www.slideshare.net/nileshgule/ How Would They React? Aka Celebrity Impersonator https://github.com/NileshGule/how-would-they-react
  • 12.
    Nilesh Gule ARCHITECT |MICROSOFT MVP “Code with Passion and Strive for Excellence” nileshgule @nileshgule Nilesh Gule NileshGule www.handsonarchitect.com https://www.youtube.com/@nilesh-gule
  • 13.