1 | © Copyright 8/16/23 Zilliz
1 | © Copyright 8/16/23 Zilliz
Agentic AI in Action
Real-Time Vision, Memory & Autonomy with Browser Use &
Milvus
Webinar - 20250313
Stephen Batifol
2 | © Copyright 8/16/23 Zilliz
2 | © Copyright 8/16/23 Zilliz
Stephen Batifol
Developer Advocate, Zilliz / Milvus
About Me
stephen.batifol@zilliz.com
linkedin.com/in/stephen-batifol/
@stephenbtl
3 | © Copyright 8/16/23 Zilliz
3 | © Copyright 8/16/23 Zilliz
| © Copyright 8/16/23 Zilliz
3
What are Agentic
Systems?
4 | © Copyright 8/16/23 Zilliz
4 | © Copyright 8/16/23 Zilliz
● Workflows – Systems where LLMs and tools are orchestrated
through predefined code paths
● Agents – Systems where LLMs dynamically direct their own
processes and tool usage, maintaining control over how they
accomplish tasks.
Agentic Systems
https://www.anthropic.com/engineering/building-effective-agents
5 | © Copyright 8/16/23 Zilliz
5 | © Copyright 8/16/23 Zilliz
● Agentic Systems trade latency and cost for better performance
● Workflows offer predictability and consistency for well-defined
tasks
● Agents are a better option when flexibility and model-driven
decision-making are needed at scale.
When (and when not) to use Agents
6 | © Copyright 8/16/23 Zilliz
6 | © Copyright 8/16/23 Zilliz
6 | © Copyright 8/16/23 Zilliz
6 | © Copyright 8/16/23 Zilliz
Examples
7 | © Copyright 8/16/23 Zilliz
7 | © Copyright 8/16/23 Zilliz
Augmented LLM
8 | © Copyright 8/16/23 Zilliz
8 | © Copyright 8/16/23 Zilliz
Workflow: Prompt Chaining
9 | © Copyright 8/16/23 Zilliz
9 | © Copyright 8/16/23 Zilliz
Workflow: Routing
10 | © Copyright 8/16/23 Zilliz
10 | © Copyright 8/16/23 Zilliz
Agents
11 | © Copyright 8/16/23 Zilliz
11 | © Copyright 8/16/23 Zilliz
When to use Agents?
● Used for open-ended problems
○ Difficult to predict the required number of steps
○ Canʼt hardcode a fixed path
● Use sandboxed environments if possible
● Appropriate Guardrails
12 | © Copyright 8/16/23 Zilliz
12 | © Copyright 8/16/23 Zilliz
12 | © Copyright 8/16/23 Zilliz
12 | © Copyright 8/16/23 Zilliz
Vector Search
& Vector DBs
13 | © Copyright 8/16/23 Zilliz
13 | © Copyright 8/16/23 Zilliz
Vectors Unlock Unstructured Data
14 | © Copyright 8/16/23 Zilliz
14 | © Copyright 8/16/23 Zilliz
Vector Space
15 | © Copyright 8/16/23 Zilliz
15 | © Copyright 8/16/23 Zilliz
How Similarity Search works
Vn, 1
…
…
…
1
2
3
4
5
Transform into
Vectors
Unstructured Data
Images
User Generated
Content
Video
Documents
Audio
Vector Embeddings
Perform Approximate
Nearest Neighbor
Similarity Search
Perform Query
Get Results
Store in Vector Database
16 | © Copyright 8/16/23 Zilliz
16 | © Copyright 8/16/23 Zilliz
| © Copyright 8/16/23 Zilliz
16
Milvus
17 | © Copyright 8/16/23 Zilliz
17 | © Copyright 8/16/23 Zilliz
● pip install on your laptop
● Plug into your favorite AI dev tools
● Push to production with a single line of code
Easy to start
18 | © Copyright 8/16/23 Zilliz
18 | © Copyright 8/16/23 Zilliz
Bulk Import GPU, Intel & ARM
CPU support
Disk Based
Index
Tiered Storage
Million+ level
tenant support
Hybrid Search
Dense & Sparse
RBAC, TLS,
Encryption
Float, Binary, &
Sparse Vector
Tag+Vector
Optimized Filtering
Dynamic Schema
Feature rich
19 | © Copyright 8/16/23 Zilliz
19 | © Copyright 8/16/23 Zilliz
Milvus Lite Milvus Standalone Milvus Distributed
● Ideal for prototyping,
small scale
experiments.
● Easy to set up and
use, pip install
pymilvus
● Scale to ≈1M vectors
● Run on K8s
● Load balancer and
Multi-Node
Management
● Scaling of each
component
independently
● Scale to 100B
vectors
● Single-Node
Deployment
● Bundled in a single
Docker Image
● Supports Primary/
Secondary
● Scale up to 100M
vectors
Ready to scale 🚀
Write your code once, and run it everywhere, at scale!
● API and SDK are the same
20 | © Copyright 8/16/23 Zilliz
20 | © Copyright 8/16/23 Zilliz
20 | © Copyright 8/16/23 Zilliz
20 | © Copyright 8/16/23 Zilliz
More than Vectors?
21 | © Copyright 8/16/23 Zilliz
21 | © Copyright 8/16/23 Zilliz
Future of search is combining different search techniques
1. Semantic Search
2. Keyword Search
3. Filtering
All, in one unified platform
Our Vision is more than just Vectors
22 | © Copyright 8/16/23 Zilliz
22 | © Copyright 8/16/23 Zilliz
● Vector search is great for semantic
understanding
○ Can miss exact keyword matches
● Run two separate systems
○ Vector DB for semantic search
○ Elasticsearch/similar for keyword search
Results in complex architecture and
operational overhead
The Search Dilemma
23 | © Copyright 8/16/23 Zilliz
23 | © Copyright 8/16/23 Zilliz
| © Copyright 8/16/23 Zilliz
23
Milvus - Full Text Search
24 | © Copyright 8/16/23 Zilliz
24 | © Copyright 8/16/23 Zilliz
Why Full Text Search?
● Augment the Search Quality of Embedding based
Semantic Search
● Provide Search with more emphasis on Keyword
Matching
● Easy Hybrid Search of BM25  Dense
Embeddings in a single system.
25 | © Copyright 8/16/23 Zilliz
25 | © Copyright 8/16/23 Zilliz
Milvusʼ Approach
The user can almost forget about Vectors
● Insert the raw text into Milvus and search using the text.
Milvus takes care of
● Text Analyzing and Tokenization
● Term distribution statistics management
● Document/Query vector encoding
● BM25 based scoring
26 | © Copyright 8/16/23 Zilliz
26 | © Copyright 8/16/23 Zilliz
Full Text Search
27 | © Copyright 8/16/23 Zilliz
27 | © Copyright 8/16/23 Zilliz
27 | © Copyright 8/16/23 Zilliz
27 | © Copyright 8/16/23 Zilliz
What we are building
28 | © Copyright 8/16/23 Zilliz
28 | © Copyright 8/16/23 Zilliz
A Search on X about Milvus
29 | © Copyright 8/16/23 Zilliz
29 | © Copyright 8/16/23 Zilliz
● Combining visual understanding with context awareness
● Build an assistant that knows the difference between a black kiteʼs
migration patterns and a new article about Milvus Vector DB
● In the future - Tell us about what Users are saying
AI for Smarter Browsing on Socials
30 | © Copyright 8/16/23 Zilliz
30 | © Copyright 8/16/23 Zilliz
30 | © Copyright 8/16/23 Zilliz
30 | © Copyright 8/16/23 Zilliz
Tech Stack
31 | © Copyright 8/16/23 Zilliz
31 | © Copyright 8/16/23 Zilliz
● Enable AI Agents to control your browser
Features:
● Multi-tab Management
● Vision + HTML Extraction
● Custom Actions
● Self-Correcting
● Different LLM support
Browser Use
32 | © Copyright 8/16/23 Zilliz
32 | © Copyright 8/16/23 Zilliz
● Natively multimodal
● Strong performance on multimodal tasks,
excels in instruction following
● 1M Input token
● Supports Text, Image, Video, Audio
Gemini Flash
33 | © Copyright 8/16/23 Zilliz
33 | © Copyright 8/16/23 Zilliz
Structured Output
Make it possible to generate either a JSON or a Pydantic object.
Benefits:
● Reliable type-safety: No need to validate or retry incorrectly
formatted responses
● Explicit refusals: Safety-based model refusals are now
programmatically detectable
● Simpler prompting: No need for strongly worded prompts to
achieve consistent formatting
34 | © Copyright 8/16/23 Zilliz
34 | © Copyright 8/16/23 Zilliz
Structured Output
JSON Pydantic
35 | © Copyright 8/16/23 Zilliz
35 | © Copyright 8/16/23 Zilliz
● pip install on your laptop
● Plug into your favorite AI dev tools
● Push to production with a single line of code
Milvus
36 | © Copyright 8/16/23 Zilliz
36 | © Copyright 8/16/23 Zilliz
Architecture
37 | © Copyright 8/16/23 Zilliz
37 | © Copyright 8/16/23 Zilliz
| © Copyright 8/16/23 Zilliz
37
Demo!
38 | © Copyright 8/16/23 Zilliz
38 | © Copyright 8/16/23 Zilliz
milvus.io
github.com/milvus-io/
@milvusio
@stephenbtl
/in/stephen-batifol
Thank you

Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & Milvus

  • 1.
    1 | ©Copyright 8/16/23 Zilliz 1 | © Copyright 8/16/23 Zilliz Agentic AI in Action Real-Time Vision, Memory & Autonomy with Browser Use & Milvus Webinar - 20250313 Stephen Batifol
  • 2.
    2 | ©Copyright 8/16/23 Zilliz 2 | © Copyright 8/16/23 Zilliz Stephen Batifol Developer Advocate, Zilliz / Milvus About Me stephen.batifol@zilliz.com linkedin.com/in/stephen-batifol/ @stephenbtl
  • 3.
    3 | ©Copyright 8/16/23 Zilliz 3 | © Copyright 8/16/23 Zilliz | © Copyright 8/16/23 Zilliz 3 What are Agentic Systems?
  • 4.
    4 | ©Copyright 8/16/23 Zilliz 4 | © Copyright 8/16/23 Zilliz ● Workflows – Systems where LLMs and tools are orchestrated through predefined code paths ● Agents – Systems where LLMs dynamically direct their own processes and tool usage, maintaining control over how they accomplish tasks. Agentic Systems https://www.anthropic.com/engineering/building-effective-agents
  • 5.
    5 | ©Copyright 8/16/23 Zilliz 5 | © Copyright 8/16/23 Zilliz ● Agentic Systems trade latency and cost for better performance ● Workflows offer predictability and consistency for well-defined tasks ● Agents are a better option when flexibility and model-driven decision-making are needed at scale. When (and when not) to use Agents
  • 6.
    6 | ©Copyright 8/16/23 Zilliz 6 | © Copyright 8/16/23 Zilliz 6 | © Copyright 8/16/23 Zilliz 6 | © Copyright 8/16/23 Zilliz Examples
  • 7.
    7 | ©Copyright 8/16/23 Zilliz 7 | © Copyright 8/16/23 Zilliz Augmented LLM
  • 8.
    8 | ©Copyright 8/16/23 Zilliz 8 | © Copyright 8/16/23 Zilliz Workflow: Prompt Chaining
  • 9.
    9 | ©Copyright 8/16/23 Zilliz 9 | © Copyright 8/16/23 Zilliz Workflow: Routing
  • 10.
    10 | ©Copyright 8/16/23 Zilliz 10 | © Copyright 8/16/23 Zilliz Agents
  • 11.
    11 | ©Copyright 8/16/23 Zilliz 11 | © Copyright 8/16/23 Zilliz When to use Agents? ● Used for open-ended problems ○ Difficult to predict the required number of steps ○ Canʼt hardcode a fixed path ● Use sandboxed environments if possible ● Appropriate Guardrails
  • 12.
    12 | ©Copyright 8/16/23 Zilliz 12 | © Copyright 8/16/23 Zilliz 12 | © Copyright 8/16/23 Zilliz 12 | © Copyright 8/16/23 Zilliz Vector Search & Vector DBs
  • 13.
    13 | ©Copyright 8/16/23 Zilliz 13 | © Copyright 8/16/23 Zilliz Vectors Unlock Unstructured Data
  • 14.
    14 | ©Copyright 8/16/23 Zilliz 14 | © Copyright 8/16/23 Zilliz Vector Space
  • 15.
    15 | ©Copyright 8/16/23 Zilliz 15 | © Copyright 8/16/23 Zilliz How Similarity Search works Vn, 1 … … … 1 2 3 4 5 Transform into Vectors Unstructured Data Images User Generated Content Video Documents Audio Vector Embeddings Perform Approximate Nearest Neighbor Similarity Search Perform Query Get Results Store in Vector Database
  • 16.
    16 | ©Copyright 8/16/23 Zilliz 16 | © Copyright 8/16/23 Zilliz | © Copyright 8/16/23 Zilliz 16 Milvus
  • 17.
    17 | ©Copyright 8/16/23 Zilliz 17 | © Copyright 8/16/23 Zilliz ● pip install on your laptop ● Plug into your favorite AI dev tools ● Push to production with a single line of code Easy to start
  • 18.
    18 | ©Copyright 8/16/23 Zilliz 18 | © Copyright 8/16/23 Zilliz Bulk Import GPU, Intel & ARM CPU support Disk Based Index Tiered Storage Million+ level tenant support Hybrid Search Dense & Sparse RBAC, TLS, Encryption Float, Binary, & Sparse Vector Tag+Vector Optimized Filtering Dynamic Schema Feature rich
  • 19.
    19 | ©Copyright 8/16/23 Zilliz 19 | © Copyright 8/16/23 Zilliz Milvus Lite Milvus Standalone Milvus Distributed ● Ideal for prototyping, small scale experiments. ● Easy to set up and use, pip install pymilvus ● Scale to ≈1M vectors ● Run on K8s ● Load balancer and Multi-Node Management ● Scaling of each component independently ● Scale to 100B vectors ● Single-Node Deployment ● Bundled in a single Docker Image ● Supports Primary/ Secondary ● Scale up to 100M vectors Ready to scale 🚀 Write your code once, and run it everywhere, at scale! ● API and SDK are the same
  • 20.
    20 | ©Copyright 8/16/23 Zilliz 20 | © Copyright 8/16/23 Zilliz 20 | © Copyright 8/16/23 Zilliz 20 | © Copyright 8/16/23 Zilliz More than Vectors?
  • 21.
    21 | ©Copyright 8/16/23 Zilliz 21 | © Copyright 8/16/23 Zilliz Future of search is combining different search techniques 1. Semantic Search 2. Keyword Search 3. Filtering All, in one unified platform Our Vision is more than just Vectors
  • 22.
    22 | ©Copyright 8/16/23 Zilliz 22 | © Copyright 8/16/23 Zilliz ● Vector search is great for semantic understanding ○ Can miss exact keyword matches ● Run two separate systems ○ Vector DB for semantic search ○ Elasticsearch/similar for keyword search Results in complex architecture and operational overhead The Search Dilemma
  • 23.
    23 | ©Copyright 8/16/23 Zilliz 23 | © Copyright 8/16/23 Zilliz | © Copyright 8/16/23 Zilliz 23 Milvus - Full Text Search
  • 24.
    24 | ©Copyright 8/16/23 Zilliz 24 | © Copyright 8/16/23 Zilliz Why Full Text Search? ● Augment the Search Quality of Embedding based Semantic Search ● Provide Search with more emphasis on Keyword Matching ● Easy Hybrid Search of BM25  Dense Embeddings in a single system.
  • 25.
    25 | ©Copyright 8/16/23 Zilliz 25 | © Copyright 8/16/23 Zilliz Milvusʼ Approach The user can almost forget about Vectors ● Insert the raw text into Milvus and search using the text. Milvus takes care of ● Text Analyzing and Tokenization ● Term distribution statistics management ● Document/Query vector encoding ● BM25 based scoring
  • 26.
    26 | ©Copyright 8/16/23 Zilliz 26 | © Copyright 8/16/23 Zilliz Full Text Search
  • 27.
    27 | ©Copyright 8/16/23 Zilliz 27 | © Copyright 8/16/23 Zilliz 27 | © Copyright 8/16/23 Zilliz 27 | © Copyright 8/16/23 Zilliz What we are building
  • 28.
    28 | ©Copyright 8/16/23 Zilliz 28 | © Copyright 8/16/23 Zilliz A Search on X about Milvus
  • 29.
    29 | ©Copyright 8/16/23 Zilliz 29 | © Copyright 8/16/23 Zilliz ● Combining visual understanding with context awareness ● Build an assistant that knows the difference between a black kiteʼs migration patterns and a new article about Milvus Vector DB ● In the future - Tell us about what Users are saying AI for Smarter Browsing on Socials
  • 30.
    30 | ©Copyright 8/16/23 Zilliz 30 | © Copyright 8/16/23 Zilliz 30 | © Copyright 8/16/23 Zilliz 30 | © Copyright 8/16/23 Zilliz Tech Stack
  • 31.
    31 | ©Copyright 8/16/23 Zilliz 31 | © Copyright 8/16/23 Zilliz ● Enable AI Agents to control your browser Features: ● Multi-tab Management ● Vision + HTML Extraction ● Custom Actions ● Self-Correcting ● Different LLM support Browser Use
  • 32.
    32 | ©Copyright 8/16/23 Zilliz 32 | © Copyright 8/16/23 Zilliz ● Natively multimodal ● Strong performance on multimodal tasks, excels in instruction following ● 1M Input token ● Supports Text, Image, Video, Audio Gemini Flash
  • 33.
    33 | ©Copyright 8/16/23 Zilliz 33 | © Copyright 8/16/23 Zilliz Structured Output Make it possible to generate either a JSON or a Pydantic object. Benefits: ● Reliable type-safety: No need to validate or retry incorrectly formatted responses ● Explicit refusals: Safety-based model refusals are now programmatically detectable ● Simpler prompting: No need for strongly worded prompts to achieve consistent formatting
  • 34.
    34 | ©Copyright 8/16/23 Zilliz 34 | © Copyright 8/16/23 Zilliz Structured Output JSON Pydantic
  • 35.
    35 | ©Copyright 8/16/23 Zilliz 35 | © Copyright 8/16/23 Zilliz ● pip install on your laptop ● Plug into your favorite AI dev tools ● Push to production with a single line of code Milvus
  • 36.
    36 | ©Copyright 8/16/23 Zilliz 36 | © Copyright 8/16/23 Zilliz Architecture
  • 37.
    37 | ©Copyright 8/16/23 Zilliz 37 | © Copyright 8/16/23 Zilliz | © Copyright 8/16/23 Zilliz 37 Demo!
  • 38.
    38 | ©Copyright 8/16/23 Zilliz 38 | © Copyright 8/16/23 Zilliz milvus.io github.com/milvus-io/ @milvusio @stephenbtl /in/stephen-batifol Thank you