From the course: Oracle Cloud Infrastructure Generative AI Professional Cert Prep
Unlock this course with a free trial
Join today to access over 24,900 courses taught by industry experts.
Chatbot technical architecture - Oracle Cloud Infrastructure Tutorial
From the course: Oracle Cloud Infrastructure Generative AI Professional Cert Prep
Chatbot technical architecture
(gentle music) - [Hemant] So far, we have seen how retrieval augmented generation works through lessons and demos. Let us recap it through right now. RAG is split into two parts, indexing and retrieval and generation. We index documents by first loading them. Next we split documents into manageable chunks. Next, we embed the chunks and persist the embedded chunks in the vector databases, like Chroma for FAISS for retrieval later. LangChain offers a variety of document loaders, splitters, and embedding models, and we have used some of these in our demos. We also have seen how we persist embedded chunks into the Vector databases like Chroma and FAISS. For document retrieval, we begin with establishing a connection to a persistent database or loading a database. Next, we create a retrieval chain. Chain uses a retriever and memory to pull relevant documents and chat history by using the retrieval memory. These are sent to the LLM along with a question that is input from a UI like…
Contents
-
-
-
-
-
-
(Locked)
Module introduction54s
-
(Locked)
Chatbot introduction1m 16s
-
(Locked)
Demo: Chatbot7m 25s
-
(Locked)
Q&A chatbot architecture and basic components2m 31s
-
(Locked)
Models, prompts, and chains4m 15s
-
(Locked)
Demo: Set up development environment2m 44s
-
(Locked)
Demo: Use prompts, models, and chains10m 22s
-
(Locked)
Extending a chatbot by adding memory1m 52s
-
(Locked)
Demo: Using memory5m 47s
-
(Locked)
Demo: Using memory with Streamlit5m 59s
-
(Locked)
Extending a chatbot by adding RAG and a vector database2m 5s
-
(Locked)
Demo: RAG: Indexing using a vector database5m 24s
-
(Locked)
Demo: RAG: Retrieval and generation using a vector database5m 12s
-
(Locked)
Extending a chatbot by adding RAG plus memory45s
-
(Locked)
Demo: RAG plus memory plus tracing8m 43s
-
(Locked)
Demo: Model evaluation7m 19s
-
(Locked)
Chatbot technical architecture1m 41s
-
(Locked)
Deploy a chatbot to a VM2m 16s
-
(Locked)
Demo: Deploy a chatbot6m 19s
-
(Locked)
Deploy a chatbot to OCI Data Science1m 49s
-
(Locked)
-