From the course: Oracle Cloud Infrastructure Generative AI Professional
Chat models - Oracle Cloud Infrastructure Tutorial
Chat models
(bright music) - [Instructor] Welcome to this lesson on chat models available in the OCI Generative AI service. Before we dive deeper, let us look at tokens first. Large language models understand tokens rather than characters. One token can be part of a word, an entire word, or even a punctuation symbol. A common word such as "apple" is a single token. A word such as "friendship" is made up of two tokens, "friend" and "ship." The number of tokens per word depends on the complexity of the text. So for simple text, you can assume one token per word on average. For complex text, meaning text with less common words, you can assume two to three tokens per word on average. So for example, suppose you have a sentence like this, "Many words map to one token, but some don't, indivisible," and you run it through a tokenizer for a large language model. This is an example of what a tokenizer would do. It would break this particular sentence into multiple tokens. If you count, the total number of tokens is 15, whereas…
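The rule of thumb above (about one token per word for simple text, two to three for complex text) can be turned into a rough estimate. The following is a minimal sketch in Python, not a real tokenizer: the function names and the whitespace-based word count are assumptions made for illustration, and an actual model tokenizer can give a different count for the same sentence.

```python
from typing import Tuple

# Average ratios quoted in the lesson; they are rough guides, not guarantees.
TOKENS_PER_WORD_SIMPLE = 1.0
TOKENS_PER_WORD_COMPLEX = (2.0, 3.0)


def count_words(text: str) -> int:
    """Count whitespace-separated words (hypothetical helper)."""
    return len(text.split())


def estimate_tokens(text: str, complex_text: bool = False) -> Tuple[int, int]:
    """Return a (low, high) token estimate based on the word count."""
    words = count_words(text)
    if complex_text:
        low, high = TOKENS_PER_WORD_COMPLEX
    else:
        low = high = TOKENS_PER_WORD_SIMPLE
    return round(words * low), round(words * high)


sentence = "Many words map to one token, but some don't, indivisible"
print(estimate_tokens(sentence))                     # estimate for simple text
print(estimate_tokens(sentence, complex_text=True))  # estimate for complex text
```

An estimate like this is only useful for quick sizing, for example to judge whether a prompt is likely to fit within a model's token limit. For exact counts, the tokenizer that belongs to the specific model has to be used.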
Contents
- Module introduction (45s)
- OCI Generative AI (7m 21s)
- Demo: OCI Generative AI (12m 22s)
- Chat models (9m 58s)
- Demo: Chat models (7m 5s)
- Demo: OCI Generative AI service inference API (7m 37s)
- Demo: Config setup for generative AI inference API (5m 35s)
- Embedding models (13m 4s)
- Demo: Embedding models (7m 25s)
- Prompt engineering (11m 10s)
- Customize LLMs with your data (9m 56s)
- Fine-tuning and inference in OCI Generative AI (11m 24s)
- Dedicated AI cluster sizing and pricing (10m 45s)
- Demo: Dedicated AI clusters (6m 16s)
- Fine-tuning configuration (9m 42s)
- Demo: Fine-tuning and custom models (6m 13s)
- Demo: Inference using endpoint (5m 53s)
- OCI Generative AI security (4m 31s)