From the course: Cloud Architecture: Design Decisions
Unlock this course with a free trial
Join today to access over 24,900 courses taught by industry experts.
Advanced architecture scenario: Generative AI computing
From the course: Cloud Architecture: Design Decisions
Advanced architecture scenario: Generative AI computing
- In this video we will break down a cloud-based generative AI architecture, specifically, a case study examines enterprise content generation applications using generative AI. For example, a mid-size marketing firm must generate customized content at scale while maintaining brand consistency. They need something that works today, not a science project that takes too long to build. Here's a no-nonsense generative AI cloud architecture. Core components. We have a data layer, AWS S3, which is a data lake, Pinecone, which is a vector database, and Redis for data caching. We have a processing layer, GPT-4 on Azure from OpenAI, which is the primary large language model, SageMaker, which is the custom model, and Kubernetes orchestration. And then we have an integration layer, API Gateway for external access, gRPC for internal communication, and OAuth2 for authentication. The system uses a three-tier architecture with automatic scaling and failover. Data flows from the storage layer through…
Contents
-
-
-
Taking your architecture to the next level3m 43s
-
(Locked)
Advanced architecture scenario: HR systems3m 4s
-
(Locked)
Advanced architecture scenario: Edge computing2m 27s
-
(Locked)
Advanced architecture scenario: Containers and Kubernetes3m 32s
-
(Locked)
Advanced architecture scenario: Serverless computing3m 55s
-
(Locked)
Advanced architecture scenario: Generative AI computing2m 16s
-
(Locked)
Advanced architecture scenario: Agentic AI computing2m 26s
-
(Locked)
Necessary skills, tools, and processes2m 54s
-
-
-
-
-
-