Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp

Kubernetes + Operator + PaaSTA =
Flink@Yelp
Oct 9, 2019

Yelp’s Mission
Connecting
people with great
local businesses

What you’ll see What Flink at Yelp looks like
What Yelp uses Flink for and what using Flink at Yelp
looks like
WHAT YOU’LL SEE

looks like
How Kubernetes can power Flink
How Kubernetes and Operators can be used to power
Flink clusters deployment and operations
WHAT YOU’LL SEE

looks like
How Kubernetes can power Flink
Why platform integration matters
How Kubernetes and Operators can be used to power
Flink clusters deployment and operations
Why integrating Flink with Yelp’s platform as a service
(PaaSTA) is the key to unlock value for the users
WHAT YOU’LL SEE

FLINK@YELP
Powering Data Enrichment and Transformation as a Service
StreamSQL manipulations and multi-stream
unwindowed joins as a service

FLINK@YELP
Real-time Notiﬁcations
Customized push notiﬁcation to suggest relevant
businesses nearby

FLINK@YELP
Real-time Notiﬁcations
User Activity Sessions
Customized push notiﬁcation to suggest relevant
businesses nearby
Multi-platform user activity sessions out of event logs

FLINK@YELP
Powering
Connectors

FLINK@YELP
The scale ~10 apps
~50 clusters

FLINK@YELP
~1000 jobs
The scale ~10 apps
~50 clusters

THE STATUS QUO
Flink on
AWS EMR

THE STATUS QUO
Meh. Both complex and slow
Running a dockerized Puppet monolith, 15 minutes
boot time and depending on AWS for Flink updates

THE STATUS QUO
Still pretty manual
Each cluster needs trained operators to manually
deploy new versions or scale up resources

THE STATUS QUO
Still pretty manual
Just diﬀerent
Each cluster needs trained operators to manually
deploy new versions or scale up resources
Diﬀerent UX and infrastructure from the rest of Yelp led
to high barrier to entry and knowledge impedance

MEET KUBERNETES
Hello, I’m... an open-source system for automating deployment, scaling, and
management of containerized applications.
(The Internet)

MEET KUBERNETES
I like... Horizontal scaling
Scale applications up and down with a simple
command or automatically based on CPU usage

MEET KUBERNETES
Self-healing systems
Restart containers that fails, reschedule them when
nodes die, support user-deﬁned health-checks

MEET KUBERNETES
Self-healing systems
Powerful primitives
Restart containers that fails, reschedule them when
nodes die, support user-deﬁned health-checks
Pods, ReplicaSets, Services, Jobs and friends can be
used to model complex applications and workﬂows

MEET KUBERNETES
My hobbies are... Automatic bin packing
Place containers based on their requirements and
constraints, to drive up utilization and save resources

MEET KUBERNETES
Service discovery and load balancing
Give pods their own IP and a single DNS name for a set
of Pods and can load-balance across them

MEET KUBERNETES
Service discovery and load balancing
Storage orchestration
Give pods their own IP and a single DNS name for a set
of Pods and can load-balance across them
Automatically mount the storage system of your choice
and maintain state across application restarts

ASSEMBLING FLINK CLUSTERS
Job Manager is a Deployment of a Pod

Job Manager
Pod
Co-located group of containers with shared storage,
network and a spec for how to run the containers
is a Deployment of a Pod

Job Manager
Pod
Co-located group of containers with shared storage,
network and a spec for how to run the containers
is a Deployment of a Pod
Deployments
Provides declarative updates for Pods and ReplicaSets
to automate containers deployments and rollbacks

Task Managers are a Deployment of a ReplicaSet

Task Managers are a Deployment of a ReplicaSet
ReplicaSets
Maintain a stable set of identical Pods running at any
given time

Static IPs or DNS are replaced by a Service and a Proxy

Static IPs or DNS
Service
Exposes an application running on a set of Pods as a
network service regardless of their ephemeral IPs
are replaced by a Service and a Proxy

Static IPs or DNS
Service
Exposes an application running on a set of Pods as a
network service regardless of their ephemeral IPs
are replaced by a Service and a Proxy
Kube-proxy
Network proxy running on each node reﬂecting
Services and doing port-forwarding and round-robin

Flink jobs are deployed by the Supervisor

Flink jobs
Flink Supervisor
Yelp’s in-house daemon responsible of deployment,
state management and monitoring of Flink jobs on EMR
are deployed by Supervisor

Cluster shutdown is signaled via a Job

Cluster shutdown
Jobs
Create Pods and ensure that a speciﬁed number of
them successfully terminate.
is signaled via a Job

software extensions to Kubernetes that make use of custom
resources to manage applications and their components.
(The Internet)
Operators are...
KUBERNETES OPERATORS

Human VS K8s
manages a service or a
set of services
set of services
Kubernetes OperatorHuman Operator

Human VS K8s
set of services
set of services
has deep knowledge of
how the system is
expected to behave
how the system is
expected to behave

Human VS K8s
set of services
set of services
how the system is
expected to behave
knows how to deploy it
how the system is
expected to behave

Human VS K8s
set of services
set of services
how the system is
expected to behave
knows how to react if
there are problems
how the system is
expected to behave
there are problems

Human VS K8s
set of services
set of services
how the system is
expected to behave
there are problems
automates repetitive
tasks
how the system is
expected to behave
there are problems
uses automation for
repetitive tasks

Human VS K8s
set of services
set of services
how the system is
expected to behave
there are problems
automates repetitive
tasks
how the system is
expected to behave
there are problems
uses automation for
repetitive tasks
can only manage a
limited number of
instances
can manage a very
high number of
instances

Flink Custom
Resource
Declarative model
Model the conﬁguration and the deployment of a Flink
cluster

Flink Custom
Resource
Declarative model
cluster
State representation
Used by the operator to keep track of the state of any
Flink cluster

Flink Custom
Resource
Declarative model
cluster
State representation
Labels and Annotations
Used by the operator to keep track of the state of any
Flink cluster
Used for selecting the components to update or to signal
that the user requested a shutdown

Flink Dashboard is accessible via an Ingress rule

Flink Dashboard
Ingress
Exposes HTTP and HTTPS routes from outside the
cluster to services within the cluster
is accessible via an Ingress rule

Flink Dashboard
Ingress
Exposes HTTP and HTTPS routes from outside the
cluster to services within the cluster
is accessible via an Ingress rule
Ingress Controller
Ingresses and ingress rules are managed by their own
“operator”

YELP PAASTA
PaaSTA is...
a highly-available, distributed system for building, deploying, and
running services using containers and Apache Mesos.
(Yelp)

YELP PAASTA
PaaSTA is...
a highly-available, distributed system for building, deploying, and
running services using containers and Apache Mesos Kubernetes.
(Yelp)

YELP PAASTA
Why integrating? Consistent interface
Every PaaSTA user knows how to interact with any
service regardless of its nature

YELP PAASTA
Infrastructure as a Service
Whether it is a Web server, a Cassandra cluster or a
Flink job, to the user everything is a service

YELP PAASTA
Infrastructure as a Service
Platform engineers are users too
Whether it is a Web server, a Cassandra cluster or a
Flink job, to the user everything is a service
Shared infrastructure and tools are exposed as
services, libraries and CLIs to platform developers

main:
job_type: stateful
checkpoint_interval_ms : 30000
deploy_group: prod
taskmanager:
cpus: 2.0
mem: 10G
instances: 3
checkpoint_path : s3://flink-state/service/main/checkpoints
savepoint_path : s3://flink-state/service/main/savepoints
flink_conf:
taskmanager.network.detailed-metrics : "true"
env.java.opts.taskmanager : "-XX:+UseConcMarkSweepGC"

main:
job_type: stateful
checkpoint_interval_ms : 30000
deploy_group: prod
taskmanager:
cpus: 2.0
mem: 10G
instances: 3
checkpoint_path : s3://flink-state/service/main/checkpoints
savepoint_path : s3://flink-state/service/main/savepoints
flink_conf:
taskmanager.network.detailed-metrics : "true"
env.java.opts.taskmanager : "-XX:+UseConcMarkSweepGC"
Custom
Resource
Definition

YELP PAASTA
User Interaction Check status
paasta status -s service -i instance -r region

paasta logs -s service -i instance -n 100
YELP PAASTA
Read logs

paasta logs -s service -i instance -n 100
YELP PAASTA
Read logs
Deploy a new version
Diﬀerent UX and infrastructure from the rest of Yelp led
to high barrier to entry and knowledge impedance
git commit && git push origin master

THE FUTURE
Python
on Beam
on Flink
on Kubernetes

What’s next Job Oriented Deployment
More isolation, faster restarts and simpler deployment
by running a single job per Flink cluster
THE FUTURE

Reactive Container Mode and Autoscaling
Flink will automatically react to new resources available
in K8s by rescaling the job (FLINK-10407)
THE FUTURE

Reactive Container Mode and Autoscaling
Thinner Supervisor
Flink will automatically react to new resources available
in K8s by rescaling the job (FLINK-10407)
Move savepoints, jobs lifecycle and conﬁguration
management from the Supervisor to the Operator
THE FUTURE

Let’s do it!
SHOULD I DO IT?
O(1) people for O(N) clusters
A K8s operator allows you to scale up your number of
Flink clusters without adding more human operators

Let’s do it! O(1) people for O(N) clusters
Operators to codify knowledge
Codifying operational knowledge is easier than passing
it all down to new hires
SHOULD I DO IT?

Let’s do it! O(1) people for O(N) clusters
Operators to codify knowledge
A catalyst for users
Codifying operational knowledge is easier than passing
it all down to new hires
Once integrated with your platform, users don’t have to
learn how to deploy or conﬁgure a Flink job anymore
SHOULD I DO IT?

Or maybe not The Kubernetes Tax
Embedding Kubernetes into your platform requires a
pretty solid eﬀort, if you haven’t done it yet
SHOULD I DO IT?

(Build ∨ Buy) → Time
It takes some time to write your own operator or to ﬁt
an existing one into your platform
SHOULD I DO IT?

(Build ∨ Buy) → Time
It takes some time to write your own operator or to ﬁt
an existing one into your platform
SHOULD I DO IT?
There is always the cloud
Cloud providers are starting to oﬀer managed platforms
based on Kubernetes operators

www.yelp.com/careers/
We're Hiring!

@YelpEngineering
fb.com/YelpEngineers
engineeringblog.yelp.com
github.com/yelp

Questions/Suggestions?
antonio@yelp.com

Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp

More Related Content

What's hot

Similar to Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp

More from Flink Forward

Recently uploaded

Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp