How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform

WIFI SSID:SparkAISummit | Password: UnifiedAnalytics

Nick Pinckernell, Comcast Applied AI Research
Utilizing MLFlow and
Kubernetes to build an
Enterprise ML Platform
#UnifiedAnalytics #SparkAISummit

Topics
TOPIC WHY?
Example of data pipeline abstraction Modular components and reuse are
important for abstracting complex
systems
Ways to package and track ML project
and experiments
Consistency and reproducibility is key
for scale
How Comcast uses Kubeflow to serve
and deploy models and pipelines
A tangible example to help you
brainstorm about your organizations
requirements
3#UnifiedAnalytics #SparkAISummit

Challenges and motivations
• Before, there was no
– model management or tracking
– standardization for model packaging or deployments
• Cumbersome deployment process
– Deployment required code rewrite from research to operations
– Days or weeks to deploy
• Response and tradeoff: restrict model complexity

Requirements
Minimum requirements from our organization
• Zero code refactoring or rewriting between research ready models and
production
• Easier experiment and model tracking
• Researchers need to deploy their own models
• A/B testing for quick model enhancement testing in production
• Ability to modularize and inject custom metrics and workflows at each step

Solution – existing technologies
Research
Model serving
Images,
containers

Data pipeline abstraction
• Determine use cases
• Identify commonalities for modularization
• Abstract interfaces
• Automate configuration

Pipeline abstraction

Seldon inference graphs
Allows for complex graphs
• A/B testing
• Ensembles
• Multi-armed bandit
• Custom combinations
https://github.com/SeldonIO/seldon-core/blob/release-0.2/notebooks/advanced_graphs.ipynb

Packaging and tracking
1. Researchers code and train models with Databricks, Spark
2. Experiments tracked with MLFlow
3. Packaging and model tracking with MLFlow and Kubeflow
• MLFLow standard packaging formats
• scikit-learn
• h2o
• TensorFlow
• more

An MLFlow experiment

MLFlow – multiple experiments

MLFlow packaging

Research and model flow

Research and model flow – at scale

Model serving with Kubeflow
Considerations and requirements
• Resilient
• Highly available
• Rate limiting
• Shadow deployments
• Auto-scaling (WIP)
Ambassador
http://www.getambassador.io

Throughput
Static number of replicas Determined after
• Constant and burst load testing with Locust

DEMO
A demonstration of
• MLFlow experiments
– Serving the chosen model
• Implementation of components
– Consumer pod
– Model pod
– Producer logic (to simulate real requests)

Choosing the run

Choosing the model

Implementing the model

Implementing the consumer

Implementing the producer

Deploy the model
• Define the YAML / JSON Seldon deployment
• Build the image
– s2i build -E environment_rest .
seldonio/seldon-core-s2i-python3:0.6-SNAPSHOT
sklearn-iris-mlflow:0.3
• Deploy
– kubectl create -f sklearn_iris_deployment.json
-n kubeflow

Grafana metrics

COMCAST IS HIRING
PHILADELPHIA
WASHINGTON, D.C.
SILICON VALLEY
DENVER

DON’T FORGET TO RATE
AND REVIEW THE SESSIONS
SEARCH SPARK + AI SUMMIT

How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform

More Related Content

What's hot

Similar to How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform

More from Databricks

Recently uploaded

How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform