From the course: Cloud Hadoop: Scaling Apache Spark
Spark with ADAM for genomics - Apache Spark Tutorial
Spark with ADAM for genomics
- [Instructor] We've covered a lot of territory looking at computation with Spark, both at the core level with RDDs and some of the lower-level objects, and with libraries such as SQL for Spark, R for Spark, machine learning, and so on. I thought it would be interesting to pull this together and think about some use cases. A great place for us to look is where Spark was actually invented, at UC Berkeley. The AMPLab not only works with Spark (a lot of Spark committers work there), it also extends Spark for some important use cases. One that I've been particularly interested in is cancer genomics. You can see in this relatively complex diagram how Spark works within an ecosystem to solve important problems. At the very bottom, as you'll remember from earlier movies, you start with some sort of resource virtualization, and your choices include Apache Mesos or YARN. Next level…
Contents
- Spark SQL (8m 34s)
- SparkR (6m 54s)
- Spark ML: Preparing data (4m 21s)
- Spark ML: Building the model (3m 50s)
- Spark ML: Evaluating the model (3m 41s)
- Advanced machine learning on Spark (1m 35s)
- MXNet (25s)
- Spark with ADAM for genomics (2m 5s)
- Spark architecture for genomics (2m 1s)