From the course: Cloud Hadoop: Scaling Apache Spark

Unlock the full course today

Join today to access over 24,900 courses taught by industry experts.

Spark with ADAM for genomics

Spark with ADAM for genomics

- [Instructor] We've covered a lot of territory looking at computation with Spark, both at the core level with RDDs and some of the lower-level objects and with libraries, such SQL for Spark, R for Spark, machine learning, so on and so forth. I thought it would be interesting to kind of pull this together and think about some use cases. And a great place for us to look is where Spark was actually invented at UC Berkeley. So the AMPLab is a place that not only works with Spark, lot of committers to Spark actually are working there, but they extend Spark for some important use cases. One that I've been particularly interested in is cancer genomics. So you can see in this relatively complex diagram how Spark works in ecosystems to solve important problems. At the very bottom, you'll remember from earlier movies, you'll start with some sort of resource virtualization. And your choices include Apache Mesos or YARN. Next level…

Contents