From the course: Cloud Hadoop: Scaling Apache Spark
Run WordCount of Spark with Scala - Apache Spark Tutorial
- [Instructor] All right, let's bring in our next notebook. Let's go to our workspace. Click here. Import. Make this a little smaller. Go over here. Let's bring in the Scala notebook, just to see what that looks like. Import, maximize. Notice that up top it says Scala. We've got Scala code, and we don't have to have the percent sign Scala because the default execution environment is Scala. Now, I can't have a Hadoop class without word count, so I'm going to do it a little bit differently, because word count is the Hello World of Hadoop and it's always fun to put a little bit of a different flavor on it when showing it. So for those of you who are newer to Hadoop, the reason word count is the Hello World is because, of course, the whole Hadoop ecosystem derived out of technologies that were originally, more than 10 years ago, invented at Google to solve the problem of indexing all of the text available on the public internet. So…
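The notebook's own code is not shown in this excerpt, so the following is only a minimal sketch of the general word count shape: split lines into words, pair each word with 1, then sum per word. It assumes plain Scala collections rather than a Spark cluster so that it runs anywhere; the object name `WordCountSketch`, the helper `wordCount`, and the sample sentences are hypothetical and not taken from the course.

```scala
// A sketch of word count using only the Scala standard library.
// Assumption: this is NOT the instructor's notebook code, which the excerpt does not show.
object WordCountSketch {

  // Hypothetical helper: counts how often each word occurs across the given lines.
  def wordCount(lines: Seq[String]): Map[String, Int] =
    lines
      .flatMap(_.toLowerCase.split("\\W+")) // split every line into lowercase tokens
      .filter(_.nonEmpty)                   // drop empty tokens left by leading punctuation
      .map(word => (word, 1))               // pair each word with a count of 1
      .groupBy(_._1)                        // collect the pairs that share a word
      .map { case (word, pairs) => (word, pairs.map(_._2).sum) } // sum the 1s per word

  def main(args: Array[String]): Unit = {
    // Hypothetical sample input standing in for a text file.
    val sample = Seq("to be or not to be", "that is the question")

    // Print words ordered by descending count, then alphabetically.
    wordCount(sample).toSeq
      .sortBy { case (word, count) => (-count, word) }
      .foreach { case (word, count) => println(s"$word\t$count") }
  }
}
```

On Spark, the same pipeline is usually written against an RDD of lines, where `flatMap` and `map` keep the same roles and the group-and-sum step is typically expressed as `reduceByKey(_ + _)`.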
Contents
- Tour the Databricks Environment (4m 36s)
- Tour the notebook (5m 29s)
- Import and export notebooks (2m 56s)
- Calculate Pi on Spark (8m 30s)
- Run WordCount of Spark with Scala (4m 59s)
- Import data (2m)
- Transformations and actions (3m 21s)
- Caching and the DAG (6m 49s)
- Architecture: Streaming for prediction (3m 51s)