From the course: Complete Guide to Databricks for Data Engineering
Unlock this course with a free trial
Join today to access over 24,900 courses taught by industry experts.
Project use case - Databricks Tutorial
From the course: Complete Guide to Databricks for Data Engineering
Project use case
- [Instructor] Now it's time to put all our knowledge, which we have learned throughout this course, into this Capstone project. So our project is retail sales analysis for optimizing the store performance. Let's understand this project. So our main objective is to clean and prepare the sales and the store data. It is needed to add some couple of columns for advanced insights. You also need to analyze store performance and product trends. If we talk about the dataset, we have the sales data, which is available in the CSV file. It has columns like sales ID, store ID, product ID, sales data, quantity, and total amount. We also have one more dataset that is store data. It is also available as a CSV file. You can find all of these files under the exercise section of this course. And the columns for this data set are store_id, store_region, store_size, open_date. If we talk about this specific project requirement. First, it is expected to have the data cleaning. You need to handle the null…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.