From the course: Advanced NoSQL for Data Science

Unlock the full course today

Join today to access over 24,900 courses taught by industry experts.

Performing data science tasks with NoSQL

Performing data science tasks with NoSQL - NoSQL Tutorial

From the course: Advanced NoSQL for Data Science

Performing data science tasks with NoSQL

As for working with data science applications, it's important to keep in mind some of the common tasks that we have to deal with, and how that relates to NoSQL. One of the most time consuming parts of data science and other business intelligence activities is data preparation. Collecting data is the first step. Often times we'll need to collect data from multiple types of systems, like application servers, databases, lab files, sometimes even Excel spread sheets, and once we've got those consolidated, we often have to filter the data. This is especially true when we deal with streaming data and large volume data, such as logs, and then finally, the last big step that we have to keep in mind is we have to restructure this data once we've collected it and filtered it. Sometimes this means mapping it into tabular structures. Sometimes it means mapping it into document structures. The actual output of the restructured data really depends on the application that we're working toward. Now…

Contents