From the course: Advanced NoSQL for Data Science

Prepare data with graph databases

- [Instructor] As you prepare data for graphs, here are a few things to keep in mind. Start by identifying the nodes and edges in your graphs. You'll have to load data in a way that separates nodes from edges. This can be somewhat more complicated than loading data into relational databases, but it's not too different. Keep in mind that if you're using a tabular data format, like CSV, then columns can be nodes, edges, or properties. Data files can be organized in a variety of ways. For example, you may find some files that have only nodes, others that have only edges, and still others that have both nodes and edges. Keeping nodes and edges in separate files often simplifies the loading logic. As with any time you load data into a database, you'll want to perform some quality checks. Make sure all nodes have an identifier. This is like having a primary key, and it helps you keep track of individual entities. Edges may have unique IDs, but they don't have to. We can usually identify…
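To make the loading advice concrete, here is a minimal Python sketch of splitting one tabular file into nodes and edges and running the identifier check. Everything in it is an assumption made for illustration: the column names, the `Person` and `Company` labels, the `WORKS_AT` edge type, and the helper names `split_nodes_and_edges` and `dangling_edges` do not come from the course, and a real graph database would bring its own bulk loader.

```python
import csv
import io

# Hypothetical input: each row describes one edge, with both endpoint
# nodes and one edge property ("since") stored as columns.
RAW_CSV = """person_id,person_name,company_id,company_name,since
p1,Ada,c1,Acme,2019
p2,Grace,c1,Acme,2021
,Linus,c2,Globex,2020
"""


def split_nodes_and_edges(text):
    """Separate a mixed tabular file into nodes, edges, and rejected rows."""
    nodes = {}      # node id -> properties; the id acts like a primary key
    edges = []      # (source id, edge type, target id, properties)
    rejected = []   # rows that fail the "every node has an identifier" check
    for row in csv.DictReader(io.StringIO(text)):
        person_id = row["person_id"].strip()
        company_id = row["company_id"].strip()
        if not person_id or not company_id:
            rejected.append(row)
            continue
        nodes[person_id] = {"label": "Person", "name": row["person_name"]}
        nodes[company_id] = {"label": "Company", "name": row["company_name"]}
        # The edge carries no ID of its own in this sketch.
        edges.append(
            (person_id, "WORKS_AT", company_id, {"since": int(row["since"])})
        )
    return nodes, edges, rejected


def dangling_edges(nodes, edges):
    """Return edges whose source or target is not a known node."""
    return [e for e in edges if e[0] not in nodes or e[2] not in nodes]


nodes, edges, rejected = split_nodes_and_edges(RAW_CSV)
print(len(nodes), "nodes,", len(edges), "edges,", len(rejected), "rejected")
print("dangling edges:", dangling_edges(nodes, edges))
```

The row with the empty `person_id` is set aside rather than loaded, which mirrors the check that every node needs an identifier. Writing `nodes` and `edges` out to separate files afterward would give the loader one simple format for each.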
