From the course: Advanced NoSQL for Data Science
Unlock the full course today
Join today to access over 24,900 courses taught by industry experts.
Tips for using wide-column databases for data science - NoSQL Tutorial
From the course: Advanced NoSQL for Data Science
Tips for using wide-column databases for data science
- Let's summarize some of the most important pieces of information about wide-column databases like Cassandra. Build your data model based on how you will query your data. Do not model based on the rules of normalization. This is a common mistake among new Cassandra data modelers. It can lead to terrible performance. Start your data modeling work by documenting the kinds of queries you will be responding to. If you don't know what those queries are, then stop modeling. There's no point in proceeding until you know your queries. Tables should be designed to answer one or more queries. Ideally, you should be able to use a table for more than query, but that's not always possible. Remember, tables in Cassandra are ordered and sorting is not provided in the Cassandra query language. Consider the importance of consistency over performance. Cassandra implements eventual consistency which means there are times when replicas are out of sync. Be aware that a read operation can return…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
(Locked)
Wide-column data models2m 56s
-
(Locked)
Prepare data with wide-column databases2m 19s
-
(Locked)
Install the Java Development Kit1m 4s
-
(Locked)
Install Cassandra1m 49s
-
(Locked)
Prepare data for Cassandra6m 26s
-
(Locked)
Load data into Cassandra4m 30s
-
(Locked)
Cassandra and Spark1m 29s
-
(Locked)
Tips for using wide-column databases for data science2m 14s
-
(Locked)
-
-