From the course: Advanced NoSQL for Data Science

Tips for using wide-column databases for data science

- Let's summarize some of the most important pieces of information about wide-column databases like Cassandra. Build your data model based on how you will query your data. Do not model based on the rules of normalization. This is a common mistake among new Cassandra data modelers, and it can lead to terrible performance. Start your data modeling work by documenting the kinds of queries you will be responding to. If you don't know what those queries are, then stop modeling. There's no point in proceeding until you know your queries. Tables should be designed to answer one or more queries. Ideally, you should be able to use a table for more than one query, but that's not always possible. Remember, rows within a partition of a Cassandra table are stored in the order defined by the clustering columns, and the Cassandra Query Language cannot sort results on arbitrary columns at query time. Weigh the importance of consistency against performance. Cassandra implements eventual consistency, which means there are times when replicas are out of sync. Be aware that a read operation can return…
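
The query-first advice above can be illustrated with a small sketch. It is a toy in-memory model written in Python, not the Cassandra driver API, and every name in it is hypothetical: the query ("latest N readings for one sensor"), the class `ReadingsBySensor`, and the choice of `sensor_id` as partition key and `timestamp` as clustering key. It only aims to show the idea that a table is shaped around one known query and that rows are kept in order at write time, so the read does not sort.

```python
from bisect import insort
from collections import defaultdict


class ReadingsBySensor:
    """Toy stand-in for a table built for one query:
    'latest N readings for a given sensor'.
    Assumed layout: partition key = sensor_id, clustering key = timestamp.
    """

    def __init__(self):
        # partition key -> list of (timestamp, value), kept sorted on write
        self._partitions = defaultdict(list)

    def insert(self, sensor_id, timestamp, value):
        # Rows are placed in clustering order when written,
        # so reads never have to sort.
        insort(self._partitions[sensor_id], (timestamp, value))

    def latest(self, sensor_id, limit):
        # Single-partition read: walk the already ordered rows, newest first.
        rows = self._partitions.get(sensor_id, [])
        return rows[::-1][:limit]


table = ReadingsBySensor()
table.insert("s1", 3, 20.5)
table.insert("s1", 1, 19.0)
table.insert("s2", 2, 30.1)
table.insert("s1", 2, 19.7)

latest_two = table.latest("s1", 2)
print(latest_two)
```

A different question, such as "all readings above a given value across every sensor", would not be served well by this layout and would normally get its own table keyed for that query, which is the denormalized approach the transcript recommends.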
