Best Practices for Data Modeling
Blog post from ScyllaDB
In this webinar, ScyllaDB Field Engineer Juliana Oliveira presents best practices for data modeling in ScyllaDB, starting from how the database stores and distributes data. Oliveira contrasts SQL with the Cassandra Query Language (CQL) used by ScyllaDB: where relational modeling normalizes data and joins it at read time, ScyllaDB modeling relies on denormalization and organizes tables around the queries they must serve. The session covers partition keys and clustering keys, using a veterinary clinic example to show how rows are sorted and queried, and addresses common problems such as large partitions and hot partitions. It then turns to ScyllaDB's underlying storage mechanics, including memtables and SSTables, and discusses compaction strategies such as size-tiered and leveled compaction, which affect performance and resource utilization. The goal is to give attendees the foundational knowledge needed to distribute data evenly and keep partition sizes manageable, which in turn improves database performance and efficiency.
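To make the roles of the two key types concrete, here is a minimal sketch in Python rather than CQL. It is a toy in-memory model, not ScyllaDB's implementation: the class `ToyTable`, the pet names, the timestamps, and the heart-rate values are all hypothetical and only loosely echo the veterinary clinic example. The sketch assumes the partition key decides which group a row belongs to, and the clustering key decides the order of rows inside that group.

```python
from collections import defaultdict


class ToyTable:
    """Toy in-memory model: rows grouped by partition key, ordered by clustering key."""

    def __init__(self):
        # partition key -> {clustering key -> value}
        self._partitions = defaultdict(dict)

    def insert(self, partition_key, clustering_key, value):
        # Writing the same (partition key, clustering key) pair twice overwrites the row.
        self._partitions[partition_key][clustering_key] = value

    def query(self, partition_key, start=None, end=None):
        """Return the rows of one partition, sorted by clustering key, optionally range-limited."""
        rows = self._partitions.get(partition_key, {})
        return [
            (ck, rows[ck])
            for ck in sorted(rows)
            if (start is None or ck >= start) and (end is None or ck <= end)
        ]

    def partition_sizes(self):
        """Row count per partition; one outsized count hints at a large partition."""
        return {pk: len(rows) for pk, rows in self._partitions.items()}


# Hypothetical data: partition key = pet name, clustering key = ISO timestamp.
table = ToyTable()
table.insert("duke", "2024-01-01T10:00", 95)
table.insert("duke", "2024-01-01T09:00", 92)
table.insert("rex", "2024-01-01T09:30", 110)

print(table.query("duke"))                            # rows come back in timestamp order
print(table.query("duke", start="2024-01-01T09:30"))  # range scan within one partition
print(table.partition_sizes())
```

Every query in the sketch names a single partition key, which mirrors the query-driven modeling described above. If every reading were written under one key, that partition would grow without bound and absorb all traffic, which is the large-partition and hot-partition problem in miniature.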