Selecting Compression Chunk Sizes for ScyllaDB

Blog post from ScyllaDB

Post Details
Company: ScyllaDB
Author: Glauber Costa
Word Count: 863
Language: English
Summary

ScyllaDB compresses SSTables in fixed-size chunks, with a default chunk size of 4 kB, and the best chunk size depends on the workload. Larger chunks suit write-heavy systems: they generally compress better, so less disk bandwidth is spent writing data. Smaller chunks suit read-heavy systems, particularly those dominated by single-partition reads, because a read has to fetch and decompress every chunk it touches in full; with small chunks, less unneeded data is read from disk and latency improves. The trade-off is therefore between read and write demands: the same change in chunk size can save disk bandwidth when one kind of operation dominates and add overhead when the other does. The chunk size can be changed per table through the table's compression options in CQL. Smaller chunks increase memory usage, since more chunks means more per-chunk metadata, but they can save significant disk bandwidth in read-mostly workloads. Users who would need a very small chunk size might consider disabling compression altogether. Overall, choosing a chunk size requires understanding the specifics of the workload.
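
The read-side cost described in the summary can be illustrated with a small back-of-the-envelope model. This is only a sketch under simplifying assumptions, not ScyllaDB's implementation: the helper names (`chunks_touched`, `read_amplification`, `chunk_count`) are hypothetical, offsets are in uncompressed bytes, and the model counts uncompressed bytes that must be decoded rather than the compressed bytes actually read from disk.

```python
KB = 1024


def chunks_touched(offset: int, length: int, chunk_size: int) -> int:
    """Number of chunks overlapping the byte range [offset, offset + length)."""
    if length <= 0:
        return 0
    first = offset // chunk_size
    last = (offset + length - 1) // chunk_size
    return last - first + 1


def read_amplification(offset: int, length: int, chunk_size: int) -> float:
    """Uncompressed bytes decoded divided by bytes the read actually wanted.

    Every touched chunk must be decompressed in full, so the cost of a read
    is a whole number of chunks regardless of how little data is needed.
    """
    return chunks_touched(offset, length, chunk_size) * chunk_size / length


def chunk_count(data_size: int, chunk_size: int) -> int:
    """Chunks needed for data_size bytes (ceiling division).

    Per-chunk metadata kept in memory grows in proportion to this count.
    """
    return -(-data_size // chunk_size)


if __name__ == "__main__":
    # A hypothetical 1 kB single-partition read, 10 kB into the file.
    for chunk_kb in (4, 64):
        amp = read_amplification(10 * KB, 1 * KB, chunk_kb * KB)
        n = chunk_count(1024 ** 3, chunk_kb * KB)
        print(f"{chunk_kb} kB chunks: amplification {amp:.0f}x, "
              f"{n} chunks per GiB of data")
```

In this model, a 1 kB read decodes 4 kB with 4 kB chunks and 64 kB with 64 kB chunks, while the number of chunks (and therefore the per-chunk metadata) per GiB shrinks by the same factor of 16. That is the trade-off the summary describes: small chunks cost memory, large chunks cost read bandwidth.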