Content Deep Dive
Six parameters to tune for cqlsh COPY FROM performance
Blog post from DataStax
Post Details
Company
Date Published
Author
Stefania Alborghetti
Word Count
750
Language
English
Hacker News Points
-
Summary
This article discusses methods for improving performance when importing CSV data into Cassandra via cqlsh COPY FROM. Key suggestions include installing a Python driver with Cython and libev C extensions, setting the CQLSH_NO_BUNDLED environment variable to use an installed driver, compiling the clqsh copy module with Cython for additional performance boosts, adjusting CPU scheduling on Linux systems, and experimenting with six COPY FROM parameters to optimize performance for specific workloads. These methods can result in significant improvements in import speed and efficiency.