Home / Companies / ScyllaDB / Blog / Post Details
Content Deep Dive

Spark Powered by ScyllaDB – Your Questions Answered

Blog post from ScyllaDB

Post Details
Company
Date Published
Author
Eyal Gutkind
Word Count
700
Language
English
Hacker News Points
-
Summary

The blog post discusses the integration of Apache Spark with ScyllaDB, highlighting key considerations and best practices for deploying Spark alongside ScyllaDB. It advises against co-deploying Spark and ScyllaDB on the same nodes due to their high resource demands, recommending separate deployments to avoid resource contention. The post provides tuning tips for handling high write workloads from Spark to ScyllaDB, such as optimizing connection settings and utilizing efficient batch processing. It also recommends compressing data transfer between Spark and ScyllaDB and adjusting input split sizes to improve data fetching efficiency. The post notes that the demo uses Spark standalone mode, which is common in setups involving ScyllaDB, and encourages viewers to watch the related on-demand webinar for further insights.