Spark Powered by ScyllaDB – Your Questions Answered

Post Details

Company

ScyllaDB

Date Published

July 17, 2018

Author

Eyal Gutkind

Word Count

700

Company Posts That Month

10

Language

English

Hacker News Points

-

Source URL

www.scylladb.com/2018/07/17/spark-webinar-questions-answered

Summary

The blog post answers questions about running Apache Spark with ScyllaDB and covers the main deployment considerations and best practices. It advises against co-deploying Spark and ScyllaDB on the same nodes: both are resource-intensive, so separate deployments avoid contention. For high write workloads from Spark to ScyllaDB, it offers tuning tips such as optimizing connection settings and batching writes efficiently. For reads, it recommends compressing data transferred between Spark and ScyllaDB and adjusting input split sizes to make data fetching more efficient. The post notes that the demo uses Spark standalone mode, which is common in setups involving ScyllaDB, and points readers to the related on-demand webinar for further detail.
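
The tuning advice summarized above can be sketched as a small set of connector properties. This is only an illustration and rests on assumptions: that the Spark job reaches ScyllaDB through the Spark Cassandra Connector, that the property names match the 2.x connector releases (they differ in other versions), and that the values shown are placeholders rather than figures from the post. The helper `to_submit_args` is hypothetical and simply turns the settings into `spark-submit` arguments.

```python
# Assumed property names from the 2.x Spark Cassandra Connector; check the
# reference for the connector version in use. Values are placeholders only.
TUNING = {
    # Compress traffic between Spark executors and ScyllaDB.
    "spark.cassandra.connection.compression": "LZ4",
    # Smaller input splits yield more, smaller Spark partitions on reads.
    "spark.cassandra.input.split.size_in_mb": "32",
    # Number of batches each Spark task may have in flight when writing.
    "spark.cassandra.output.concurrent.writes": "8",
    # Upper bound on the size of a single write batch, in bytes.
    "spark.cassandra.output.batch.size.bytes": "2048",
}


def to_submit_args(conf):
    """Render a settings dict as a list of spark-submit --conf arguments."""
    args = []
    for key in sorted(conf):
        args += ["--conf", f"{key}={conf[key]}"]
    return args


if __name__ == "__main__":
    # Print the arguments so they can be pasted after `spark-submit`.
    print(" ".join(to_submit_args(TUNING)))
```

Keeping the settings in one dictionary makes it easy to vary a single value, rerun the job, and compare throughput, which is how such parameters are usually tuned in practice.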
