Content Deep Dive
Zen and the Art of Spark Maintenance
Blog post from DataStax
Post Details
Company
Date Published
Author
Russell Spitzer
Word Count
2,971
Language
English
Hacker News Points
-
Summary
This blog post delves into deeper detail on the inner workings of Apache Spark and how to shape your application to take advantage of interactions between Spark and Apache Cassandra. It covers key components of Spark, such as its four processes, executor JVMs, heap memory allocation, and RDDs. The post also discusses troubleshooting connections between the driver and executors, minimizing shuffles, caching RDDs, leveraging Cassandra's advantages within Spark, and using metrics to monitor throughput to and from Cassandra.