Home / Companies / SingleStore / Blog / Post Details
Content Deep Dive

Run Real-Time Applications with Spark and the SingleStore Spark Connector

Blog post from SingleStore

Post Details
Company
Date Published
Author
Wayne Song
Word Count
325
Company Posts That Month
6
Language
English
Hacker News Points
-
Summary

Apache Spark is a powerful distributed computing framework that excels at processing large datasets, but it requires a solution for data persistence. To address this, the SingleStore team has released the SingleStore Spark connector, which enables seamless integration between Spark and SingleStore. This connector provides several optimizations, including parallel reading of data from SingleStore and colocating data with SingleStore nodes on the same physical machines. It also offers two main components: a `SingleStoreRDD` class for loading data from SingleStore queries and a `saveToSingleStore` function for persisting results to SingleStore tables. The connector is open source, and a comprehensive 79-page guide provides code samples and performance recommendations for deploying Spark applications with SingleStore.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
Real-time 3 201 25 10 +48%