Hooking up Spark and ScyllaDB: Part 4

Post Details

Company

ScyllaDB

Date Published

Nov. 13, 2018

Author

Itamar Ravid

Word Count

2,100

Company Posts That Month

6

Language

English

Hacker News Points

-

Post removed?

No

Source URL

www.scylladb.com/2018/11/13/hooking-up-spark-and-scylladb-part-4

Summary

The blog post discusses integrating Spark Structured Streaming with ScyllaDB to handle infinite streaming data for real-time analytics, specifically using stock quotes as an example. It outlines the setup of a microservice that leverages Spark, ScyllaDB, and Kafka to process and store stock data, enabling the computation of daily statistics such as price changes. The article describes how to perform streaming computations using Spark's DataFrame API and introduces a custom solution for writing streaming data to ScyllaDB, as the default Datastax connector does not support this functionality out of the box. Additionally, it explains the use of ScyllaDB's materialized views to optimize data queries for different key structures, facilitating efficient computation of aggregate statistics across different dimensions. The blog emphasizes the importance of managing streaming queries' lifecycle and highlights resources for further learning about streaming systems.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
Real-time	32	359	114	47	-3%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.