Databricks Discusses Stateful Streaming Applications with Apache Spark at ScyllaDB Summit 2017
Blog post from ScyllaDB
Burak Yavuz, a software engineer at Databricks, will be presenting at the ScyllaDB Summit 2017 on the topic of stateful streaming applications using Apache Spark's Structured Streaming. Yavuz, who is part of the team responsible for building internal streaming ETL pipelines at Databricks, will discuss how Structured Streaming simplifies the processing, cleaning, enrichment, and reporting of data by allowing the same business logic to be applied in both batch and streaming contexts, thus enabling lower latency data processing. His talk will focus on stateful operations and demonstrate how NoSQL stores can be integrated as fault-tolerant state stores and streaming sinks, which are beneficial for applications requiring real-time updates and low-latency data delivery, such as updating user recommendations based on recent purchases. The summit will also feature other technical discussions and workshops on big data and NoSQL technologies, scheduled to take place in San Francisco on October 24-25, 2017.