Google Cloud Dataflow example project released
Blog post from Snowplow
The announcement introduces a new Google Cloud Dataflow Example Project by Snowplow, designed to facilitate real-time event processing on Google Cloud Platform. This project uses Scala to process JSON events from Google Cloud Pub/Sub and aggregates them into Google Cloud Bigtable, demonstrating an "analytics-on-write" job. The project aims to broaden Snowplow's adoption across various cloud platforms, ensuring pipeline portability and flexibility. It includes a detailed setup guide that covers building the project, setting up Google Cloud services, creating Pub/Sub topics, and storing aggregates in Bigtable, all while teaching users about key Google Cloud services like Dataflow, Pub/Sub, and Bigtable. Additionally, the project encourages engagement with Google Cloud's ecosystem and sets the stage for future Snowplow developments on the platform.