Introducing a better way to measure the latency of your data in BigQuery
Blog post from Snowplow
Version 0.5.1 of BigQuery Loader introduces a refined latency metric that measures how long data takes to travel from the collector to its ingestion in BigQuery. The metric is exposed through Google Cloud Platform's Logging service, which makes it easy to access and to integrate with monitoring tools such as Prometheus and Grafana. During loading, the loader samples data once per second and calculates latency from the timestamp recorded at the collector and the timestamp taken just before the data is loaded into BigQuery. Insights customers can view the metric directly in the Monitoring UI of the Google Cloud Console, while open source users must first create a custom logs-based metric using the Logs Viewer. The aggregation settings in the Google Cloud Console or in Grafana can be adjusted to change the granularity of the data; the recommendation is to align data at one-minute intervals and use the mean as the aggregator. The release notes for versions 0.5.0 and 0.5.1 on GitHub detail the changes. Insights customers are upgraded automatically, and specific guidance is available for open source users upgrading from older versions.
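
To make the calculation concrete, here is a minimal Python sketch of the idea, not the loader's actual code. It assumes latency is simply the difference between the two timestamps, and it mimics the recommended aggregation by aligning samples to one-minute buckets and taking the mean. The function names `latency_seconds` and `mean_latency_per_minute` are hypothetical and do not come from BigQuery Loader.

```python
from collections import defaultdict
from datetime import datetime, timezone
from statistics import mean


def latency_seconds(collector_tstamp: datetime, pre_load_tstamp: datetime) -> float:
    """Seconds elapsed between the collector timestamp and the moment just before loading.

    Assumption: latency is the plain difference between the two timestamps.
    """
    return (pre_load_tstamp - collector_tstamp).total_seconds()


def mean_latency_per_minute(samples):
    """Align (observed_at, latency) samples to one-minute buckets and average each bucket.

    `samples` is an iterable of (datetime, float) pairs, e.g. one pair per
    second of loading. Returns a dict mapping the start of each minute to
    the mean latency observed in that minute.
    """
    buckets = defaultdict(list)
    for observed_at, latency in samples:
        # Truncate the observation time to the start of its minute.
        minute = observed_at.replace(second=0, microsecond=0)
        buckets[minute].append(latency)
    return {minute: mean(values) for minute, values in sorted(buckets.items())}
```

Calling `latency_seconds` on each sampled event and passing the results to `mean_latency_per_minute` gives one value per minute, which is roughly what a one-minute alignment period with a mean aggregator produces in a monitoring dashboard.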