Company
Date Published
Author
Leo Folsom
Word count
467
Language
English
Hacker News points
None

Summary

Datafold enables data developers to detect potential production data issues caused by SQL code changes through a process known as data diffing in continuous integration (CI). This approach allows analytics engineers to preview the impact of their code before it is merged and deployed, helping to catch data changes and errors that traditional assertion tests might miss. Datafold's CI integration provides future impact analysis directly in version control platforms like GitHub, GitLab, Bitbucket, and Azure DevOps. While originally focused on teams using dbt, Datafold has broadened its applicability to various data transformation and orchestration tools, including Airflow, Dagster, and Prefect, among others. This expansion aims to ensure that all data teams, regardless of their chosen technologies, can benefit from automated data testing, ultimately improving data quality governance and streamlining the pull request review process.