Contract Autopilot: From Zero to Full Coverage
Blog post from Soda
Soda Contract Autopilot is an AI-driven feature designed to automate the generation of data contracts by analyzing actual production data, thus addressing the scalability issue faced by data teams when writing individual contracts for numerous datasets. Unlike starting from a blank YAML file, Autopilot provides a pre-generated contract based on profiling metrics from approximately 10,000 data rows, offering schema definitions and recommended quality checks that reflect a dataset's characteristics. This approach allows teams to review, refine, and deploy contracts more efficiently, while still requiring human validation to ensure the accuracy and applicability of the generated contracts. Autopilot operates in batch mode across multiple datasets, offering a solution for achieving comprehensive contract coverage at scale, which is often unattainable through manual authoring. It complements Soda's Contract Copilot by providing a foundation that accelerates the contract creation process without replacing human judgment, aligning with Soda's larger vision of a self-driving data quality platform introduced in Soda 4.0.