Home / Companies / Gretel.ai / Blog / Post Details
Content Deep Dive

Introducing world's largest synthetic open-source Text-to-SQL dataset

Blog post from Gretel.ai

Post Details
Company
Date Published
Author
Yev Meyer
Word Count
1,602
Company Posts That Month
2
Language
English
Hacker News Points
-
Summary

Gretel has introduced the world's largest synthetic open-source Text-to-SQL dataset, available on Hugging Face under Apache 2.0 license. The gretelai/synthetic_text_to_sql dataset is designed and generated using Gretel Navigator and includes over 105,851 records with diverse SQL tasks and complexity levels. This synthetic data accelerates the transition to data-centric AI by allowing teams to produce high-quality data while preserving privacy and security. The release of this dataset marks a significant milestone in the world of synthetic data and encourages developers, researchers, and data enthusiasts to leverage it for their projects.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
LLM 14 3,398 379 136 +44%
RAG 2 1,795 223 72 +55%
AI Model Fine-tuning 1 742 135 73 +71%