Company
Date Published
Author
Nikita Shamgunov
Word count
1451
Language
English
Hacker News points
None

Summary

In May and June 2025, the Neon platform faced stability challenges due to an unexpected surge in database and branch creation, primarily driven by agentic AI partners, which far exceeded initial projections. This resulted in a series of incidents that affected database operations, with a significant increase in operational load leading to service degradation. The company identified scalability limits with its Kubernetes-based infrastructure and a control plane database under strain from increased metadata handling. To address these issues, Neon implemented a horizontally-scalable architecture known as "Cells" and enforced stricter workload limits. They also plan to isolate critical components to improve system resilience and predictability. Despite the incidents, Neon has taken steps to enhance its infrastructure, aiming to achieve better reliability and customer satisfaction in the future.