AI Brittleness vs. Non-Determinism: The Real Reliability Problem

Post Details

Company

Galileo

Date Published

May 15, 2026

Author

Pratik Bhavsar

Word Count

2,757

Company Posts That Month

16

Language

English

Hacker News Points

-

Source URL

galileo.ai/blog/ai-brittleness-vs-non-determinism-reliability

Summary

The text discusses the challenges of ensuring reliability in AI systems, particularly in distinguishing between non-determinism and brittleness. Non-determinism is when identical inputs produce different outputs due to factors like stochastic sampling, which can be controlled with engineering techniques such as temperature settings and fixed seeds. Brittleness, however, arises when semantically equivalent inputs yield different outputs due to slight variations in phrasing, which is not addressed by controlling temperature. The text highlights the importance of identifying brittleness through methods like paraphrase testing and adversarial input variation, as well as the cost implications of conflating these issues. It emphasizes that production-ready AI must demonstrate stable behavior across a range of real-world input variations, rather than just achieving high accuracy on clean test sets, to avoid costly errors and ensure reliability.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
LLM	7	9,074	1,640	224	+53%
AI Agents	5	4,942	1,264	250	+12%
Observability	3	3,421	707	180	-24%
AI Guardrails	1	216	116	52	-40%