We Spent a Decade Making AI Feel Instant. Here's What We Learned.

Post Details

Company

Moss

Date Published

March 10, 2026

Author

Sri Raghu Malireddi, Harsha Nalluru

Word Count

1,314

Company Posts That Month

3

Language

English

Hacker News Points

-

Post removed?

No

Source URL

www.moss.dev/blog/we-spent-a-decade-making-ai-feel-instant

Summary

Sri Raghu Malireddi and Harsha Nalluru, former leads at Grammarly and Microsoft respectively, developed Moss to address the latency issues faced by AI agents in delivering real-time interactions. Traditional reliance on network-based retrieval from vector databases, such as Pinecone and Weaviate, resulted in delays that disrupted user experiences in chatbots, voice agents, and copilots. By embedding the semantic search index within the same process as the AI agent, Moss eliminates the need for network hops, achieving sub-10ms retrieval times. Built with Rust and WebAssembly for performance and portability, Moss provides a compact and efficient solution for instant local lookups, enhancing the responsiveness of AI systems. Launched through Y Combinator, Moss is gaining traction with platforms where retrieval latency critically impacts user experience, promising further insights into AI architecture in their upcoming series.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
Voice AI	6	2,447	202	43	+13%
Real-time	5	6,457	1,307	242	+28%
Vector Search	5	2,370	415	145	+7%
AI Agents	4	4,545	963	231	+27%
RAG	3	1,806	326	91	+5%
LLM	2	6,078	960	218	+18%
AI Coding Assistant	1	1,255	319	126	+24%
Developer Experience	1	482	254	106	+18%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.