Home / Companies / Portkey / Blog / Post Details
Content Deep Dive

Unpacking Semantic Caching at Walmart

Blog post from Portkey

Post Details
Company
Date Published
Author
Vrushank Vyas
Word Count
576
Company Posts That Month
2
Language
English
Hacker News Points
-
Post removed?
No
Summary

Rohit Chatter, Chief Software Architect at Walmart Tech Global, participated in a fireside chat with the LLMs in the Prod community, discussing Walmart's transition to Generative AI and semantic caching in its retail operations. The conversation covered Walmart's shift from traditional NLP to Generative AI models, such as BERT-based Mini LM V6, to improve e-commerce search by handling complex and contextually relevant product groupings and enhancing product recommendations. The company fine-tunes models like MiniLMv2 and T0 with customer engagement data to enhance search relevance for ambiguous queries, using techniques like Approximate Nearest Neighbour search for relevance matching. Walmart also implements semantic caching to cluster queries based on conceptual similarity, achieving about a 50% cache hit rate. Challenges include reducing search latency and plans for future developments in personalization, voice, and visual search. The session highlighted Walmart's strategies for improving customer experience and long-term ROI from Generative AI implementations.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
LLM 4 2,401 292 122 -7%
Vector Search 2 2,087 216 81 +23%
AI Model Fine-tuning 1 474 91 59 +12%
Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.