Memory and context poisoning: Don't let attackers rewrite your AI agent's memory

Post Details

Company	WorkOS
Date Published	June 9, 2026
Author	Maria Paktiti
Word Count	2,135
Company Posts That Month	31
Language	English
Hacker News Points	-
Post removed?	No
Source URL	workos.com/blog/ai-agent-memory-poisoning

Summary

In December 2025, researchers introduced MemoryGraft, an attack that compromises AI agents by planting malicious entries in their long-term memory through seemingly harmless content. The agent later retrieves these poisoned entries, treats them as records of its own successful experiences, and adopts the harmful behavior they describe. MINJA, presented at NeurIPS 2025, goes further: an attacker can corrupt an agent's memory through ordinary interactions alone, with no direct access to the memory store. Memory poisoning differs from prompt injection in two ways: it is temporally decoupled (the injection and the resulting harm occur at different times), and agents implicitly trust what they retrieve from memory. Poisoned memories influence future decisions and can spread across multi-agent systems, which makes detection and mitigation difficult. The post recommends several defenses: validating content at ingestion, tracking memory provenance, isolating memory by trust scope, setting expiration policies, monitoring for behavioral drift, and establishing an incident response process to trace and quarantine poisoned memories. Together, these defenses help preserve the integrity of an agent's decision-making.
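
To make the defenses listed in the summary more concrete, here is a minimal Python sketch of four of them: ingestion validation, provenance tracking, trust-scope isolation, and expiration, plus a quarantine step for incident response. It is an illustration only and is not taken from the WorkOS post. The names `MemoryEntry`, `MemoryStore`, and `SUSPICIOUS_MARKERS` are hypothetical, the substring check stands in for whatever real validation a system would use, and behavioral drift monitoring is not covered.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical deny-list; a real validator would be far more thorough.
SUSPICIOUS_MARKERS = (
    "ignore previous instructions",
    "disable safety",
)


@dataclass
class MemoryEntry:
    text: str
    source: str          # provenance: where the content came from
    scope: str           # trust scope the entry is confined to
    created_at: float    # timestamp in seconds
    ttl_seconds: float   # expiration policy
    quarantined: bool = False


class MemoryStore:
    """Toy in-memory store; all names here are assumptions for illustration."""

    def __init__(self) -> None:
        self._entries: List[MemoryEntry] = []

    def ingest(self, text: str, source: str, scope: str,
               now: float, ttl_seconds: float = 3600.0) -> bool:
        # Validate at ingestion: reject content with instruction-like markers.
        lowered = text.lower()
        if any(marker in lowered for marker in SUSPICIOUS_MARKERS):
            return False
        self._entries.append(
            MemoryEntry(text, source, scope, now, ttl_seconds)
        )
        return True

    def retrieve(self, scope: str, now: float) -> List[MemoryEntry]:
        # Return only entries in the caller's scope that are neither
        # expired nor quarantined.
        return [
            e for e in self._entries
            if e.scope == scope
            and not e.quarantined
            and now - e.created_at < e.ttl_seconds
        ]

    def quarantine_source(self, source: str) -> int:
        # Incident response: use provenance to isolate everything that
        # came from a source later found to be malicious.
        count = 0
        for e in self._entries:
            if e.source == source and not e.quarantined:
                e.quarantined = True
                count += 1
        return count
```

Because every entry carries its source, a single call such as `store.quarantine_source("web:page")` can pull all memories from a compromised origin out of retrieval without deleting them, which keeps them available for later investigation.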

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
RAG	5	1,000	260	106	-52%
AI Agents	3	6,005	1,359	264	+22%
Harness engineering	2	253	138	69	+37%
MCP	2	7,550	833	207	+6%
Multi-agent systems	2	532	166	79	-3%
Observability	1	4,166	768	194	+22%
Vector Search	1	1,895	382	133	-16%
