Designing Memory for AI Agents: Inside Linkedin’s Cognitive Memory Agent

- Cognitive Memory Agent 提供高效上下文管理能力。
- 新架构优化了 AI 模型的实时决策性能。
- 内存设计结合了短期和长期记忆机制。
Designing Memory for AI Agents: Inside Linkedin’s Cognitive Memory Agent - InfoQ
Your choice regarding cookies on this site
We use cookies to optimise site functionality and give you the best possible experience.
I Accept I Do Not Accept Settings
[BT](http://www.infoq.com/int/bt/ "bt")
InfoQ Software Architects' Newsletter
A monthly overview of things you need to know as an architect or aspiring architect.
Enter your e-mail address
Select your country - [x] I consent to InfoQ.com handling my data as explained in this Privacy Notice.
Close
QCon San Francisco (Nov 16-20): What's next in AI? What's next in software? Learn from the teams already doing it.Register Now
Close
Toggle Navigation
Facilitating the Spread of Knowledge and Innovation in Professional Software Development
English edition
[Write for InfoQ](http://www.infoq.com/write-for-infoq/ "Write for InfoQ")
Search
Unlock the full InfoQ experience
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources.
or
Don't have an InfoQ account?
- **Stay updated on topics and peers that matter to you**Receive instant alerts on the latest insights and trends.
- **Quickly access free resources for continuous learning**Minibooks, videos with transcripts, and training materials.
- **Save articles and read at anytime**Bookmark articles to read whenever youre ready.
NewsArticlesPresentationsPodcastsGuides
Topics
[Development](http://www.infoq.com/development/ "Development")
- [Java](http://www.infoq.com/java/ "Java")
- [Kotlin](http://www.infoq.com/kotlin/ "Kotlin")
- [.Net](http://www.infoq.com/dotnet/ ".Net")
- [C#](http://www.infoq.com/c_sharp/ "C#")
- [Swift](http://www.infoq.com/swift/ "Swift")
- [Go](http://www.infoq.com/golang/ "Go")
- [Rust](http://www.infoq.com/rust/ "Rust")
- [JavaScript](http://www.infoq.com/javascript/ "JavaScript")
Featured in Development
Dany Lepage discusses the architectural journey of porting a hit VR title to seven non-VR platforms. He explains how his team solved the challenges of cross-progression, diverse input paradigms, and maintaining release velocity across Steam, iOS, and PlayStation. Beyond the tech, he shares candid lessons on the "product fit" gap when translating immersive social presence to 2D screens.

All in developmentFollow Topic
[Architecture & Design](http://www.infoq.com/architecture-design/ "Architecture & Design")
- [Architecture](http://www.infoq.com/architecture/ "Architecture")
- [Enterprise Architecture](http://www.infoq.com/enterprise-architecture/ "Enterprise Architecture")
- [Scalability/Performance](http://www.infoq.com/performance-scalability/ "Scalability/Performance")
- [Design](http://www.infoq.com/design/ "Design")
- [Case Studies](http://www.infoq.com/Case_Study/ "Case Studies")
- [Microservices](http://www.infoq.com/microservices/ "Microservices")
- [Service Mesh](http://www.infoq.com/servicemesh/ "Service Mesh")
- [Patterns](http://www.infoq.com/DesignPattern/ "Patterns")
- [Security](http://www.infoq.com/Security/ "Security")
Featured in Architecture & Design
Chris Tacey-Green discusses the shift from synchronous commands to asynchronous events within highly regulated environments. He explains the critical role of Inbox and Outbox patterns in preventing data loss, the nuances of event versioning, and how to maintain decoupling between domains. He shares "battle-tested" principles for implementing fault tolerance and managing eventual consistency.

All in architecture-designFollow Topic
[AI Infrastructure](http://www.infoq.com/ai-ml-data-eng/ "AI Infrastructure")
- [Big Data](http://www.infoq.com/bigdata/ "Big Data")
- [Machine Learning](http://www.infoq.com/machinelearning/ "Machine Learning")
- [NoSQL](http://www.infoq.com/nosql/ "NoSQL")
- [Database](http://www.infoq.com/database/ "Database")
- [Data Analytics](http://www.infoq.com/data-analytics/ "Data Analytics")
- [Streaming](http://www.infoq.com/streaming/ "Streaming")
Featured in AI, ML & Data Engineering
Lakehouse architectures enable multiple engines to operate on shared data using open table formats such as Apache Iceberg. However, differences in SQL identifier resolution and catalog naming rules create interoperability failures. This article examines these behaviors and explains why enforcing consistent naming conventions and cross-engine validation is critical.

All in ai-ml-data-engFollow Topic
[Culture & Methods](http://www.infoq.com/culture-methods/ "Culture & Methods")
- [Agile](http://www.infoq.com/agile/ "Agile")
- [Diversity](http://www.infoq.com/diversity/ "Diversity")
- [Leadership](http://www.infoq.com/leadership/ "Leadership")
- [Lean/Kanban](http://www.infoq.com/lean/ "Lean/Kanban")
- [Personal Growth](http://www.infoq.com/personal-growth/ "Personal Growth")
- [Scrum](http://www.infoq.com/scrum/ "Scrum")
- [Sociocracy](http://www.infoq.com/sociocracy/ "Sociocracy")
- [Software Craftmanship](http://www.infoq.com/software_craftsmanship/ "Software Craftmanship")
- [Team Collaboration](http://www.infoq.com/team-collaboration/ "Team Collaboration")
- [Testing](http://www.infoq.com/testing/ "Testing")
- [UX](http://www.infoq.com/ux/ "UX")
Featured in Culture & Methods
Celine Pypaert discusses the ubiquitous nature of open-source software and shares a blueprint for securing modern applications. She explains how to prioritize high-risk vulnerabilities using exploitability data, the role of Software Bill of Materials (SBOM), and the importance of bridging the gap between DevOps and Security through clear accountability and automated governance.

All in culture-methodsFollow Topic
- [Infrastructure](http://www.infoq.com/infrastructure/ "Infrastructure")
- [Continuous Delivery](http://www.infoq.com/continuous_delivery/ "Continuous Delivery")
- [Automation](http://www.infoq.com/automation/ "Automation")
- [Containers](http://www.infoq.com/containers/ "Containers")
- [Cloud](http://www.infoq.com/cloud-computing/ "Cloud")
- [Observability](http://www.infoq.com/observability/ "Observability")
Featured in DevOps
Docker Extensions boost developer speed but create a "visibility gap" by isolating telemetry. To meet enterprise needs, extensions must act as bridges to centralized platforms. This article details how to use OpenTelemetry, policy-as-code, and encryption to build secure pipelines. Learn to balance developer productivity with the governance required for scalable, compliant observability.

All in devopsFollow Topic
[Events](https://events.infoq.com/ "Events")
Helpful links
- [About InfoQ](http://www.infoq.com/about-infoq "About InfoQ")
- [InfoQ Editors](http://www.infoq.com/infoq-editors "InfoQ Editors")
- [Write for InfoQ](http://www.infoq.com/write-for-infoq "Write for InfoQ")
- [About C4Media](https://c4media.com/ "About C4Media")
- [Diversity](https://c4media.com/diversity "Diversity")
Choose your language

[InfoQ Homepage](http://www.infoq.com/ "InfoQ Homepage")[News](http://www.infoq.com/news "News")Designing Memory for AI Agents: Inside Linkedin’s Cognitive Memory Agent
[Architecture & Design](http://www.infoq.com/architecture-design/ "Architecture & Design")
Designing Memory for AI Agents: Inside Linkedin’s Cognitive Memory Agent
Apr 20, 2026 2 min read
by
- Leela Kumili
Follow Lead Engineer
#### Write for InfoQ
**Feed your curiosity.**Help 550k+ global
senior developers
each month stay ahead.Get in touch
Log in to listen to this article
Loading audio
Your browser does not support the audio element.
0:00 0:00
Normal 1.25x 1.5x
Like
LinkedIn has introduced a Cognitive Memory Agent (CMA) as part of its generative AI application stack to enable stateful, context-aware AI systems that retain and reuse knowledge across interactions. The system is designed to power applications such as its Hiring Assistant, addressing a fundamental limitation of large language model-based workflows: statelessness and the resulting loss of continuity across sessions.
CMA functions as a shared memory infrastructure layer between application agents and underlying language models. Instead of reconstructing context through repeated prompting, agents can persist, retrieve, and update memory through a dedicated system. This enables continuity across sessions, reduces redundant reasoning, and improves personalization in production environments where user context evolves.
!Image 12/filters:no_upscale()/news/2026/04/linkedin-cognitive-memory-agent/en/resources/1memorylayer-1776257738689.jpeg)
_Conversational memory layer illustration (Source: LinkedIn Blog Post)_
The architecture organizes memory into three distinct layers. Episodic memory captures interaction history and conversational events, allowing agents to recall past exchanges. Semantic memory stores structured knowledge derived from interactions, enabling reasoning over persistent facts about users, entities, or preferences. Procedural memory encodes learned workflows and behavioral patterns, helping agents improve task execution strategies over time. Together, these layers shift agent behavior from single-turn responses to longitudinal adaptation.
Xiaofeng Wang, an engineer at LinkedIn, noted in a post,
Memory is one of the most challenging and impactful pieces of building production agents, adding that it enables real personalization, continuity, and adaptation at scale.
CMA also plays a critical role in multi-agent systems. Rather than each agent maintaining an isolated context, CMA provides a shared memory substrate accessible across specialized agents responsible for planning, reasoning, and execution. This shared layer reduces state duplication, improves coordination, and ensures consistency in outputs across distributed workflows.
From a systems perspective, CMA integrates multiple retrieval and lifecycle management mechanisms. Recent context retrieval supports short-term relevance, while semantic search enables access to long-term historical interactions. Memory compaction through summarization helps control storage growth and maintain performance at scale. These mechanisms introduce core engineering challenges around relevance ranking, staleness management, and consistency of evolving user context.
Karthik Ramgopal, Distinguished Engineer at LinkedIn, emphasized the shift toward persistent context in agentic systems, stating
Good agentic AI isn't stateless: It remembers, adapts, and compounds. One of the key capabilities enabling this is memory that lives beyond context windows
Operationally, persistent memory systems introduce classic trade-offs in distributed systems. Determining what to store, when to retrieve it, and how to handle staleness becomes central to system correctness.
Subhojit Banerjee,a MLOPS Data Engineer, highlights,
Cache invalidation is one of the hardest problems in computer science, and glad you made the caveat clear. The obvious challenge in extracting this memory is correctly identifying episode boundaries, staleness, and conflict resolution.
In user-facing applications such as recruiting, LinkedIn also incorporates human validation into the workflow. This hybrid approach helps ensure that AI-generated outputs, augmented by persistent memory, remain aligned with user intent and business requirements, particularly in high-stakes decision environments.
CMA reflects a broader architectural shift in AI systems from stateless generation to stateful, memory-driven agent design. By externalizing memory into a dedicated infrastructure layer, LinkedIn positions CMA as a horizontal platform for building adaptive, personalized, and collaborative agentic systems at scale. The direction highlights a growing industry consensus: production-grade AI systems are not defined by models alone, but by the memory, context management, and infrastructure layers that surround them.
About the Author

#### **Leela Kumili**
Leela is a Lead Software Engineer at Starbucks with deep expertise in building scalable, cloud-native systems and distributed platforms. She drives architecture, delivery, and operational excellence across the Rewards Platform, leading efforts to modernize systems, improve scalability, and enhance reliability. In addition to her technical leadership, Leela serves as an AI Champion for the organization, identifying opportunities to improve developer productivity and workflows using LLM-based tools and establishing best practices for AI adoption. She is passionate about building production-ready systems, enhancing developer experience, and mentoring engineers to grow in both technical and strategic impact. Her interests include platform engineering, distributed systems, developer productivity, and bridging technical solutions with business and product goals.
Show more Show less
#### This content is in the Platform Engineering topic
Follow Topic
##### Related Topics:
Followers: 4088
Follow Topic
Followers: 10203
Follow Topic
Followers: 5867
Follow Topic
Followers: 0
Follow Topic
Followers: 39
Follow Topic
Followers: 7
Follow Topic
Followers: 1
Follow Topic
Followers: 30
Follow Topic
Followers: 125
Follow Topic
Followers: 332
Follow Topic
Followers: 137
Follow Topic
Followers: 8
Follow Topic
Followers: 30
Follow Topic
* #### Related Editorial
- ##### Stripe Engineers Deploy Minions, Autonomous Agents Producing Thousands of Pull Requests Weekly
* #### Related Sponsors
* #### Related Sponsor

- May 12, 2026, 1:30 PM EDT
##### Designing Data Layers for Agentic AI: Patterns for State, Memory, and Coordination at Scale
SPONSORED BY YUGABYTEDB Save your seat
Related Content
Apr 20, 2026
Apr 16, 2026
Apr 09, 2026
Apr 06, 2026
Apr 04, 2026
Mar 29, 2026
Mar 27, 2026
Mar 27, 2026
Apr 02, 2026 
Related Sponsors
- #### Designing Data Layers for Agentic AI: Patterns for State, Memory, and Coordination at Scale (Live Webinar May 12, 2026) - Save Your Seat
AI agents create new architectural challenges: shared memory, cross-agent state, and auditability. This session explores data layer patterns—conversation state, knowledge persistence, coordination—and tradeoffs in consistency, latency, and cost at scale using AWS and YugabyteDB.
- Sponsored by

Related Content
Mar 18, 2026 
Mar 16, 2026 
Feb 19, 2026 
Apr 02, 2026 
Apr 16, 2026
Apr 12, 2026
**The InfoQ** Newsletter
A round-up of last week’s content on InfoQ sent out every Tuesday. Join a community of over 250,000 senior developers. View an example
Enter your e-mail address
Select your country - [x] I consent to InfoQ.com handling my data as explained in this Privacy Notice.
- ##### [C++26: Reflection, Memory Safety, Contracts, and a New Async Model](http://www.infoq.com/news/2026/04/cpp-26-reflection-safety-async/ "C++26: Reflection, Memory Safety, Contracts, and a New Async Model")
- ##### [From VR to Flat Screens: Bridging the Input and Immersion Gap](http://www.infoq.com/presentations/game-vr-flat-screens/ "From VR to Flat Screens: Bridging the Input and Immersion Gap")
- ##### [Cursor 3 Introduces Agent-First Interface, Moving beyond the IDE Model](http://www.infoq.com/news/2026/04/cursor-3-agent-first-interface/ "Cursor 3 Introduces Agent-First Interface, Moving beyond the IDE Model")
- ##### [Designing Memory for AI Agents: Inside Linkedin’s Cognitive Memory Agent](http://www.infoq.com/news/2026/04/linkedin-cognitive-memory-agent/ "Designing Memory for AI Agents: Inside Linkedin’s Cognitive Memory Agent")
- ##### [Event-Driven Patterns for Cloud-Native Banking - What Works, What Hurts?](http://www.infoq.com/presentations/patterns-payment-system/ "Event-Driven Patterns for Cloud-Native Banking - What Works, What Hurts?")
- ##### [Engineering Stable, Secure and Scalable Platforms: A Conversation with Matthew Liste](http://www.infoq.com/podcasts/engineering-stable-secure-scalable-platforms/ "Engineering Stable, Secure and Scalable Platforms: A Conversation with Matthew Liste")
- ##### [Platform as a Product: Delivering Value While Balancing Competing Priorities](http://www.infoq.com/news/2026/04/platform-product-deliver-value/ "Platform as a Product: Delivering Value While Balancing Competing Priorities")
- ##### [Empower Your Developers: How Open Source Dependencies Risk Management Can Unlock Innovation](http://www.infoq.com/presentations/open-source-dependencies/ "Empower Your Developers: How Open Source Dependencies Risk Management Can Unlock Innovation")
- ##### [Tiger Teams, Evals and Agents: The New AI Engineering Playbook](http://www.infoq.com/podcasts/tiger-teams-evals-agents/ "Tiger Teams, Evals and Agents: The New AI Engineering Playbook")
- ##### [Subagents in Gemini CLI Enable Task Delegation and Parallel Agent Workflows](http://www.infoq.com/news/2026/04/subagents-gemini-cli/ "Subagents in Gemini CLI Enable Task Delegation and Parallel Agent Workflows")
- ##### [Google’s Aletheia Advances the State of the Art of Fully Autonomous Agentic Math Research](http://www.infoq.com/news/2026/04/deepmind-aletheia-agentic-math/ "Google’s Aletheia Advances the State of the Art of Fully Autonomous Agentic Math Research")
- ##### [Anthropic Introduces Agent-Based Code Review for Claude Code](http://www.infoq.com/news/2026/04/claude-code-review/ "Anthropic Introduces Agent-Based Code Review for Claude Code")
- ##### [AWS Announces General Availability of DevOps Agent for Automated Incident Investigation](http://www.infoq.com/news/2026/04/aws-devops-agent-ga/ "AWS Announces General Availability of DevOps Agent for Automated Incident Investigation")
- ##### [Pulumi Adds Full Bun Runtime Support](http://www.infoq.com/news/2026/04/pulumi-bun-support/ "Pulumi Adds Full Bun Runtime Support")
- ##### [CNCF Warns Kubernetes Alone Is Not Enough to Secure LLM Workloads](http://www.infoq.com/news/2026/04/kubernetes-secure-workloads/ "CNCF Warns Kubernetes Alone Is Not Enough to Secure LLM Workloads")
**The InfoQ** Newsletter
A round-up of last week’s content on InfoQ sent out every Tuesday. Join a community of over 250,000 senior developers. View an example
- Get a quick overview of content published on a variety of innovator and early adopter technologies
- Learn what you don’t know that you don’t know
- Stay up to date with the latest information from the topics you are interested in
Enter your e-mail address
Select your country - [x] I consent to InfoQ.com handling my data as explained in this Privacy Notice.
[Home](http://www.infoq.com/ "Home")[Create account](http://www.infoq.com/reginit.action "Create account")Log In[QCon Conferences](http://qconferences.com/ "QCon Conferences")Events[Write for InfoQ](http://www.infoq.com/write-for-infoq/ "Write for InfoQ")[InfoQ Editors](http://www.infoq.com/infoq-editors/ "InfoQ Editors")[About InfoQ](http://www.infoq.com/about-infoq/ "About InfoQ")[About C4Media](https://c4media.com/ "About C4Media")[Media Kit](https://get.infoq.com/infoq-mediakit/ "Media Kit")[InfoQ Developer Marketing Blog](https://devmarketing.c4media.com/?utm_source=infoq "InfoQ Developer Marketing Blog")[Diversity](https://c4media.com/diversity "Diversity")
#### Events
May 7, 2026
- ##### QCon AI Boston
June 1-2, 2026
June 10, 2026
- ##### QCon San Francisco
November 16-20, 2026
#### Follow us on
Youtube 232K FollowersLinkedin 26K FollowersRSS 19K ReadersX 57.1k FollowersFacebook 21K LikesBluesky New
#### Stay in the know
The InfoQ PodcastEngineering Culture PodcastThe Software Architects' Newsletter
General Feedback [feedback@infoq.com](mailto:feedback@infoq.com) Advertising [sales@infoq.com](mailto:sales@infoq.com) Editorial [editors@infoq.com](mailto:editors@infoq.com) Marketing [marketing@infoq.com](mailto:marketing@infoq.com)
InfoQ.com and all content copyright © 2006-2026 C4Media Inc.
Privacy Notice, Terms And Conditions, Cookie Policy
Close
[BT](http://www.infoq.com/int/bt/ "bt")