Imagine trying to hold a conversation with someone who forgets everything you said the day before.
It would quickly become frustrating. This is exactly the kind of limitation many generative AI systems face today. Large Language Models (LLMs) may be rich in information, but they often fall short when it comes to memory, which is key to making interactions smarter and more personal.
For AI to reach its full potential, it needs more than just knowledge. Like the human brain, AI must be able to draw on past experiences to inform present interactions. While we do this naturally, AI depends on purpose-built memory systems to achieve the same effect and deliver conversations that feel more connected and meaningful.
Memory as the foundation for AI agents
Large Language Models come with extensive built-in knowledge about the world, which makes them highly capable when it comes to answering questions and generating content. Yet this knowledge is static and generalised, representing a broad, averaged understanding rather than something tailored to individual users. On their own, LLMs lack memory. They don’t retain information from one interaction to the next, and their context window is limited, meaning they can quickly lose track of longer conversations or personal details. This results in experiences that feel repetitive, impersonal and unable to evolve over time.
Human interactions work differently. When you speak to a customer service agent, what makes the conversation feel productive and relevant is their ability to remember previous touchpoints, keep track of an ongoing issue and adapt to your specific circumstances. It’s this memory that enables a more human, responsive exchange – something LLMs, without memory systems, find difficult to mirror.
Research from Deloitte shows just how important this is: 80% of customers want brands to recognise and respond to their individual needs, and two-thirds expect companies to anticipate them. Without memory, AI agents fall short. They can’t build context, remember preferences, or respond proactively, which means users are often left repeating themselves and settling for disjointed, one-size-fits-all experiences.
Reimagining AI with memory at its core
Modern AI systems are becoming more sophisticated by integrating various memory mechanisms that enhance user interactions and functionality. Retrieval-augmented generation (RAG) acts like a research assistant, fetching real-time information from external sources to provide more relevant and up-to-date responses. This is especially valuable in industries like healthcare, where AI can access the latest treatment guidelines, even after the system’s initial training. Meanwhile, semantic caching helps reduce processing loads by storing frequently used responses and serving them when a new query is similar enough in meaning, making high-traffic applications more efficient. This is particularly useful for customer service platforms, which can offer quick, consistent answers during peak times like holiday shopping seasons.
Other memory systems, such as agentic memory, take personalisation to the next level. This mechanism retains long-term user preferences across multiple interactions, enabling AI to remember details like seating preferences or hotel choices. Agent state functions as a short-term, real-time workspace that helps manage complex tasks or ongoing conversations. In e-commerce, this allows AI assistants to keep track of things like product comparisons or customer preferences, making the shopping experience more seamless and personalised. Together, these memory systems make AI interactions more intelligent and coherent, ensuring they don’t feel disconnected or forgetful.
Building trust and connection through intelligent design
When AI can dynamically recall information and tailor responses in real-time, customer interactions transform from transactional to truly personal. Instead of relying on static profiles, AI can adapt to customer behaviours, moods, and histories – creating experiences that feel remarkably intuitive.
These capabilities are already powering real-world AI agents today, quietly transforming the way we interact with technology. In healthcare, memory systems are helping platforms track medication allergies and past symptoms, making it possible to connect the dots between seemingly unrelated complaints and improve diagnostic accuracy. In finance, AI advisors are learning from past client behaviours – remembering investment preferences and spotting shifts in risk tolerance – to offer more personalised guidance. Even in our homes, smart systems are learning our habits, recognising simple cues like ‘movie night’ to automatically create the perfect setting, from lighting and temperature to opening your go-to streaming service.
Imagine an e-commerce assistant that not only remembers your previous purchases but understands your style preferences, anticipates seasonal needs, and adjusts tone based on your interaction patterns. This goes beyond convenience to create genuine connection and brand loyalty.
At scale, these memory capabilities enable businesses to identify patterns, predict trends, and continuously refine customer experiences, turning individual interactions into collective intelligence.
Performance built into every layer
Even the most advanced memory systems become useless if they’re too slow. Users abandon conversations when AI responses lag, making response time a critical success factor for AI applications. For example, studies show that 40% of website visitors abandon sites that take more than 3 seconds to load. AI interactions face even stricter standards: users expect near-human response times of less than a second.
This is where infrastructure becomes decisive. High-performance data storage systems, with sub-millisecond latency and massive scalability, provide the foundation needed for AI systems to retrieve and process memory at human-conversation speeds. By enabling seamless context switching and real-time adaptation, these tools help AI systems operate with human-like recall efficiency.
Say goodbye to forgetful AI
Leading AI platforms are now equipped with advanced memory systems that retain user preferences and past interactions across sessions. This allows them to deliver more personalised, seamless responses without repeating the same questions or missing key context. As AI becomes a core part of customer experience, it’s memory, not just intelligence, that will set competitors apart. Businesses that invest in strong memory infrastructure will be able to offer interactions that feel truly personal. In the future, AI success won’t just hinge on how smart systems are, but on how well they remember and use that memory to build genuine human connections.
About the Author
Manvinder Singh is VP of Product Management for AI at Redis. Redis is the world’s fastest data platform. We provide cloud and on-prem solutions for caching, vector search, and more that seamlessly fit into any tech stack. With fast setup and fast support, we make it simple for digital customers to build, scale, and deploy the fast apps our world runs on.
Featured image: Rawpixel