Expanding context windows degrades cooperation between AI agents in social dilemmas, according to new research analyzing 7 LLMs across 4 games over 500 rounds. The phenomenon, termed the "memory curse," occurred in 18 of 28 model-game settings, challenging the assumption that expanded capabilities are straightforward improvements.
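To make the setup concrete, the sketch below shows the skeleton of such an iterated social dilemma, where the context-window manipulation corresponds to how many past rounds each agent can see. The agent interface, function names, and the fixed-window truncation are illustrative assumptions, not the paper's code.

```python
# Minimal sketch of an iterated social dilemma with a bounded history window.
# NOTE: agent interface and truncation scheme are assumptions for illustration.

def play_iterated_game(agent_a, agent_b, rounds=500, window=10):
    """Run `rounds` rounds; each agent sees only the last `window` outcomes."""
    history = []  # list of (action_a, action_b) tuples
    for _ in range(rounds):
        visible = history[-window:]  # enlarging `window` is the manipulation under study
        a = agent_a(visible)  # agent maps visible history -> "cooperate" | "defect"
        b = agent_b(visible)
        history.append((a, b))
    return history

# Trivial always-cooperate agents, just to exercise the loop:
print(play_iterated_game(lambda h: "cooperate", lambda h: "cooperate", rounds=3))
```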
Expanded Memory Erodes Forward-Looking Intent
The underlying cause is not that agents become paranoid or distrustful; it is subtler: expanded memory erodes forward-looking intent. Analyzing 378,000 reasoning traces, the researchers found that with longer accessible history, agents focus more on past events than on future cooperative goals. Critically, deliberation amplifies the problem: explicit Chain-of-Thought reasoning paradoxically deepens the cooperative breakdown.
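One way to operationalize this kind of temporal-focus measurement is a simple marker count over traces; the marker lists and scoring rule below are illustrative assumptions, not the paper's method.

```python
# Score a reasoning trace by temporal orientation.
# NOTE: marker lists and the scoring rule are illustrative assumptions.
PAST_MARKERS = ["last round", "previously", "they defected", "history shows"]
FUTURE_MARKERS = ["next round", "going forward", "long-term", "future payoff"]

def temporal_focus(trace: str) -> float:
    """Return a score in [-1, 1]: -1 = fully past-focused, +1 = fully future-focused."""
    text = trace.lower()
    past = sum(text.count(m) for m in PAST_MARKERS)
    future = sum(text.count(m) for m in FUTURE_MARKERS)
    total = past + future
    return 0.0 if total == 0 else (future - past) / total

print(temporal_focus("They defected last round, so history shows I should retaliate."))  # -1.0
```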
Memory Content, Not Length, Triggers the Effect
The researchers demonstrated that memory content, rather than prompt length, drives the memory curse. When they replaced the actual history with synthetic cooperative records while holding prompt length constant, cooperation was substantially restored. This memory-sanitization experiment shows that the trigger is the memory content itself, not the additional tokens or computational overhead.
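The sanitization idea reduces to a length-preserving swap of records; a minimal sketch follows, where the record schema is a hypothetical stand-in for whatever history format the agents actually consume.

```python
# Length-preserving memory sanitization: replace each real round record with a
# synthetic all-cooperate one. NOTE: the record schema is a hypothetical stand-in.

def sanitize_history(history: list[dict]) -> list[dict]:
    return [
        {"round": h["round"], "self": "cooperate", "opponent": "cooperate"}
        for h in history  # one synthetic record per real record keeps prompt length fixed
    ]

real = [
    {"round": 1, "self": "cooperate", "opponent": "defect"},
    {"round": 2, "self": "defect", "opponent": "defect"},
]
clean = sanitize_history(real)
assert len(clean) == len(real)  # same length, different content
```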
Ablating explicit Chain-of-Thought reasoning often reduced the collapse, showing that more careful reasoning can actually worsen outcomes in multi-agent settings. This counterintuitive finding suggests that current reasoning approaches may not transfer effectively to social dilemmas.
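The ablation itself can be as small as a one-line change to the decision prompt; the wording below is an assumption about how such a comparison might be framed, not the study's exact prompts.

```python
# Chain-of-Thought ablation: the two conditions differ only in whether the
# prompt requests explicit reasoning. NOTE: prompt wording is an assumption.
BASE = "You are playing an iterated Prisoner's Dilemma. History:\n{history}\n"

def build_prompt(history: str, with_cot: bool) -> str:
    prompt = BASE.format(history=history)
    if with_cot:
        prompt += "Think step by step about the history, then answer: cooperate or defect."
    else:
        prompt += "Answer with a single word: cooperate or defect."
    return prompt
```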
Fine-Tuning on Forward-Looking Patterns Mitigates the Decay
The researchers found that fine-tuning models on forward-looking reasoning patterns mitigates the cooperative decay and transfers to new games. This intervention targets the root cause: the shift in temporal focus that longer context windows induce. The finding suggests that addressing the memory curse may require architectural or training changes rather than simply adjusting context window size.
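One plausible way to assemble such fine-tuning data is to keep only strongly future-oriented traces as supervision targets. In the sketch below, the chat-style record format, filename, and filtering threshold are assumptions, and `score` could be any trace-level temporal classifier (e.g., the scorer sketched earlier).

```python
import json

# Build a supervised fine-tuning set from forward-looking reasoning traces.
# NOTE: record schema, filename, and threshold are illustrative assumptions.
def to_sft_record(prompt: str, trace: str, action: str) -> dict:
    return {
        "messages": [
            {"role": "user", "content": prompt},
            {"role": "assistant", "content": f"{trace}\nAction: {action}"},
        ]
    }

def build_dataset(samples, score, path="forward_looking_sft.jsonl", threshold=0.5):
    """Keep (prompt, trace, action) triples whose trace scores as future-focused."""
    with open(path, "w") as f:
        for prompt, trace, action in samples:
            if score(trace) >= threshold:
                f.write(json.dumps(to_sft_record(prompt, trace, action)) + "\n")
```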
Implications for Multi-Agent AI Systems
This research demonstrates that memory is an active determinant of multi-agent behavior: it can destabilize cooperation through the reasoning patterns it elicits. The findings have significant implications for deploying AI agents in collaborative settings, suggesting that expanded capabilities require careful evaluation of their effects on emergent behavior rather than an assumption of monotonic improvement.
Key Takeaways
- Expanding context windows degraded cooperation in 18 of 28 model-game settings across 7 LLMs and 4 games over 500 rounds
- The memory curse stems from expanded memory eroding forward-looking intent, causing agents to focus on past events rather than future cooperation
- Memory content, not length, triggers the effect—replacing history with synthetic cooperative records restores cooperation while holding prompt length constant
- Chain-of-Thought reasoning paradoxically amplifies the memory curse, with more deliberation worsening cooperative breakdown
- Fine-tuning models on forward-looking reasoning patterns mitigates the decay and transfers to new games