State Drift in Language-Conditioned Autonomous Agents: A Failure Mode of Long-Horizon Reasoning

Server: Preprints.org
DOI: 10.20944/preprints202601.0910.v1

Language-conditioned autonomous agents rely on natural language to represent internal state, reason about goals, and select actions. Despite recent advances in reasoning and planning, such agents remain unreliable in long-horizon tasks. In this work, we identify state drift as a fundamental and underexplored failure mode, characterized by persistent divergence between an agent’s internal textual state and the true environment state over time. We study state drift through controlled experiments with language-driven agents operating in long-horizon settings. By comparing fact-level internal belief representations against ground-truth environment states across sequential interactions, we show that state drift can arise and persist even when individual reasoning steps are locally coherent and logically valid. This indicates that long-horizon failures cannot be explained solely by step-wise reasoning errors. Moreover, we find that increasing context capacity does not mitigate state drift in deterministic environments, suggesting that the phenomenon is not simply a consequence of limited memory or forgetting. Instead, our results point to a structural limitation of using natural language as an internal state representation. Ensuring semantic state consistency over extended horizons thus emerges as a distinct and unresolved challenge for language-conditioned autonomy, with important implications for the design and evaluation of reliable autonomous agents.
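The fact-level comparison described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the dictionary encoding of facts, the function names `drift` and `drift_trace`, the mismatch-fraction metric, and the toy episode are all assumptions chosen for illustration.

```python
from typing import Dict, List

# Hypothetical encoding: each state is a mapping from fact name to value.
Facts = Dict[str, str]


def drift(belief: Facts, truth: Facts) -> float:
    """Fraction of ground-truth facts that the belief state omits or gets wrong."""
    if not truth:
        return 0.0
    wrong = sum(1 for key, value in truth.items() if belief.get(key) != value)
    return wrong / len(truth)


def drift_trace(beliefs: List[Facts], truths: List[Facts]) -> List[float]:
    """Per-step drift over one episode; both lists are indexed by time step."""
    if len(beliefs) != len(truths):
        raise ValueError("beliefs and truths must cover the same steps")
    return [drift(b, t) for b, t in zip(beliefs, truths)]


# Toy episode (invented data): the agent misses the key pickup at step 1
# and carries the stale fact forward, even though each later update is
# consistent with its own previous belief.
truths = [
    {"door": "closed", "key": "table"},
    {"door": "closed", "key": "agent"},
    {"door": "open", "key": "agent"},
]
beliefs = [
    {"door": "closed", "key": "table"},
    {"door": "closed", "key": "table"},
    {"door": "open", "key": "table"},
]

trace = drift_trace(beliefs, truths)
print(trace)
```

In this toy trace the drift value becomes nonzero at step 1 and stays nonzero at step 2, which is the kind of persistent divergence the abstract refers to. A real evaluation would also need a way to extract facts from the agent's textual state, which this sketch does not attempt.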
