Saltar al contenido principal

Escribe una PREreview

Context Curves Behavior: Measuring AI Relational Dynamics with ΔRCI

Publicada
Servidor
Preprints.org
DOI
10.20944/preprints202601.1881.v2

Current AI evaluation focuses on accuracy and safety benchmarks, neglecting relational dynamics—how models utilize conversational context. We introduce ΔRCI (Delta Relational Coherence Index), a novel metric measuring context sensitivity through a three-condition protocol (TRUE/COLD/SCRAMBLED). Across 1,000 trials (90,000 API calls) spanning 7 models and 2 epistemological domains (6 models in medical due to safety filtering), we find: (1) Instrument validation: TRUE (coherent history) > SCRAMBLED (randomized) > COLD (none) in 14/16 model-domain combinations, demonstrating that ΔRCI measures structured context utilization, not mere token presence; (2) Vendor-specific patterns in context utilization (F(2,697)=6.52, p=0.0015); (3) Protocol sensitivity: Cross-domain comparisons are affected by methodological differences between our philosophy and medical experiments, limiting domain-level conclusions in this paper; (4) Safety interference: Progressive content filtering by vendors affects research accessibility. To our knowledge, ΔRCI provides the first cosine-similarity-based instrument for measuring AI context sensitivity. A follow-up study with standardized protocols across 14 models is forthcoming.

Puedes escribir una PREreview de Context Curves Behavior: Measuring AI Relational Dynamics with ΔRCI. Una PREreview es una revisión de un preprint y puede variar desde unas pocas oraciones hasta un extenso informe, similar a un informe de revisión por pares organizado por una revista.

Antes de comenzar

Te pediremos que inicies sesión con tu ORCID iD. Si no tienes un iD, puedes crear uno.

¿Qué es un ORCID iD?

Un ORCID iD es un identificador único que te distingue de otros/as con tu mismo nombre o uno similar.

Comenzar ahora