Deven Mahesh Mistry, Anooshka Bajaj, Yash Aggarwal, Sahaj Singh Maini, Zoran Tiganj
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the
Association for Computational Linguistics (NAACL 2025)
Pages: 8893–8911 | Albuquerque, New Mexico
Abstract:
We investigate in-context temporal biases in attention heads
and transformer outputs. Using cognitive science methodologies, we analyze attention scores
and outputs of the GPT-2 models of varying sizes. Across attention heads, we observe effects
characteristic of human episodic memory, including temporal contiguity, primacy and recency.
Transformer outputs demonstrate a tendency toward in-context serial recall. Importantly,
this effect is eliminated after the ablation of the induction heads, which are the driving
force behind the contiguity effect. Our findings offer insights into how transformers
organize information temporally during in-context learning, shedding light on their
similarities and differences with human memory and learning.
Citation (APA):
Mistry, D. M., Bajaj, A., Aggarwal, Y., Maini, S. S., & Tiganj, Z. (2025). Emergence of
Episodic Memory in Transformers: Characterizing Changes in Temporal Structure of Attention
Scores During Training. ACL Anthology, 8893–8911.
https://aclanthology.org/2025.naacl-long.448/
Anooshka Bajaj, Deven Mahesh Mistry, Sahaj Singh Maini, Yash Aggarwal, Zoran Tiganj
Proceedings of the 19th Conference of the European Chapter of the Association for
Computational Linguistics (EACL 2026)
Pages: 7562–7581 | Rabat, Morocco
Abstract:
In-context learning depends not only on what appears in the
prompt but also on when it appears. To isolate this temporal component from semantic
confounds, we construct prompts with repeated anchor tokens and average the model’s
predictions over hundreds of random permutations of the intervening context. This approach
ensures that any observed position-dependent effects are driven purely by temporal structure
rather than token identity or local semantics. Across four transformer LLMs and three
state-space/recurrent models, we observe a robust serial recall signature: models allocate
disproportionate probability mass to the tokens that previously followed the anchor, but the
strength of this signal is modulated by serial position, yielding model-specific
primacy/recency profiles. We then introduce an overlapping-episode probe in which only a
short cue from one episode is re-presented; retrieval is reliably weakest for episodes
embedded in the middle of the prompt, consistent with "lost-in-the-middle" behavior.
Mechanistically, ablating high-induction-score attention heads in transformers reduces
serial recall and episodic separation. For state-space models, ablating a small fraction of
high-attribution channels produces analogous degradations, suggesting a sparse subspace
supporting induction-style copying. Together, these results clarify how temporal biases
shape retrieval across architectures and provide controlled probes for studying long-context
behavior.
Citation (APA):
Bajaj, A., Mistry, D. M., Maini, S. S., Aggarwal, Y., & Tiganj, Z. (2026).
Beyond Semantics: How Temporal Biases Shape Retrieval in Transformer and State-Space Models.
arXiv preprint arXiv:2510.22752.
https://aclanthology.org/2026.eacl-long.355/
Anooshka Bajaj, Deven Mahesh Mistry, Sahaj Singh Maini, Yash Aggarwal, Billy Dickson, Zoran
Tiganj
arXiv preprint
Abstract:
Large language models (LLMs) exhibit strong in-context
learning capabilities, but how they track and retrieve information from context remains
underexplored. Drawing on the free recall paradigm in cognitive science (where participants
recall list items in any order), we show that several open-source LLMs consistently display
a serial-recall-like pattern, assigning peak probability to tokens that immediately follow a
repeated token in the input sequence. Through systematic ablation experiments, we show that
induction heads, specialized attention heads that attend to the token following a previous
occurrence of the current token, play an important role in this phenomenon. Removing heads
with a high induction score substantially reduces the +1 lag bias, whereas ablating random
heads does not reproduce the same reduction. We also show that removing heads with high
induction scores impairs the performance of models prompted to do serial recall using
few-shot learning to a larger extent than removing random heads. Our findings highlight a
mechanistically specific connection between induction heads and temporal context processing
in transformers, suggesting that these heads are especially important for ordered retrieval
and serial-recall-like behavior during in-context learning.
Citation (APA):
Bajaj, A., Mistry, D. M., Maini, S. S., Aggarwal, Y., Dickson B., & Tiganj, Z. (2026).
Temporal Dependencies in In-Context Learning: The Role of Induction Heads.
arXiv preprint arXiv:2604.01094.
https://arxiv.org/abs/2604.01094
Anooshka Bajaj, Zoran Tiganj
arXiv preprint
Abstract:
Large language models (LLMs) increasingly operate in
environments where they encounter social information such as other agents' answers, tool
outputs, or human recommendations. In humans, such inputs influence judgments in ways that
depend on the source's credibility and the strength of consensus. This paper investigates
whether LLMs exhibit analogous patterns of influence and whether they privilege feedback
from humans over feedback from other LLMs. Across three binary decision-making tasks,
reading comprehension, multi-step reasoning, and moral judgment, we present four
instruction-tuned LLMs with prior responses attributed either to friends, to human experts,
or to other LLMs. We manipulate whether the group is correct and vary the group size. In a
second experiment, we introduce direct disagreement between a single human and a single LLM.
Across tasks, models conform significantly more to responses labeled as coming from human
experts, including when that signal is incorrect, and revise their answers toward experts
more readily than toward other LLMs. These results reveal that expert framing acts as a
strong prior for contemporary LLMs, suggesting a form of credibility-sensitive social
influence that generalizes across decision domains.
Citation (APA):
Bajaj, A., & Tiganj, Z. (2026).
Who Do LLMs Trust? Human Experts Matter More Than Other LLMs.
arXiv preprint arXiv:2602.13568.
https://arxiv.org/abs/2602.13568