Full metadata record

DC Field Value Language
dc.contributor.authorBhattacharjee, Satadeep-
dc.contributor.authorLee, Seung-Cheol-
dc.date.accessioned2026-03-27T05:30:06Z-
dc.date.available2026-03-27T05:30:06Z-
dc.date.created2026-03-24-
dc.date.issued2026-01-
dc.identifier.issn2470-0045-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/154498-
dc.description.abstractThe recently proposed physics-based framework by Huo and Johnson [arXiv:2504.04600]. models the attention mechanism of large language models as an interacting two-body spin system, offering a first-principles explanation for phenomena like repetition and bias. Building on this hypothesis, we extract the complete query-key weight matrices from a production-grade GPT-2 model and derive the corresponding effective Hamiltonian for every attention head. From these Hamiltonians we obtain analytic phase boundary and logit gap criteria that predict which token should dominate the next-token distribution for a given context. A systematic evaluation on 144 heads across 20 factual-recall prompts reveals a strong negative correlation between the theoretical logit gaps and the model&apos;s empirical token rankings (π‘Ÿβ‰ˆβˆ’0.70, 𝑝<10βˆ’3). Targeted ablations further show that suppressing the heads most aligned with the spin-bath predictions induces the anticipated shifts in output probabilities, confirming a causal link rather than a coincidental association. Taken together, our findings provide the first strong empirical evidence for the spin-bath analogy in a production-grade model. In this work, we utilize the context-field lens, which provides physics-grounded interpretability and motivates the development of novel generative models bridging theoretical condensed-matter physics and artificial intelligence.-
dc.languageEnglish-
dc.publisherAMER PHYSICAL SOC-
dc.titleTesting the spin-bath view of self-attention: A Hamiltonian analysis of GPT-2 transformer-
dc.typeArticle-
dc.identifier.doi10.1103/rkyb-d7d2-
dc.description.journalClass1-
dc.identifier.bibliographicCitationPhysical Review E, v.113, no.1-
dc.citation.titlePhysical Review E-
dc.citation.volume113-
dc.citation.number1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.identifier.wosid001696126800001-
dc.identifier.scopusid2-s2.0-105029375952-
dc.relation.journalWebOfScienceCategoryPhysics, Fluids & Plasmas-
dc.relation.journalWebOfScienceCategoryPhysics, Mathematical-
dc.relation.journalResearchAreaPhysics-
dc.type.docTypeArticle-
Appears in Collections:
KIST Article > 2026
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE