Linear Decoding of Morphology Relations in Language Models (Student Abstract)

Authors

  • Eric Xia Brown University
  • Jugal Kalita University of Colorado Colorado Springs

DOI:

https://doi.org/10.1609/aaai.v39i28.35315

Abstract

The recent success of transformer language models owes much to their conversational fluency, which includes linguistic and morphological proficiency. An affine Taylor approximation has been found to be a good approximation for transformer computations over certain factual and encyclopedic relations. We show that the truly linear approximation W s, where s is a early layer representation of the base form and W is a local model derivative, is necessary and sufficient to approximate morphological derivation, achieving above 80% top-1 accuracy across most morphological tasks in the Bigger Analogy Test Set. We argue that many morphological forms in transformer models are likely linearly encoded.

Published

2025-04-11

How to Cite

Xia, E., & Kalita, J. (2025). Linear Decoding of Morphology Relations in Language Models (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 39(28), 29532–29534. https://doi.org/10.1609/aaai.v39i28.35315