Language Models Do Not Embed Numbers Continuously (Student Abstract)

Alex Davies; Roussel Nzoyem Ngueguin; Nirav Ajmeri; Telmo de Menezes E Silva Filho

doi:10.1609/aaai.v40i48.42206

Language Models Do Not Embed Numbers Continuously (Student Abstract)

Authors

Alex Davies University of Bristol
Roussel Nzoyem Ngueguin University of Bristol
Nirav Ajmeri University of Bristol
Telmo de Menezes E Silva Filho University of Bristol

DOI:

https://doi.org/10.1609/aaai.v40i48.42206

Abstract

We evaluate how well large language model embeddings represent continuous numerical values across different precisions and ranges. Using linear models and principal component analysis on models from major providers, we show that while embeddings can reconstruct numbers with high fidelity (R2 ≥ 0.95), they introduce substantial noise, with principal components explaining less than 40% of embedding variance. Performance degrades with increasing decimal precision and mixed-sign values, revealing fundamental limitations in how these models encode numerical information.

AAAI-26 / IAAI-26 / EAAI-26 Proceedings Cover

Downloads

Published

2026-03-14

How to Cite

Davies, A., Nzoyem Ngueguin, R., Ajmeri, N., & de Menezes E Silva Filho, T. (2026). Language Models Do Not Embed Numbers Continuously (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41180–41181. https://doi.org/10.1609/aaai.v40i48.42206

Download Citation

Issue

Vol. 40 No. 48: EAAI-26 AI for Education, Model AI Assignments, AAAI-26 Emerging Trends, Doctoral Consortium, Student Abstracts, Undergraduate Consortium and Demonstrations

Section

AAAI Student Abstract and Poster Program

Language Models Do Not Embed Numbers Continuously (Student Abstract)

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information