Symbol Description Reading


  • Karol Lynch IBM Research Europe, Dublin, Ireland
  • Bradley Eck IBM Research Europe, Dublin, Ireland
  • Joern Ploennigs IBM Research Europe, Dublin, Ireland; University of Rostock, Rostock, Germany



Information Extraction, Generative AI, Information and Knowledge Access, Track: Emerging Applications


Mathematical formulas give concise representations of a document's key ideas in many natural sciences and engineering domains. The symbols that make up formulas carry semantic meaning that may differ by document or equation. What does a given symbol mean in a particular paper? Interpreting the symbols that comprise formulas requires identifying their descriptions in the surrounding text. We approach this task of symbol description reading as an application of current AI technologies, targeting the tuning of large language models for particular domains and the automation of machine learning. Our pipeline integrates AI question answering and natural language processing to read symbol descriptions. We consider extractive and generative AI model variations and apply our pipeline to two example tasks of symbol description reading. Promising results motivate wider deployment, for which we describe a microservice architecture and related challenges.
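
To make the task's input and output concrete, the following is a minimal sketch of symbol description reading, assuming a simple pattern-based baseline. It is not the pipeline described in the abstract, which relies on question answering models; the function name `read_symbol_description`, the cue words, and the example sentence are all hypothetical and chosen only for illustration.

```python
import re
from typing import Optional


def read_symbol_description(symbol: str, context: str) -> Optional[str]:
    """Return the description of `symbol` found in `context`, or None.

    Hypothetical rule-based stand-in for a learned reader: it only
    recognizes phrases such as "<symbol> is the ..." or
    "<symbol> denotes ...".
    """
    sym = re.escape(symbol)
    pattern = (
        rf"(?<!\w){sym}(?!\w)"                # the symbol as a standalone token
        r"\s+(?:is|denotes|represents)\s+"    # assumed cue words
        r"(?:(?:the|an|a)\s+)?"               # optional article, dropped from the result
        r"(.+?)"                              # the description itself (shortest match)
        r"(?=\s+and\s+|[,.;]|$)"              # stop at "and", punctuation, or end of text
    )
    match = re.search(pattern, context)
    return match.group(1).strip() if match else None


# Illustrative sentence, not taken from the paper.
example_context = (
    "The pressure drop is p = k v^2, where k is the loss coefficient "
    "and v denotes the flow velocity."
)

if __name__ == "__main__":
    for s in ("k", "v", "p"):
        print(s, "->", read_symbol_description(s, example_context))
```

In this sketch the symbol `p` yields no description because the sentence never states one with a recognized cue word. Cases like this, where the description is implicit or phrased freely, are where a learned extractive or generative reader would be expected to help.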




How to Cite

Lynch, K., Eck, B., & Ploennigs, J. (2024). Symbol Description Reading. Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 22934-22940.