Can Large Language Models Grasp 3D Medical Anatomy Shapes? (Student Abstract)

Authors

  • Yao Gao KU Leuven, 3000 Leuven, Belgium University Hospitals Leuven (UZ Leuven), 3000 Leuven, Belgium
  • Feng Li KU Leuven, 3000 Leuven, Belgium University Hospitals Leuven (UZ Leuven), 3000 Leuven, Belgium
  • Jeroen Van Dessel KU Leuven, 3000 Leuven, Belgium University Hospitals Leuven (UZ Leuven), 3000 Leuven, Belgium
  • Yi Sun KU Leuven, 3000 Leuven, Belgium University Hospitals Leuven (UZ Leuven), 3000 Leuven, Belgium
  • Robin Willaert KU Leuven, 3000 Leuven, Belgium University Hospitals Leuven (UZ Leuven), 3000 Leuven, Belgium

DOI:

https://doi.org/10.1609/aaai.v40i48.42218

Abstract

What if the next generation of human-computer interaction is not a screen... but a conversation? Large Language Models (LLMs) offer a new paradigm for interacting with computers through text, but they lack shape reasoning capabilities. We introduce Textual Anatomy Encoding (TAE), a workflow that connects LLMs with 3D anatomies. TAE employs clinician-validated semantic annotations and rule-based prompts to achieve deterministic and interpretable landmark localization. The results indicate that TAE enables LLMs to move beyond textual knowledge, achieving an accurate understanding of anatomical localization. This framework opens opportunities for diagnosis, surgical planning, and scalable medical annotation, positioning LLMs as a foundation for next-generation human–computer interaction in healthcare.

Downloads

Published

2026-03-14

How to Cite

Gao, Y., Li, F., Van Dessel, J., Sun, Y., & Willaert, R. (2026). Can Large Language Models Grasp 3D Medical Anatomy Shapes? (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41214–41216. https://doi.org/10.1609/aaai.v40i48.42218