PharmaQA: Prompt-Based Molecular Representation Learning via Pharmacophore-Oriented Question Answering

Authors

  • Chengwei Ai School of Computer Science and Engineering, Central South University
  • Qiaozhen Meng School of Computer Science, Xiangtan University
  • Mengwei Sun School of Computer Science and Engineering, Central South University
  • Ruihan Dong Academy for Advanced Interdisciplinary Studies, Peking University
  • Hongpeng Yang Department of Computer Science and Engineering, University of South Carolina
  • Shiqiang Ma Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
  • Xiaoyi Liu School of Chinese Materia Medica, Beijing University of Chinese Medicine
  • Cheng Liang School of Information Science and Engineering, Shandong Normal University
  • Fei Guo School of Computer Science and Engineering, Central South University

DOI:

https://doi.org/10.1609/aaai.v40i24.39037

Abstract

Molecular representation plays a central role in computational drug discovery. Pharmacophores, functional groups responsible for molecular bioactivity, have been widely studied in cheminformatics. However, their incorporation into molecular representation learning, particularly in a context reasoning or generalization, remains relatively limited. To address this gap, we propose PharmaQA, a pharmacophore oriented question answering framework that formulates tailored prompts to extract context-aware molecular semantics. Rather than encoding pharmacophore features, PharmaQA learns to answer pharmacophore related queries. This design enables flexible reasoning across diverse tasks, including molecular property prediction, compound-target interaction prediction, and binding affinity estimation. Experimental results on benchmark datasets demonstrate that PharmaQA achieves competitive performance. In a ligand discovery case study using FDA-approved compounds, the framework identified potential inhibitors for three therapeutic targets, with strong docking performance. As a generalizable and modular solution, PharmaQA incorporates pharmacophoric knowledge into molecular embeddings, enhancing both predictive accuracy and interpretability in drug discovery applications.

Published

2026-03-14

How to Cite

Ai, C., Meng, Q., Sun, M., Dong, R., Yang, H., Ma, S., Liu, X., Liang, C., & Guo, F. (2026). PharmaQA: Prompt-Based Molecular Representation Learning via Pharmacophore-Oriented Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence, 40(24), 19580-19588. https://doi.org/10.1609/aaai.v40i24.39037

Issue

Section

AAAI Technical Track on Machine Learning I