Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis

Authors

  • Yiliang Chen Hong Kong Polytechnic University
  • Steven SC Ho Hong Kong Polytechnic University
  • Cheng Xu Hong Kong Polytechnic University
  • Yao Jie Xie Hong Kong Polytechnic University
  • Wing-Fai Yeung Hong Kong Polytechnic University
  • Shengfeng He Singapore Management University
  • Jing Qin Hong Kong Polytechnic University

DOI:

https://doi.org/10.1609/aaai.v39i2.32230

Abstract

Tongue diagnosis is a vital tool in both Western and Traditional Chinese Medicine, providing key insights into a patient's health by analyzing tongue attributes. The COVID-19 pandemic has heightened the need for accurate remote medical assessments, emphasizing the importance of precise tongue attribute recognition via telehealth. To address this, we propose a Sign-Oriented multi-label Attributes Detection Framework. Our approach begins with an adaptive tongue feature extraction module that standardizes tongue images and mitigates environmental factors. This is followed by a Sign-oriented Network (SignNet) that identifies specific tongue attributes, emulating the diagnostic process of experienced practitioners and enabling comprehensive health evaluations. To validate our methodology, we developed an extensive tongue image dataset specifically designed for telemedicine. Unlike existing datasets, ours is tailored for remote diagnosis, with a comprehensive set of attribute labels. This dataset will be openly available, providing a valuable resource for research. Initial tests have shown improved accuracy in detecting various tongue attributes, highlighting our framework's potential as an essential tool for remote medical assessments.

Published

2025-04-11

How to Cite

Chen, Y., Ho, S. S., Xu, C., Xie, Y. J., Yeung, W.-F., He, S., & Qin, J. (2025). Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis. Proceedings of the AAAI Conference on Artificial Intelligence, 39(2), 2302–2310. https://doi.org/10.1609/aaai.v39i2.32230

Issue

Section

AAAI Technical Track on Computer Vision I