Evaluation Dimensions for Assessing Question Answer Systems for Lay Users: The Case of DiseaseGuru
Abstract
Question answering (QA) systems can serve as vital tools for addressing lay users' information needs in healthcare. While QA systems have the potential to lessen information overload and provide quality answers, their performance must be evaluated holistically. Here we propose multiple dimensions for this purpose: lexical similarity, semantic similarity, absence of contradictions, and readability of responses. We then use these dimensions to evaluate DiseaseGuru, a chronic-disease QA system we developed that is based on a generative large language model and integrates knowledge graph technology to provide quality responses to lay users. We present results comparing DiseaseGuru with three benchmark algorithms across the dimensions. We also propose metrics for lay users and medical professionals for a future field study of the system.
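For illustration, two of the proposed dimensions could be approximated as follows. This is a minimal sketch, not the paper's actual metrics: `lexical_f1` is a simple token-overlap F1 (a stand-in for lexical-similarity measures such as ROUGE), and `flesch_reading_ease` uses a crude vowel-group syllable heuristic for the readability dimension.

```python
import re

def lexical_f1(candidate: str, reference: str) -> float:
    """Token-overlap F1 between a system answer and a reference answer.
    A simplified stand-in for lexical-similarity metrics like ROUGE."""
    cand = re.findall(r"[a-z']+", candidate.lower())
    ref = re.findall(r"[a-z']+", reference.lower())
    if not cand or not ref:
        return 0.0
    # Count reference tokens, then consume them as candidate tokens match.
    ref_counts: dict[str, int] = {}
    for tok in ref:
        ref_counts[tok] = ref_counts.get(tok, 0) + 1
    overlap = 0
    for tok in cand:
        if ref_counts.get(tok, 0) > 0:
            overlap += 1
            ref_counts[tok] -= 1
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)

def flesch_reading_ease(text: str) -> float:
    """Approximate Flesch Reading Ease; higher scores mean easier text.
    Syllables are estimated by counting vowel groups (a rough heuristic)."""
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    words = re.findall(r"[A-Za-z']+", text)
    if not words:
        return 0.0
    def syllables(word: str) -> int:
        return max(1, len(re.findall(r"[aeiouy]+", word.lower())))
    total_syllables = sum(syllables(w) for w in words)
    return (206.835
            - 1.015 * (len(words) / sentences)
            - 84.6 * (total_syllables / len(words)))
```

In practice, each system answer would be scored against a gold reference for lexical (and, separately, embedding-based semantic) similarity, while readability would be scored on the answer text alone.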