Towards a Common Metrics and Evaluation Framework for Assessment of Older Adults and Caregivers Interacting with Artificial Intelligence

Authors

  • Jasmin Marwad New England Robotics Validation and Experimentation (NERVE) Center, University of Massachusetts Lowell
  • Daisy M. Kiyemba New England Robotics Validation and Experimentation (NERVE) Center, University of Massachusetts Lowell
  • Elizabeth J. Carter Robotics Institute, Carnegie Mellon University
  • Adam Norton New England Robotics Validation and Experimentation (NERVE) Center, University of Massachusetts Lowell

DOI:

https://doi.org/10.1609/aaaiss.v4i1.31784

Abstract

Artificial intelligence (AI) has applications in assisting older adults to age in place and provide support to them and their caregivers as their cognition declines with age. However, effective assessment methods of this technology are needed in order to benchmark their performance and a common set of metrics and evaluation methods would enable such assessments to be compared to one another. To this end, we propose a common framework for human-AI interaction involving care recipients and their care networks. From the results of a literature review exercise, a framework with sample metrics, related measures, qualified evaluation tools, and contextual factors that impact assessment are reviewed. This paper provides a sample of common metrics in one of the framework’s measurement spaces (human-AI interaction) and discusses some of the impacts of contextual factors and how use of the common metrics and evaluation framework can be used for meta-analysis and to guide future research. Additional future articles are planned to cover the other measurement spaces in the framework (system performance, task performance, and well-being), including their particular common metrics and evaluation methods. This effort aims to provide guidance for researchers in this domain as well as highlight measurement gaps that can be filled by future research.

Downloads

Published

2024-11-08

How to Cite

Marwad, J., Kiyemba, D. M., Carter, E. J., & Norton, A. (2024). Towards a Common Metrics and Evaluation Framework for Assessment of Older Adults and Caregivers Interacting with Artificial Intelligence. Proceedings of the AAAI Symposium Series, 4(1), 137-145. https://doi.org/10.1609/aaaiss.v4i1.31784

Issue

Section

Artificial Intelligence for Aging in Place