Your Prompt Is My Command: On Assessing the Human-Centred Generality of Multimodal Models (Abstract Reprint)

Authors

  • Wout Schellaert (VRAIN, Universitat Politècnica de València, Spain)
  • Fernando Martínez-Plumed (VRAIN, Universitat Politècnica de València, Spain)
  • Karina Vold (Institute for the History and Philosophy of Science and Technology, University of Toronto, Canada)
  • John Burden (Leverhulme Centre for the Future of Intelligence, University of Cambridge, UK)
  • Pablo A. M. Casares (Universidad Complutense de Madrid, Spain)
  • Bao Sheng Loe (Psychometrics Centre, University of Cambridge, UK)
  • Roi Reichart (Technion - Israel Institute of Technology, Israel)
  • Sean Ó hÉigeartaigh (Centre for the Study of Existential Risk, University of Cambridge, UK)
  • Anna Korhonen (Language Technology Laboratory (LTL), University of Cambridge, UK)
  • José Hernández-Orallo (VRAIN, Universitat Politècnica de València, Spain)

DOI:

https://doi.org/10.1609/aaai.v38i20.30612

Keywords:

Journal Track

Abstract

Even with obvious deficiencies, large prompt-commanded multimodal models are proving to be flexible cognitive tools representing an unprecedented generality. But the directness, diversity, and degree of user interaction create a distinctive “human-centred generality” (HCG), rather than a fully autonomous one. HCG implies that, for a specific user, a system is only as general as it is effective for the user’s relevant tasks and their prevalent ways of prompting. A human-centred evaluation of general-purpose AI systems therefore needs to reflect the personal nature of interaction, tasks, and cognition. We argue that the best way to understand these systems is as highly coupled cognitive extenders, and to analyse the bidirectional cognitive adaptations between them and humans. In this paper, we give a formulation of HCG, as well as a high-level overview of the elements and trade-offs involved in the prompting process. We end the paper by outlining some essential research questions and suggestions for improving evaluation practices, which we envision as characteristic of the evaluation of general artificial intelligence in the future.

Published

2024-03-24

How to Cite

Schellaert, W., Martínez-Plumed, F., Vold, K., Burden, J., Casares, P. A. M., Loe, B. S., Reichart, R., Ó hÉigeartaigh, S., Korhonen, A., & Hernández-Orallo, J. (2024). Your Prompt Is My Command: On Assessing the Human-Centred Generality of Multimodal Models (Abstract Reprint). Proceedings of the AAAI Conference on Artificial Intelligence, 38(20), 22712-22712. https://doi.org/10.1609/aaai.v38i20.30612