Your Prompt Is My Command: On Assessing the Human-Centred Generality of Multimodal Models (Abstract Reprint)

Wout Schellaert; Fernando Martínez-Plumed; Karina Vold; John Burden; Pablo A. M. Casares; Bao Sheng Loe; Roi Reichart; Sean Ó hÉigeartaigh; Anna Korhonen; José Hernández-Orallo

doi:10.1609/aaai.v38i20.30612

Authors

Wout Schellaert VRAIN, Universitat Politècnica de València, Spain
Fernando Martínez-Plumed VRAIN, Universitat Politècnica de València, Spain
Karina Vold Institute for the History and Philosophy of Science and Technology, University of Toronto, Canada
John Burden Leverhulme Centre for the Future of Intelligence, University of Cambridge, UK
Pablo A. M. Casares Universidad Complutense de Madrid, Spain
Bao Sheng Loe Psychometrics Centre, University of Cambridge, UK
Roi Reichart Technion - Israel Institute of Technology, Israel
Sean Ó hÉigeartaigh Centre for the Study of Existential Risk, University of Cambridge, UK
Anna Korhonen Language Technology Laboratory (LTL), University of Cambridge, UK
José Hernández-Orallo VRAIN, Universitat Politècnica de València, Spain

DOI:

https://doi.org/10.1609/aaai.v38i20.30612

Keywords:

Journal Track

Abstract

Even with obvious deficiencies, large prompt-commanded multimodal models are proving to be flexible cognitive tools representing an unprecedented generality. But the directness, diversity, and degree of user interaction create a distinctive “human-centred generality” (HCG), rather than a fully autonomous one. HCG implies that —for a specific user— a system is only as general as it is effective for the user’s relevant tasks and their prevalent ways of prompting. A human-centred evaluation of general-purpose AI systems therefore needs to reflect the personal nature of interaction, tasks and cognition. We argue that the best way to understand these systems is as highly-coupled cognitive extenders, and to analyse the bidirectional cognitive adaptations between them and humans. In this paper, we give a formulation of HCG, as well as a high-level overview of the elements and trade-offs involved in the prompting process. We end the paper by outlining some essential research questions and suggestions for improving evaluation practices, which we envision as characteristic for the evaluation of general artificial intelligence in the future.

Your Prompt Is My Command: On Assessing the Human-Centred Generality of Multimodal Models (Abstract Reprint)

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information