I-athlon: Towards A Multidimensional Turing Test


  • Sam S. Adams IBM T. J. Watson Research Center
  • Guruduth Banavar IBM T. J. Watson Research Center
  • Murray Campbell IBM T. J. Watson Research Center




While the Turing test is a well-known method for evaluating machine intelligence, it has a number of drawbacks that make it problematic as a rigorous and practical test for assessing progress in general-purpose AI. For example, the Turing test is deception based, subjectively evaluated, and narrowly focused on language use. We suggest that a test would benefit from including the following requirements: focus on rational behavior, test several dimensions of intelligence, automate as much as possible, score as objectively as possible, and allow incremental progress to be measured. In this article we propose a methodology for designing a test that consists of a series of events, analogous to the Olympic Decathlon, which complies with these requirements. The approach, which we call the I-athlon, is intended to ultimately enable the community to evaluate progress towards machine intelligence in a practical and repeatable way.




How to Cite

Adams, S. S., Banavar, G., & Campbell, M. (2016). I-athlon: Towards A Multidimensional Turing Test. AI Magazine, 37(1), 78-84. https://doi.org/10.1609/aimag.v37i1.2643