MindVote: When AI Meets the Wild West of Social Media Opinion

Authors

  • Xutao Mao Vanderbilt University
  • Ezra Xuanru Tao Vanderbilt University
  • Leyao Wang Yale University

DOI:

https://doi.org/10.1609/aaai.v40i38.40525

Abstract

Large Language Models (LLMs) are increasingly used as scalable tools for pilot testing, predicting public opinion distributions before deploying costly surveys. However, the prevailing paradigm for evaluating these models relies on traditional structured surveys—a methodology misaligned with the more realistic scenarios like social media where opinions are rich in digital contexts. By design, surveys strip away the social and cultural context that shapes public opinion, and LLM benchmarks built on this paradigm inherit these critical limitations. To bridge this gap, we introduce MindVote, the first benchmark for public opinion prediction grounded in authentic social media discourse. MindVote is constructed from 3,918 naturalistic polls sourced from Reddit and Weibo, spanning 23 topics and enriched with detailed annotations for platform and topical context. Using this benchmark, we conduct a comprehensive evaluation of 15 LLMs, revealing a critical "survey-based specialization pitfall" where models fine-tuned on traditional surveys underperform their general-purpose counterparts and demonstrating the necessity of context in social media. MindVote provides a robust, ecologically valid framework to move beyond survey-based evaluations and advance the development of social intelligent AI systems.

Published

2026-03-14

How to Cite

Mao, X., Tao, E. X., & Wang, L. (2026). MindVote: When AI Meets the Wild West of Social Media Opinion. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 32492–32500. https://doi.org/10.1609/aaai.v40i38.40525

Issue

Section

AAAI Technical Track on Natural Language Processing III