Adversarial Language Games for Advanced Natural Language Intelligence
Keywords:Discourse, Pragmatics & Argument Mining, Conversational AI/Dialog Systems, Adversarial Attacks & Robustness
AbstractWe study the problem of adversarial language games, in which multiple agents with conflicting goals compete with each other via natural language interactions. While adversarial language games are ubiquitous in human activities, little attention has been devoted to this field in natural language processing. In this work, we propose a challenging adversarial language game called Adversarial Taboo as an example, in which an attacker and a defender compete around a target word. The attacker is tasked with inducing the defender to utter the target word invisible to the defender, while the defender is tasked with detecting the target word before being induced by the attacker. In Adversarial Taboo, a successful attacker and defender need to hide or infer the intention, and induce or defend during conversations. This requires several advanced language abilities, such as adversarial pragmatic reasoning and goal-oriented language interactions in open domain, which will facilitate many downstream NLP tasks. To instantiate the game, we create a game environment and a competition platform. Comprehensive experiments on several baseline attack and defense strategies show promising and interesting results, based on which we discuss some directions for future research.
How to Cite
Yao, Y., Zhong, H., Zhang, Z., Han, X., Wang, X., Zhang, K., Xiao, C., Zeng, G., Liu, Z., & Sun, M. (2021). Adversarial Language Games for Advanced Natural Language Intelligence. Proceedings of the AAAI Conference on Artificial Intelligence, 35(16), 14248-14256. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17676
AAAI Technical Track on Speech and Natural Language Processing III