Spontaneous Yet Predictable: Shapelet-Driven, Channel-Aware Intention Decoding from Multi-Region ECoG

Authors

  • Keren Cao National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Xi’an Jiaotong University Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University
  • Yuhang Tian National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Xi’an Jiaotong University
  • Kaizhong Zheng National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Xi’an Jiaotong University Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University
  • Wei Xi National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Xi’an Jiaotong University
  • Xinjian Li Department of Neurology of the Second Affiliated Hospital and Interdisciplinary Institute of Neuroscience and Technology, School of Medicine, Zhejiang University
  • Liangjun Chen National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, Xi’an Jiaotong University Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v40i21.38791

Abstract

Proactive intention decoding remains a critical yet underexplored challenge in brain–machine interfaces (BMIs), especially under naturalistic, self-initiated behavior. Existing systems rely on reactive decoding of motor cortex signals, resulting in substantial latency. To address this, we leverage the common marmoset’s spontaneous vocalizations and develop a high-resolution, dual-region ECoG recording paradigm targeting the prefrontal and auditory cortices and a neural decoding framework that integrates shapelet-based temporal encoding, position-aware attention, frequency-aware channel masking, contrastive clustering and a minimum error entropy-based robust loss. Our approach achieves 91.9% accuracy up to 200 ms before vocal onset—substantially outperforming 13 competitive baselines. Our model also uncovers a functional decoupling between auditory and prefrontal regions. Furthermore, joint modeling in time and frequency domains reveals novel preparatory neural signatures preceding volitional vocal output. Together, our findings bridge the gap between foundational neuroscience and applied BMI engineering, and establish a generalizable framework for intention decoding from ecologically valid, asynchronous behaviors.

Downloads

Published

2026-03-14

How to Cite

Cao, K., Tian, Y., Zheng, K., Xi, W., Li, X., & Chen, L. (2026). Spontaneous Yet Predictable: Shapelet-Driven, Channel-Aware Intention Decoding from Multi-Region ECoG. Proceedings of the AAAI Conference on Artificial Intelligence, 40(21), 17384–17392. https://doi.org/10.1609/aaai.v40i21.38791

Issue

Section

AAAI Technical Track on Humans and AI