Structured Probabilistic Coding

Authors

  • Dou Hu Institute of Information Engineering, Chinese Academy of Sciences School of Cyber Security, University of Chinese Academy of Sciences
  • Lingwei Wei Institute of Information Engineering, Chinese Academy of Sciences
  • Yaxin Liu Institute of Information Engineering, Chinese Academy of Sciences School of Cyber Security, University of Chinese Academy of Sciences
  • Wei Zhou Institute of Information Engineering, Chinese Academy of Sciences
  • Songlin Hu Institute of Information Engineering, Chinese Academy of Sciences School of Cyber Security, University of Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v38i11.29142

Keywords:

ML: Representation Learning, NLP: Text Classification

Abstract

This paper presents a new supervised representation learning framework, namely structured probabilistic coding (SPC), to learn compact and informative representations from input related to the target task. SPC is an encoder-only probabilistic coding technology with a structured regularization from the target space. It can enhance the generalization ability of pre-trained language models for better language understanding. Specifically, our probabilistic coding simultaneously performs information encoding and task prediction in one module to more fully utilize the effective information from input data. It uses variational inference in the output space to reduce randomness and uncertainty. Besides, to better control the learning process of probabilistic representations, a structured regularization is proposed to promote uniformity across classes in the latent space. With the regularization term, SPC can preserve the Gaussian structure of the latent code and achieve better coverage of the hidden space with class uniformly. Experimental results on 12 natural language understanding tasks demonstrate that our SPC effectively improves the performance of pre-trained language models for classification and regression. Extensive experiments show that SPC can enhance the generalization capability, robustness to label noise, and clustering quality of output representations.

Published

2024-03-24

How to Cite

Hu, D., Wei, L., Liu, Y., Zhou, W., & Hu, S. (2024). Structured Probabilistic Coding. Proceedings of the AAAI Conference on Artificial Intelligence, 38(11), 12491-12501. https://doi.org/10.1609/aaai.v38i11.29142

Issue

Section

AAAI Technical Track on Machine Learning II