Multi-modal Dynamic Proxy Learning for Personalized Multiple Clustering

Authors

  • Jinfeng Xu The University of Hong Kong
  • Zheyu Chen Beijing Institute of Technology
  • Shuo Yang The University of Hong Kong
  • Jinze Li The University of Hong Kong
  • Ziyue Peng University of Macau
  • Zewei Liu The University of Hong Kong
  • Hewei Wang Carnegie Mellon University
  • Jiayi Zhang University of Nottingham
  • Edith C. H. Ngai The University of Hong Kong

DOI:

https://doi.org/10.1609/aaai.v40i32.39943

Abstract

Multiple clustering aims to discover diverse latent structures from different perspectives, yet existing methods generate exhaustive clusterings without discerning user interest, necessitating laborious manual screening. Current multi-modal solutions suffer from static semantic rigidity: predefined candidate words fail to adapt to dataset-specific concepts, and fixed fusion strategies ignore evolving feature interactions. To overcome these limitations, we propose Multi-DProxy, a novel multi-modal dynamic proxy learning framework that leverages cross-modal alignment through learnable textual proxies. Multi-DProxy introduces 1) gated cross-modal fusion that synthesizes discriminative joint representations by adaptively modeling feature interactions. 2) dual-constraint proxy optimization where user interest constraints enforce semantic consistency with domain concepts while concept constraints employ hard example mining to enhance cluster discrimination. 3) dynamic candidate management that refines textual proxies through iterative clustering feedback. Therefore, Multi-DProxy not only effectively captures a user's interest through proxies but also enables the identification of relevant clusterings with greater precision. Extensive experiments demonstrate state-of-the-art performance with significant improvements over existing methods across a broad set of multi-clustering benchmarks.

Downloads

Published

2026-03-14

How to Cite

Xu, J., Chen, Z., Yang, S., Li, J., Peng, Z., Liu, Z., … Ngai, E. C. H. (2026). Multi-modal Dynamic Proxy Learning for Personalized Multiple Clustering. Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), 27269–27277. https://doi.org/10.1609/aaai.v40i32.39943

Issue

Section

AAAI Technical Track on Machine Learning IX