Learning to Count via Unbalanced Optimal Transport


  • Zhiheng Ma Xi'an Jiaotong University
  • Xing Wei Xi'an Jiaotong University
  • Xiaopeng Hong Xi'an Jiaotong University; Peng Cheng Laboratory
  • Hui Lin Xi'an Jiaotong University
  • Yunfeng Qiu Xi'an Jiaotong University
  • Yihong Gong Xi'an Jiaotong University




Counting dense crowds through computer vision technology has attracted widespread attention. Most crowd counting datasets use point annotations. In this paper, we formulate crowd counting as a measure regression problem that minimizes the distance between two measures with different supports and unequal total mass. Specifically, we adopt the unbalanced optimal transport (UOT) distance, which remains stable under spatial perturbations, to quantify the discrepancy between predicted density maps and point annotations. We introduce an efficient optimization algorithm based on the regularized semi-dual formulation of UOT, which alternately learns the optimal transport plan and optimizes the density regressor. Quantitative and qualitative results show that our method achieves state-of-the-art counting and localization performance.
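
To make the idea of comparing a predicted density map with point annotations concrete, the following is a minimal sketch of entropy-regularized unbalanced optimal transport on a toy one-dimensional example. It uses generalized Sinkhorn scaling with KL-penalized marginals, which is a standard technique swapped in for illustration; it is not the regularized semi-dual algorithm described in the paper. The function name uot_sinkhorn, the parameters eps and tau, and all toy values are assumptions introduced here.

```python
import math

def uot_sinkhorn(a, b, cost, eps=0.05, tau=1.0, n_iter=200):
    """Illustrative entropic UOT solver (generalized Sinkhorn scaling).

    a:    masses of the predicted density, one per pixel (length n)
    b:    masses of the point annotations (length m)
    cost: n x m nested list of transport costs
    eps:  entropic regularization strength (assumed value)
    tau:  weight of the KL penalty on the marginals (assumed value)
    Returns the n x m transport plan as a nested list.
    """
    n, m = len(a), len(b)
    # Gibbs kernel derived from the cost matrix.
    K = [[math.exp(-cost[i][j] / eps) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    # Exponent < 1 softens the marginal constraints (balanced OT uses 1).
    power = tau / (tau + eps)
    for _ in range(n_iter):
        for i in range(n):
            kv = max(sum(K[i][j] * v[j] for j in range(m)), 1e-300)
            u[i] = (a[i] / kv) ** power
        for j in range(m):
            ktu = max(sum(K[i][j] * u[i] for i in range(n)), 1e-300)
            v[j] = (b[j] / ktu) ** power
    return [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]

# Toy setup: five pixels on [0, 1], two annotated heads (hypothetical values).
pixel_pos = [i / 4 for i in range(5)]
head_pos = [0.25, 1.0]
density = [0.0, 0.6, 0.5, 0.0, 0.3]   # predicted density, total mass 1.4
heads = [1.0, 1.0]                    # one unit of mass per annotation

# Squared Euclidean distance between each pixel and each annotation.
cost = [[(p - h) ** 2 for h in head_pos] for p in pixel_pos]

plan = uot_sinkhorn(density, heads, cost)
total_mass = sum(sum(row) for row in plan)
print("transported mass:", total_mass)
```

Because the marginal constraints are only penalized rather than enforced, the total mass of the plan need not equal either the predicted count (1.4) or the annotated count (2). In a training loop, a loss built from such a plan would be differentiated with respect to the predicted density; that step is omitted from this sketch.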




How to Cite

Ma, Z., Wei, X., Hong, X., Lin, H., Qiu, Y., & Gong, Y. (2021). Learning to Count via Unbalanced Optimal Transport. Proceedings of the AAAI Conference on Artificial Intelligence, 35(3), 2319-2327. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/16332



AAAI Technical Track on Computer Vision II