On Model and Data Scaling for Skeleton-based Self-Supervised Gait Recognition

Authors

  • Adrian Cosma The National University of Science and Technology Politehnica Bucharest The Dalle Molle Institute for Artificial Intelligence Research
  • Andy Eduard Catruna The National University of Science and Technology Politehnica Bucharest
  • Emilian Radoi The National University of Science and Technology Politehnica Bucharest

DOI:

https://doi.org/10.1609/aaai.v40i5.37340

Abstract

Gait recognition from video streams is a challenging problem in computer vision biometrics due to the subtle differences between gaits and numerous confounding factors. Recent advancements in self-supervised pretraining have led to the development of robust gait recognition models that are invariant to walking covariates. While neural scaling laws have transformed model development in other domains by linking performance to data, model size, and compute, their applicability to gait remains unexplored. In this work, we conduct the first empirical study scaling on skeleton-based self-supervised gait recognition to quantify the effect of data quantity, model size and compute on downstream gait recognition performance. We pretrain multiple variants of GaitPT -- a transformer-based architecture -- on a dataset of 2.7 million walking sequences collected in the wild. We evaluate zero-shot performance across four benchmark datasets to derive scaling laws for data, model size, and compute. Our findings demonstrate predictable power-law improvements in performance with increased scale and confirm that data and compute scaling significantly influence downstream accuracy. We further isolate architectural contributions by comparing GaitPT with GaitFormer under controlled compute budgets. These results provide practical insights into resource allocation and performance estimation for real-world gait recognition systems.

Downloads

Published

2026-03-14

How to Cite

Cosma, A., Catruna, A. E., & Radoi, E. (2026). On Model and Data Scaling for Skeleton-based Self-Supervised Gait Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 40(5), 3434-3442. https://doi.org/10.1609/aaai.v40i5.37340

Issue

Section

AAAI Technical Track on Computer Vision II