FLAG-4D: Flow-Guided Local-Global Dual-Deformation Model for 4D Reconstruction

Authors

  • Guan Yuan Tan Monash University, Malaysia
  • Ngoc Tuan Vu Monash University, Malaysia
  • Arghya Pal Monash University, Malaysia
  • Sailaja Rajanala Monash University, Malaysia
  • Raphael C W Phan Monash University, Malaysia
  • Mettu Srinivas National Institute of Technology Warangal, India
  • Chee-Ming Ting Monash University, Malaysia

DOI:

https://doi.org/10.1609/aaai.v40i11.37889

Abstract

We introduce FLAG-4D, a novel framework for generating novel views of dynamic scenes by reconstructing how 3D Gaussian primitives evolve through space and time. Existing methods typically rely on a single Multilayer Perceptron(MLP) to model temporal deformations, and they often struggle to capture complex point motions and fine-grained dynamic details consistently over time, especially from sparse input views. Our approach, FLAG-4D overcomes this by employing a dual-deformation network that dynamically warps a canonical set of 3D Gaussians over time into new positions and anisotropic shapes. This dual-deformation network consists of an Instantaneous Deformation Network (IDN) for modeling fine-grained, local deformations, and Global Motion Network (GMN) for capturing long-range dynamics, refined via mutual learning. To ensure these deformations are both accurate and temporally smooth, FLAG-4D incorporates dense motion features from a pretrained optical flow backbone. We fuse these motion cues from adjacent timeframes and use a deformation-guided attention mechanism to align this flow information with the current state of each evolving 3D Gaussian. Extensive experiments demonstrate that FLAG-4D achieves higher-fidelity and more temporally coherent reconstructions with finer detail preservation than state-of-the-art methods.

Downloads

Published

2026-03-14

How to Cite

Tan, G. Y., Tuan Vu, N., Pal, A., Rajanala, S., Phan, R. C. W., Srinivas, M., & Ting, C.-M. (2026). FLAG-4D: Flow-Guided Local-Global Dual-Deformation Model for 4D Reconstruction. Proceedings of the AAAI Conference on Artificial Intelligence, 40(11), 9305–9313. https://doi.org/10.1609/aaai.v40i11.37889

Issue

Section

AAAI Technical Track on Computer Vision VIII