Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion

Authors

  • Ryoma Yataka Information Technology R&D Center, Mitsubishi Electric Corporation
  • Pu (Perry) Wang Mitsubishi Electric Research Laboratories
  • Petros Boufounos Mitsubishi Electric Research Laboratories
  • Ryuhei Takahashi Information Technology R&D Center, Mitsubishi Electric Corporation

DOI:

https://doi.org/10.1609/aaai.v40i22.38939

Abstract

Multi-view indoor radar perception has drawn attention due to its cost-effectiveness and low privacy risks. Existing methods often rely on implicit cross-view radar feature association, such as proposal pairing in RFMask or query-to-feature cross-attention in RETR, which can lead to ambiguous feature matches and degraded detection in complex indoor scenes. To address these limitations, we propose REXO (multi-view Radar object dEtection with 3D bounding boX diffusiOn), which lifts the 2D bounding box (BBox) diffusion process of DiffusionDet into the 3D radar space. REXO utilizes these noisy 3D BBoxes to guide an explicit cross-view radar feature association, enhancing the cross-view radar-conditioned denoising process. By accounting for prior knowledge that the person is in contact with the ground, REXO reduces the number of diffusion parameters by determining them from this prior. Evaluated on two open indoor radar datasets, our approach surpasses state-of-the-art methods by a margin of +4.22 AP on the HIBER dataset and +11.02 AP on the MMVR dataset. Our implementation is available at https://github.com/merlresearch/radar-bbox-diffusion.

Downloads

Published

2026-03-14

How to Cite

Yataka, R., Wang, P. (Perry), Boufounos, P., & Takahashi, R. (2026). Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion. Proceedings of the AAAI Conference on Artificial Intelligence, 40(22), 18710-18718. https://doi.org/10.1609/aaai.v40i22.38939

Issue

Section

AAAI Technical Track on Intelligent Robotics