FAVOR: Full-Body AR-Driven Virtual Object Rearrangement Guided by Instruction Text

Authors

  • Kailin Li Shanghai Jiao Tong University
  • Lixin Yang Shanghai Jiao Tong University
  • Zenan Lin South China University of Technology
  • Jian Xu XREAL
  • Xinyu Zhan Shanghai Jiao Tong University
  • Yifei Zhao Shanghai Jiao Tong University
  • Pengxiang Zhu Shanghai Jiao Tong University
  • Wenxiong Kang South China University of Technology
  • Kejian Wu XREAL
  • Cewu Lu Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v38i4.28097

Keywords:

CV: Biometrics, Face, Gesture & Pose, CV: 3D Computer Vision, CV: Language and Vision, HAI: Interaction Techniques and Devices

Abstract

Rearrangement operations form the crux of interactions between humans and their environment. The ability to generate natural, fluid sequences of this operation is of essential value in AR/VR and CG. Bridging a gap in the field, our study introduces FAVOR: a novel dataset for Full-body AR-driven Virtual Object Rearrangement that uniquely employs motion capture systems and AR eyeglasses. Comprising 3k diverse motion rearrangement sequences and 7.17 million interaction data frames, this dataset breaks new ground in research data. We also present a pipeline FAVORITE for producing digital human rearrangement motion sequences guided by instructions. Experimental results, both qualitative and quantitative, suggest that this dataset and pipeline deliver high-quality motion sequences. Our dataset, code, and appendix are available at https://kailinli.github.io/FAVOR.

Published

2024-03-24

How to Cite

Li, K., Yang, L., Lin, Z., Xu, J., Zhan, X., Zhao, Y., … Lu, C. (2024). FAVOR: Full-Body AR-Driven Virtual Object Rearrangement Guided by Instruction Text. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 3136–3144. https://doi.org/10.1609/aaai.v38i4.28097

Issue

Section

AAAI Technical Track on Computer Vision III