SNS-Grasp: Semantic-guided Noise Scaling for Grasp Generation
DOI:
https://doi.org/10.1609/aaai.v40i11.37909
Abstract
While diffusion models show promise for intent-based grasp generation, their isotropic noise schedules struggle with joint-specific sensitivity and task-aware variability. This limitation leads to grasps with suboptimal semantic alignment or physical feasibility. To address this challenge, we propose Semantic-guided Noise Scaling for grasp generation (SNS-Grasp), a novel framework that integrates two key innovations. First, the Semantic-guided Noise Scaling Diffusion (SNS-Diff) module generates intent-aware grasps by replacing isotropic noise with anisotropic modulation, dynamically adapting to task semantics and joint-specific sensitivity. Specifically, SNS-Diff leverages a pretrained Intent Recognizer to extract task-aware confidence scores and joint-specific gradient sensitivities from the interaction context. These signals adjust the noise scaling during denoising, downweighting perturbations for semantically critical joints to ensure semantic alignment. Second, the Fine-grained Grasp Refinement (FGR) module establishes dynamic joint-vertex coupling through fine-grained hand-object spatial relationships, enabling iterative optimization of physically executable grasps. Extensive experiments on OakInk and GRAB demonstrate SNS-Grasp's superior performance in semantic accuracy and physical feasibility, with robust generalization to unseen objects.
Published
2026-03-14
How to Cite
Tang, Z., Zheng, Y., Zhong, Y., Li, H., Hao, Y., & Pun, C.-M. (2026). SNS-Grasp: Semantic-guided Noise Scaling for Grasp Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(11), 9484–9492. https://doi.org/10.1609/aaai.v40i11.37909
Section
AAAI Technical Track on Computer Vision VIII
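To make the core idea in the abstract concrete, the sketch below illustrates anisotropic, confidence-weighted noise in a single denoising step. This is a minimal toy example, not the authors' implementation: the function name, the per-joint data layout, and the scaling rule `sigma_j = base_sigma * (1 - confidence_j)` are all assumptions chosen for illustration, not details taken from the paper.

```python
import random

def anisotropic_noise_step(joints, base_sigma, confidence, rng=None):
    """Toy sketch (not the paper's method): perturb each joint with
    its own noise scale, down-weighting perturbations on joints whose
    task-aware confidence is high, so semantically critical joints
    stay close to their current pose during denoising.

    joints     : list of per-joint parameter lists, e.g. [[x, y], ...]
    base_sigma : scalar base noise level for this denoising step
    confidence : per-joint confidence scores in [0, 1]
    """
    rng = rng or random.Random(0)
    noisy = []
    for joint, c in zip(joints, confidence):
        # Hypothetical scaling rule: sigma_j = base_sigma * (1 - c).
        # A fully confident joint (c = 1) receives zero perturbation.
        sigma = base_sigma * (1.0 - c)
        noisy.append([x + rng.gauss(0.0, sigma) for x in joint])
    return noisy

# Usage: the intent-critical joint (confidence 1.0) is left untouched,
# while low-confidence joints are free to explore via larger noise.
joints = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
conf = [1.0, 0.5, 0.0]
out = anisotropic_noise_step(joints, base_sigma=0.1, confidence=conf)
```

In an actual diffusion sampler this per-joint scale would modulate the noise injected at every reverse step; the sketch isolates just that one modulation to show how confidence maps to perturbation magnitude.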