Zhong, Xungao and Luo, Jiaoguo and Zhong, Xunyu and Hu, Huosheng and Liu, Qiang (2025) SS-ARGNet: A Novel Cascaded Schema for Robots’ 7-DoF Grasping in Adjacent and Stacked Object Scenarios. IEEE Transactions on Instrumentation and Measurement, 74. p. 2509312. DOI https://doi.org/10.1109/tim.2025.3544720
Zhong, Xungao and Luo, Jiaoguo and Zhong, Xunyu and Hu, Huosheng and Liu, Qiang (2025) SS-ARGNet: A Novel Cascaded Schema for Robots’ 7-DoF Grasping in Adjacent and Stacked Object Scenarios. IEEE Transactions on Instrumentation and Measurement, 74. p. 2509312. DOI https://doi.org/10.1109/tim.2025.3544720
Zhong, Xungao and Luo, Jiaoguo and Zhong, Xunyu and Hu, Huosheng and Liu, Qiang (2025) SS-ARGNet: A Novel Cascaded Schema for Robots’ 7-DoF Grasping in Adjacent and Stacked Object Scenarios. IEEE Transactions on Instrumentation and Measurement, 74. p. 2509312. DOI https://doi.org/10.1109/tim.2025.3544720
Abstract
Multiple objects in tightly adjacent and stacked configurations pose significant challenges for robotic systems in achieving reliable grasping, as existing algorithms often struggle to distinguish stacked objects and are prone to causing disruptions to the original scene due to improper grasping postures and collisions. In order to address these challenges, we developed a category-agnostic segmentation and cascaded 7-degrees of freedom (DoF) pose prediction approach for adjacent and stacked objects grasping, using a single vision image. Specifically, a stacked segmentation network (SS-Net) was tailored based on transformer and region proposal modules to achieve robust mask prediction, thereby accurately localizing candidates within the scene. Simultaneously, the attention residual grasping network (ARG-Net) was proposed to estimate the 7-DoF pose of individual targets, employing a new collision-free strategy to avoid interference between the gripper and the candidates. The integrated SS-Net and ARG-Net (SS-ARGNet) schema significantly enhances robotic performance in practical applications, achieving grasp completion rates of 92.8% and 89.1% for adjacent scenarios, and 87.4% and 84.9% for stacked scenarios, for similar and unknown objects, respectively, with a grasp response time of less than 0.9 s.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | 7-degrees of freedom (DoF) pose prediction; cascaded schema; category-agnostic segmentation; robot grasping; stacked scenarios |
Subjects: | Z Bibliography. Library Science. Information Resources > ZR Rights Retention |
Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 16 Apr 2025 17:19 |
Last Modified: | 16 Apr 2025 17:25 |
URI: | http://repository.essex.ac.uk/id/eprint/40421 |
Available files
Filename: DOI-10.1109-TIM.2025.3544720.pdf