ViTa-Zero: Zero-shot Visuotactile Object 6D Pose Estimation

Li, Hongyu; Akl, James; Sridhar, Srinath; Brady, Tye; Padir, Taskin

arXiv:2504.13179 (cs)

[Submitted on 17 Apr 2025]


Abstract: Object 6D pose estimation is a critical challenge in robotics, particularly for manipulation tasks. While prior research combining visual and tactile (visuotactile) information has shown promise, these approaches often struggle with generalization due to the limited availability of visuotactile data. In this paper, we introduce ViTa-Zero, a zero-shot visuotactile pose estimation framework. Our key innovation lies in leveraging a visual model as its backbone and performing feasibility checking and test-time optimization based on physical constraints derived from tactile and proprioceptive observations. Specifically, we model the gripper-object interaction as a spring-mass system, where tactile sensors induce attractive forces, and proprioception generates repulsive forces. We validate our framework through experiments on a real-world robot setup, demonstrating its effectiveness across representative visual backbones and manipulation scenarios, including grasping, object picking, and bimanual handover. Compared to the visual models, our approach overcomes some drastic failure modes while tracking the in-hand object pose. In our experiments, our approach shows an average increase of 55% in AUC of ADD-S and 60% in ADD, along with an 80% lower position error compared to FoundationPose.
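For readers unfamiliar with the metrics quoted in the abstract, the following is a minimal sketch of ADD and ADD-S as they are commonly defined in the pose estimation literature. It is not taken from the paper: the function names, the nested-list rotation format, and the toy two-point model are illustrative assumptions. The AUC figures reported above would additionally require integrating accuracy over a range of distance thresholds, which is not shown here.

```python
import math

def transform(points, R, t):
    """Apply a 3x3 rotation R (nested lists) and translation t to each 3D point."""
    return [
        tuple(sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3))
        for p in points
    ]

def add_metric(points, R_gt, t_gt, R_est, t_est):
    """ADD: mean distance between corresponding model points under the
    ground-truth pose and the estimated pose."""
    gt = transform(points, R_gt, t_gt)
    est = transform(points, R_est, t_est)
    return sum(math.dist(a, b) for a, b in zip(gt, est)) / len(points)

def add_s_metric(points, R_gt, t_gt, R_est, t_est):
    """ADD-S: mean distance from each ground-truth point to its closest
    estimated point, which tolerates object symmetries."""
    gt = transform(points, R_gt, t_gt)
    est = transform(points, R_est, t_est)
    return sum(min(math.dist(a, b) for b in est) for a in gt) / len(points)

# Hypothetical toy model: two points that are symmetric about the z-axis.
model = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0)]
identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
rot_z_180 = [[-1, 0, 0], [0, -1, 0], [0, 0, 1]]
origin = (0.0, 0.0, 0.0)

# A 180-degree flip swaps the two points: ADD penalizes it, ADD-S does not.
print(add_metric(model, identity, origin, rot_z_180, origin))
print(add_s_metric(model, identity, origin, rot_z_180, origin))
```

The toy case illustrates why both numbers are usually reported together: ADD is sensitive to any pose error, while ADD-S ignores errors that map the object onto itself.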

Comments:	Accepted by ICRA 2025
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.13179 [cs.RO]
	(or arXiv:2504.13179v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2504.13179

Submission history

From: Hongyu Li
[v1] Thu, 17 Apr 2025 17:59:56 UTC (5,108 KB)
