Simple Kinesthetic Haptics for Object Recognition

Avishai Sintov; Inbar Ben-David

Simple Kinesthetic Haptics for Object Recognition

Avishai Sintov, Inbar Ben-David

TL;DR

This work tackles object recognition without tactile sensors by leveraging kinesthetic glances and a frame-invariant grasp representation learned from CAD models. It introduces the injective map $\Phi_n$ that projects a grasp to a $w$-dimensional vector $\mathbf{q}$ and two inference schemes, Iterative Classification (IC) and Sequential Bayesian Update (BC), to accumulate evidence across multiple grasps. A scale-invariant grasp representation is developed via normalization by $A_v$ and extended to $z$-finger grasps to increase information content. Experiments on KIT and GraspNet datasets, plus real and simulated grippers on YCB objects, show high recognition accuracy, scalability across sizes, and practical data-efficiency for hand-agnostic robotic systems.

Abstract

Object recognition is an essential capability when performing various tasks. Humans naturally use either or both visual and tactile perception to extract object class and properties. Typical approaches for robots, however, require complex visual systems or multiple high-density tactile sensors which can be highly expensive. In addition, they usually require actual collection of a large dataset from real objects through direct interaction. In this paper, we propose a kinesthetic-based object recognition method that can be performed with any multi-fingered robotic hand in which the kinematics is known. The method does not require tactile sensors and is based on observing grasps of the objects. We utilize a unique and frame invariant parameterization of grasps to learn instances of object shapes. To train a classifier, training data is generated rapidly and solely in a computational process without interaction with real objects. We then propose and compare between two iterative algorithms that can integrate any trained classifier. The classifiers and algorithms are independent of any particular robot hand and, therefore, can be exerted on various ones. We show in experiments, that with few grasps, the algorithms acquire accurate classification. Furthermore, we show that the object recognition approach is scalable to objects of various sizes. Similarly, a global classifier is trained to identify general geometries (e.g., an ellipsoid or a box) rather than particular ones and demonstrated on a large set of objects. Full scale experiments and analysis are provided to show the performance of the method.

Simple Kinesthetic Haptics for Object Recognition

TL;DR

This work tackles object recognition without tactile sensors by leveraging kinesthetic glances and a frame-invariant grasp representation learned from CAD models. It introduces the injective map

that projects a grasp to a

-dimensional vector

and two inference schemes, Iterative Classification (IC) and Sequential Bayesian Update (BC), to accumulate evidence across multiple grasps. A scale-invariant grasp representation is developed via normalization by

and extended to

-finger grasps to increase information content. Experiments on KIT and GraspNet datasets, plus real and simulated grippers on YCB objects, show high recognition accuracy, scalability across sizes, and practical data-efficiency for hand-agnostic robotic systems.

Abstract

Paper Structure (20 sections, 3 theorems, 26 equations, 21 figures, 8 tables, 4 algorithms)

This paper contains 20 sections, 3 theorems, 26 equations, 21 figures, 8 tables, 4 algorithms.

Introduction
Related work
Object Learning through tactile perception
Object Recognition
Grasp & Object Parameterization
Grasp representation
Object Recognition
Iterative classification
Sequential Bayesian Update
Scale invariant grasp representation
Embedded $z$-finger grasps
Experimental Results and Analysis
Data collection and model training
Object Recognition
$z$-finger grasps
...and 5 more sections

Key Result

Theorem 1

Let object $\mathcal{O}$ be described relative to coordinate frame $\mathcal{C}_i$, and let coordinate frame $\mathcal{C}_j$ be defined such that $\mathbf{c}_i=R\mathbf{c}_j+\mathbf{d}$ for any $\mathbf{c}_i\in\mathcal{C}_i$ and $\mathbf{c}_j\in\mathcal{C}_j$, and for some $R\in SO(3)$ and $\mathbf{ with $\mathcal{G}_k^{'}=\{R\mathcal{P}_k+\mathbf{d},R\mathcal{N}_k\}$.

Figures (21)

Figure 1: Robotic haptic glances to identify (left) an occluded object in a cabinet and (right) on a table.
Figure 2: Illustration of the (top) training and (bottom) evaluation processes. CAD models of the objects are used to generate labeled data for training a classifier. The classifier, trained solely over CAD models and independent of any particular robot hand, can now be used to classify real object. Recognition of real objects is done by random sampling grasps using a robotic hand in an iterative classification process - Iterative Classification (IC) or Bayesian Classification (BC).
Figure 3: Example of a 4-vertex polygon describing a 4-finger ($n=4$) grasp of object $\mathcal{O}$ with a vector of nine parameters.
Figure 4: Example of a 4-finger grasp of a box. A 4-vertex polyhedron formed by four contact points is defined by six parameters. The contact normals are marked in red while their eight parameterization angles are omitted for simplicity.
Figure 5: Eight objects from the KIT object models database used in the experiments.
...and 16 more figures

Theorems & Definitions (11)

Theorem 1
proof
Definition 1
Theorem 2
proof
Definition 2
Definition 3
Theorem 3
proof
proof
...and 1 more

Simple Kinesthetic Haptics for Object Recognition

TL;DR

Abstract

Simple Kinesthetic Haptics for Object Recognition

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (21)

Theorems & Definitions (11)