Table of Contents
Fetching ...

Making Sense of Touch: Unsupervised Shapelet Learning in Bag-of-words Sense

Zhicong Xian, Tabish Chaudhary, Jürgen Bock

TL;DR

This work tackles unsupervised learning of discriminative time-series shapelets for tactile robotic data by introducing NN-STNE, a neural architecture that embeds shapelet-based representations through a t-SNE hidden layer. The model processes time-series via sliding windows and NCC-based similarity, then maps distances to shapelets into a low-dimensional, interpretable feature space used for clustering, while preserving local structure with a Gaussian-kernel spectral objective and promoting diversity with a shapelet-weight regularization scheme. Evaluations on UCR subsets and a robot switch-on task show that NN-STNE improves clustering performance (on average about 16.7% over strong baselines like UDFS) when used as a pre-clustering feature learner, highlighting the practical value of non-linear, unsupervised shapelet learning for interpreting touch data. The approach offers a scalable, interpretable pathway for enhancing recognition and diagnosis in contact-rich robotic applications, bridging time-series shape features with unsupervised, high-level clustering insights.

Abstract

This paper introduces NN-STNE, a neural network using t-distributed stochastic neighbor embedding (t-SNE) as a hidden layer to reduce input dimensions by mapping long time-series data into shapelet membership probabilities. A Gaussian kernel-based mean square error preserves local data structure, while K-means initializes shapelet candidates due to the non-convex optimization challenge. Unlike existing methods, our approach uses t-SNE to address crowding in low-dimensional space and applies L1-norm regularization to optimize shapelet length. Evaluations on the UCR dataset and an electrical component manipulation task, like switching on, demonstrate improved clustering accuracy over state-of-the-art feature-learning methods in robotics.

Making Sense of Touch: Unsupervised Shapelet Learning in Bag-of-words Sense

TL;DR

This work tackles unsupervised learning of discriminative time-series shapelets for tactile robotic data by introducing NN-STNE, a neural architecture that embeds shapelet-based representations through a t-SNE hidden layer. The model processes time-series via sliding windows and NCC-based similarity, then maps distances to shapelets into a low-dimensional, interpretable feature space used for clustering, while preserving local structure with a Gaussian-kernel spectral objective and promoting diversity with a shapelet-weight regularization scheme. Evaluations on UCR subsets and a robot switch-on task show that NN-STNE improves clustering performance (on average about 16.7% over strong baselines like UDFS) when used as a pre-clustering feature learner, highlighting the practical value of non-linear, unsupervised shapelet learning for interpreting touch data. The approach offers a scalable, interpretable pathway for enhancing recognition and diagnosis in contact-rich robotic applications, bridging time-series shape features with unsupervised, high-level clustering insights.

Abstract

This paper introduces NN-STNE, a neural network using t-distributed stochastic neighbor embedding (t-SNE) as a hidden layer to reduce input dimensions by mapping long time-series data into shapelet membership probabilities. A Gaussian kernel-based mean square error preserves local data structure, while K-means initializes shapelet candidates due to the non-convex optimization challenge. Unlike existing methods, our approach uses t-SNE to address crowding in low-dimensional space and applies L1-norm regularization to optimize shapelet length. Evaluations on the UCR dataset and an electrical component manipulation task, like switching on, demonstrate improved clustering accuracy over state-of-the-art feature-learning methods in robotics.

Paper Structure

This paper contains 21 sections, 8 equations, 3 figures, 3 tables.

Figures (3)

  • Figure 1: A robot application: switch pushing task (left) and examples of identifying discriminative time series sub-sequences that distinguish a time series from others recorded in this application (right)
  • Figure 2: Overview of the network architecture
  • Figure 3: Example of a figure caption.