Comparing Reconstruction- and Contrastive-based Models for Visual Task Planning

Constantinos Chamzas*, Martina Lippi*, Michael C. Welle*, Anastasia Varava, Lydia E. Kavraki, and Danica Kragic

Abstract: Learning state representations enables robotic planning directly from raw observations such as images. Several methods learn state representations by utilizing losses based on the reconstruction of the raw observations from a lower-dimensional latent space. The similarity between observations in the space of images is often assumed and used as a proxy for estimating similarity between the underlying states of the system. However, observations commonly contain task-irrelevant factors of variation which are nonetheless important for reconstruction, such as varying lighting and different camera viewpoints. In this work, we define relevant evaluation metrics and perform a thorough study of different loss functions for state representation learning. We show that models exploiting task priors, such as Siamese networks with a simple contrastive loss, outperform reconstruction-based representations in visual task planning in case of task-irrelevant factors of variations.

*Contributed equally and listed in alphabetical order

Paper preprint

Results

Box manipulation:

Representation evaluation metrics:

Representation t-SNE plots:

Shelf arrangment

Representation evaluation metrics:

Representation t-SNE plots:

Box stacking:

Representation evaluation metrics:

Representation t-SNE plots:

Code Repository

The project code can be found on the gitrepo:

Code Repository

Contact

  • Constantinos Chamzas; chamzas(at)rice.edu; Rice University, USA
  • Martina Lippi; martina.lippi(at)uniroma3.it; University of Roma Tre, Italy
  • Michael C. Welle; mwelle(at)kth.se; KTH Royal Institute of Technology, Sweden
  • Anastasia Varava; varava(at)kth.se; KTH Royal Institute of Technology, Sweden
  • Lydia E. Kavraki; kavraki(at)rice.edu; Rice University, USA
  • Danica Kragic; dani(at)kth.se; KTH Royal Institute of Technology, Sweden