
Resources for the Vision Application

This application is based on iNaturalist, from which we extract subdatasets specifically for this course.

Resources for Session 1

Dataset

Main features:

  • 200 images
  • Of the 200 images, 100 are insects and the other 100 are plants.

Visualisation of a few examples

[Figure: Plants]

[Figure: Insects]

Latent Space

The 200 images have been projected into a latent space using the ViT-H/14 vision encoder from OpenCLIP, a deep learning model from this paper. We will go into the details of deep learning and feature extraction from course 4 onward.

For now, you can simply load the NumPy array containing all samples in the latent space from the file embeddings-cv-lab1.npz.
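As a minimal sketch of how the file might be opened: the names of the arrays stored inside the archive are not given here, so the snippet below lists whatever it finds instead of assuming a key. The helper `load_npz_arrays` is a hypothetical name introduced for illustration, and the path assumes the file sits in the current working directory.

```python
import os
import numpy as np


def load_npz_arrays(path):
    """Return a dict mapping each array name stored in the .npz archive to its array."""
    with np.load(path) as archive:
        # archive.files lists the names of the arrays saved in the archive.
        return {name: archive[name] for name in archive.files}


# Assumption: the file has been downloaded next to this script.
path = "embeddings-cv-lab1.npz"
if os.path.exists(path):
    arrays = load_npz_arrays(path)
    for name, array in arrays.items():
        # Inspect names, shapes and dtypes before using the data.
        print(name, array.shape, array.dtype)
```

Printing the shapes first is a cheap way to check which array holds the embeddings (one row per image would be expected, so 200 rows) before going further.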

Work to do

Compute, visualize and interpret the distance matrix, as explained on the Lab Session 1 main page.
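One possible way to compute such a matrix is sketched below. It assumes the Euclidean distance and an embedding array of shape (n_samples, n_features); the Lab Session 1 main page may ask for a different metric. The function name `pairwise_euclidean` and the random stand-in data are illustrative only and should be replaced by the real embeddings.

```python
import numpy as np


def pairwise_euclidean(X):
    """Return the (n, n) matrix of Euclidean distances between the rows of X."""
    sq_norms = np.sum(X**2, axis=1)
    # ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a.b, computed for all pairs at once.
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T
    # Round-off can produce tiny negative values; clip them before the square root.
    np.maximum(sq_dists, 0.0, out=sq_dists)
    np.fill_diagonal(sq_dists, 0.0)
    return np.sqrt(sq_dists)


# Stand-in data (assumption): 200 random vectors in place of the real embeddings.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 16))
D = pairwise_euclidean(X)
print(D.shape)
```

The resulting matrix can then be shown as a heat map, for instance with matplotlib's `imshow`. If the samples happen to be ordered by class (an assumption to verify on the data), two blocks of smaller distances along the diagonal would suggest that insects and plants are separated in the latent space.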