This project is aimed to implement an image similarity estimation model using deep learning that will be able to generalize the difference between two given images and output a similarity metric.
This similarity metric could be any distance measure. For the sake of simplicity and also since it has been effective in different research papers, we have chosen Euclidean distance as our distance metric as it is also bergman divergence.
We have used the dataset from MVTec AD which contains good and anomaly samples from 15 classes and trained the model using Siamese Neural Networks. We were able to do that by pairing images from same and different classes such that the model can learn the similarity metric.
We have trained the model for 100 epochs on a NVIDIA 3060Ti GPU which took around 3.5 hours. During the test time, we have compared images from same class and as well as images from other class and obtained their distance metric, which helped us analyze the performance of our model.
- Clone the repo using the following command:
git clone https://github.com/Pathi-rao/Project_for_Numerical_methods_for_algorithmic_systems_and_NeuralNetworks
-
Download the data from the provided link in data folder and extract it. Also, please make sure to update the file paths if necessary.
-
Create a virtual environment with Python(For this step I will assume that you are able to create a virtual environment with
virtualenv
, but in any case you can check an example here.)
- Activate the virtual environment and
cd
into the project directory
- Install requirements using
pip
:
pip install -r requirements.txt
- Run
train.py
to start training the model and usetesting_image_pair.py
to do inference on any two images
-
Paul Bergmann, Kilian Batzner, Michael Fauser, David Sattlegger, Carsten Steger: The MVTec Anomaly Detection Dataset: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection; in: International Journal of Computer Vision, January 2021 pdf
-
Paul Bergmann, Michael Fauser, David Sattlegger, Carsten Steger: MVTec AD – A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection; in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2019 pdf
-
Siamese Neural Networks for One Shot Image Recognition pdf