Detecting Anomalies in Textured Images Using Modified Transformer Masked Autoencoder
Dini, Afshin; Rahtu, Esa (2024)
Avaa tiedosto
Lataukset:
Dini, Afshin
Rahtu, Esa
2024
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202405206042
https://urn.fi/URN:NBN:fi:tuni-202405206042
Kuvaus
Peer reviewed
Tiivistelmä
We present a new method for detecting and locating anomalies in textured-type images using transformer-based autoencoders. In this approach, a rectangular patch of an image is masked by setting its value to gray and then fetched into a pre-trained autoencoder with several blocks of transformer encoders and decoders in order to reconstruct the unknown part. It is shown that the pre-trained model is not able to reconstruct the defective parts properly when they are inside the masked patch. In this regard, the combination of the Structural Similarity Index Measure and absolute error between the reconstructed image and the original one can be used to define a new anomaly map to find and locate anomalies. In the experiment with the textured images of the MVTec dataset, we discover that not only can this approach find anomalous samples properly, but also the anomaly map itself can specify the exact locations of defects correctly at the same time. Moreover, not only is our method computatio nally efficient, as it utilizes a pre-trained model and does not require any training, but also it has a better performance compared to previous autoencoders and other reconstruction-based methods. Due to these reasons, one can use this method as a base approach to find and locate irregularities in real-world applications.
Kokoelmat
- TUNICRIS-julkaisut [24611]