Enhancing Multi-Tip Artifact Detection in STM Images Using Fourier Transform and Vision Transformers
Abstract
We address the issue of multi-tip artifacts in Scanning Tunneling Microscopy (STM) images by applying the fast Fourier transform (FFT) as a feature engineering method. We fine-tune various neural network architectures using a synthetic dataset, including Vision Transformers (ViT). The FFT-based preprocessing significantly improves the performance of ViT models compared to using only the grayscale channel. Ablation experiments highlight the optimal conditions for synthetic dataset generation. Unlike traditional methods that are challenging to implement for large datasets and used offline, our method enables on-the-fly classification at scale. Our findings demonstrate the efficacy of combining the Fourier transform with deep learning for enhanced artifact detection in STM images, contributing to more accurate analysis in material science research.
Cite
Text
Rodani et al. "Enhancing Multi-Tip Artifact Detection in STM Images Using Fourier Transform and Vision Transformers." ICML 2024 Workshops: ML4LMS, 2024.Markdown
[Rodani et al. "Enhancing Multi-Tip Artifact Detection in STM Images Using Fourier Transform and Vision Transformers." ICML 2024 Workshops: ML4LMS, 2024.](https://mlanthology.org/icmlw/2024/rodani2024icmlw-enhancing/)BibTeX
@inproceedings{rodani2024icmlw-enhancing,
title = {{Enhancing Multi-Tip Artifact Detection in STM Images Using Fourier Transform and Vision Transformers}},
author = {Rodani, Tommaso and Ansuini, Alessio and Cazzaniga, Alberto},
booktitle = {ICML 2024 Workshops: ML4LMS},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/rodani2024icmlw-enhancing/}
}