Where to Mask: Structure-Guided Masking for Graph Masked Autoencoders

Abstract

In large aperture imaging, the shallow depth of field (DoF) phenomenon requires capturing multiple images at different focal levels, allowing us to infer depth information using depth from focus (DFF) techniques. However, most previous works design convolutional neural networks from a time domain perspective, often leading to blurred fine details in depth estimation. In this work, we propose a frequency-aware deep DFF network (FAD) that couples multi-scale spatial domain local features with frequency domain global structural features. Our main innovations include two key points: First, we introduce a frequency domain feature extraction module that uses the Fourier transform to transfer latent focus features into the frequency domain. This module adaptively captures essential frequency information for focus changes through element-wise multiplication, enhancing fine details in depth results while preserving global structural integrity. Second, the time-frequency joint module of FAD improves the consistency of depth information in sparse texture regions and the continuity in transition areas from both local and global complementary perspectives. Comprehensive experiments demonstrate that our model achieves compelling generalization and state-of-the-art depth prediction across various datasets. Additionally, it can be quickly adapted to real-world applications as a pre-trained model.

Cite

Text

Liu et al. "Where to Mask: Structure-Guided Masking for Graph Masked Autoencoders." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/241

Markdown

[Liu et al. "Where to Mask: Structure-Guided Masking for Graph Masked Autoencoders." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/liu2024ijcai-mask/) doi:10.24963/ijcai.2024/241

BibTeX

@inproceedings{liu2024ijcai-mask,
  title     = {{Where to Mask: Structure-Guided Masking for Graph Masked Autoencoders}},
  author    = {Liu, Chuang and Wang, Yuyao and Zhan, Yibing and Ma, Xueqi and Tao, Dapeng and Wu, Jia and Hu, Wenbin},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {2180-2188},
  doi       = {10.24963/ijcai.2024/241},
  url       = {https://mlanthology.org/ijcai/2024/liu2024ijcai-mask/}
}