ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining

Cite

Text

Peng et al. "ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I5.28245

Markdown

[Peng et al. "ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/peng2024aaai-viteraser/) doi:10.1609/AAAI.V38I5.28245

BibTeX

@inproceedings{peng2024aaai-viteraser,
  title     = {{ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining}},
  author    = {Peng, Dezhi and Liu, Chongyu and Liu, Yuliang and Jin, Lianwen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {4468-4477},
  doi       = {10.1609/AAAI.V38I5.28245},
  url       = {https://mlanthology.org/aaai/2024/peng2024aaai-viteraser/}
}