Intermediate Domain Alignment and Morphology Analogy for Patent-Product Image Retrieval

Abstract

Recent advances in artificial intelligence have significantly impacted image retrieval tasks, yet Patent-Product Image Retrieval (PPIR) has received limited attention. PPIR, which retrieves patent images based on product images to identify potential infringements, presents unique challenges: (1) both product and patent images often contain numerous categories of artificial objects, but models pre-trained on standard datasets exhibit limited discriminative power to recognize some of those unseen objects; and (2) the significant domain gap between binary patent line drawings and colorful RGB product images further complicates similarity comparisons for product-patent pairs. To address these challenges, we formulate it as an open-set image retrieval task and introduce a comprehensive Patent-Product Image Retrieval Dataset (PPIRD) including a test set with 439 product-patent pairs, a retrieval pool of 727,921 patents, and an unlabeled pre-training set of 3,799,695 images. We further propose a novel Intermediate Domain Alignment and Morphology Analogy (IDAMA) strategy. IDAMA maps both image types to an intermediate sketch domain using edge detection to minimize the domain discrepancy, and employs a Morphology Analogy Filter to select discriminative patent images based on visual features via analogical reasoning. Extensive experiments on PPIRD demonstrate that IDAMA significantly outperforms baseline methods (+7.58 mAR) and offers valuable insights into domain mapping and representation learning for PPIR. (The PPIRD dataset is available at: \href{https://loslorien.github.io/idama-project/}https://loslorien.github.io/idama-project/)

Cite

Text

Gong et al. "Intermediate Domain Alignment and Morphology Analogy for Patent-Product Image Retrieval." Advances in Neural Information Processing Systems, 2025.

Markdown

[Gong et al. "Intermediate Domain Alignment and Morphology Analogy for Patent-Product Image Retrieval." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/gong2025neurips-intermediate/)

BibTeX

@inproceedings{gong2025neurips-intermediate,
  title     = {{Intermediate Domain Alignment and Morphology Analogy for Patent-Product Image Retrieval}},
  author    = {Gong, Haifan and Zhang, Xuanye and Zhang, Ruifei and Su, Yun and Li, Zhuo and Du, Yuhao and Gao, Anningzhe and Wan, Xiang and Li, Haofeng},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/gong2025neurips-intermediate/}
}