PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures

Cite

Text

Shukla et al. "PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I19.34257

Markdown

[Shukla et al. "PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/shukla2025aaai-patentlmm/) doi:10.1609/AAAI.V39I19.34257

BibTeX

@inproceedings{shukla2025aaai-patentlmm,
  title     = {{PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures}},
  author    = {Shukla, Shreya and Sharma, Nakul and Gupta, Manish and Mishra, Anand},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {20488--20496},
  doi       = {10.1609/AAAI.V39I19.34257},
  url       = {https://mlanthology.org/aaai/2025/shukla2025aaai-patentlmm/}
}