FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection

Cite

Text

Zhang et al. "FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I15.29612

Markdown

[Zhang et al. "FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zhang2024aaai-fm/) doi:10.1609/AAAI.V38I15.29612

BibTeX

@inproceedings{zhang2024aaai-fm,
  title     = {{FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection}},
  author    = {Zhang, Dongmei and Li, Chang and Zhang, Renrui and Xie, Shenghao and Xue, Wei and Xie, Xiaodong and Zhang, Shanghang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {16723-16731},
  doi       = {10.1609/AAAI.V38I15.29612},
  url       = {https://mlanthology.org/aaai/2024/zhang2024aaai-fm/}
}