Huang, Weikai

3 publications

ECCV 2024 M&m’s: A Benchmark to Evaluate Tool-Use for Multi-Step Multi-Modal Tasks Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna
NeurIPS 2024 Task Me Anything Jieyu Zhang, Weikai Huang, Zixian Ma, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna
NeurIPSW 2024 Taskverse: A Benchmark Generation Engine for Multi-Modal Language Model Jieyu Zhang, Weikai Huang, Zixian Ma, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna