Haddou, Mohammed

1 publication

NeurIPS 2025. Neither Valid nor Reliable? Investigating the Use of LLMs as Judges. Khaoula Chehbouni, Mohammed Haddou, Jackie CK Cheung, Golnoosh Farnadi.