Wu, Zekun

5 publications

NeurIPSW 2024 From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs Navya Jain, Zekun Wu, Cristian Enrique Munoz Villalobos, Airlie Hilliard, Adriano Koshiyama, Emre Kazim, Philip Colin Treleaven
NeurIPSW 2024 HEARTS: A Holistic Framework for Explainable, Sustainable and Robust Text Stereotype Detection Theo King, Zekun Wu, Adriano Koshiyama, Emre Kazim, Philip Colin Treleaven
NeurIPSW 2024 HEARTS: A Holistic Framework for Explainable, Sustainable and Robust Text Stereotype Detection Theo King, Zekun Wu, Adriano Koshiyama, Emre Kazim, Philip Colin Treleaven
NeurIPSW 2024 THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models Mengfei Liang, Archish Arun, Zekun Wu, Cristian Enrique Munoz Villalobos, Jonathan Lutch, Emre Kazim, Adriano Koshiyama, Philip Colin Treleaven
NeurIPSW 2023 Towards Auditing Large Language Models: Improving Text-Based Stereotype Detection Zekun Wu, Sahan Bulathwela, Adriano Koshiyama