ML Anthology
Authors
Search
About
Petrova, Nora
3 publications
ICLR
2026
Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework
Nora Petrova
,
Andrew Gordon
,
Enzo Blindow
ICLRW
2025
Latent Adversarial Training Improves the Representation of Refusal
Alexandra Abbas
,
Nora Petrova
,
Hélios Lyons
,
Natalia Perez-Campanero
NeurIPSW
2024
Characterizing Stable Regions in the Residual Stream of LLMs
Jett Janiak
,
Jacek Karwowski
,
Chatrik Singh Mangat
,
Giorgi Giglemiani
,
Nora Petrova
,
Stefan Heimersheim