ML Anthology
Yang, Wannan
4 publications
ICLR
2026
Hallucination Reduction with CASAL: Contrastive Activation Steering for Amortized Learning
Wannan Yang, Xinchi Qiu, Lei Yu, Yuchen Zhang, Aobo Yang, Narine Kokhlikyan, Nicola Cancedda, Diego Garcia-Olano
NeurIPSW
2024
Interpretability of LLM Deception: Universal Motif
Wannan Yang, Gyorgy Buzsaki
NeurIPSW
2023
Changes in the Geometry of Hippocampal Representations Across Brain States
Wannan Yang, Chen Sun, Gyorgy Buzsaki
NeurIPS
2023
Contrastive Retrospection: Honing in on Critical Steps for Rapid Learning and Generalization in RL
Chen Sun, Wannan Yang, Thomas Jiralerspong, Dane Malenfant, Benjamin Alsbury-Nealy, Yoshua Bengio, Blake Richards