ML Anthology
Song, Xingyi
2 publications
NeurIPS 2024
Confidence Regulation Neurons in Language Models
Alessandro Stolfo, Ben Wu, Wes Gurnee, Yonatan Belinkov, Xingyi Song, Mrinmaya Sachan, Neel Nanda
ICMLW 2024
Confidence Regulation Neurons in Language Models
Alessandro Stolfo, Ben Peng Wu, Wes Gurnee, Yonatan Belinkov, Xingyi Song, Mrinmaya Sachan, Neel Nanda