ML Anthology
Authors
Search
About
Kumar, Anurakt
2 publications
NeurIPSW
2024
Efficacy of the SAGE-RT Dataset for Model Safety Alignment: A Comparative Study
Tanay Baswa
,
Nitin Aravind Birur
,
Divyanshu Kumar
,
Jatan Loya
,
Anurakt Kumar
,
Prashanth Harshangi
,
Sahil Agarwal
NeurIPSW
2024
SAGE-RT: Synthetic Alignment Data Generation for Safety Evaluation and Red Teaming
Anurakt Kumar
,
Divyanshu Kumar
,
Jatan Loya
,
Nitin Aravind Birur
,
Tanay Baswa
,
Sahil Agarwal
,
Prashanth Harshangi