Aminian, Gholamali
12 publications
NeurIPS
2025
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
AISTATS
2023
How Does Pseudo-Labeling Affect the Generalization Error of the Semi-Supervised Gibbs Algorithm?
12 publications