Aminian, Gholamali
11 publications
NeurIPS
2025
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
AISTATS
2023
How Does Pseudo-Labeling Affect the Generalization Error of the Semi-Supervised Gibbs Algorithm?
11 publications