Yuan, Zheng
11 publications
ICLR
2024
#InsTag: Instruction Tagging for Analyzing Supervised Fine-Tuning of Large Language Models
ACML
2024
PISDR: Page and Item Sequential Decision for Re-Ranking Based on Offline Reinforcement Learning