Ren, Jie
52 publications
AISTATS
2025
Superiority of Multi-Head Attention: A Theoretical Study in Shallow Transformers in In-Context Linear Regression
NeurIPSW
2024
Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model
NeurIPSW
2024
FlashDP: Memory-Efficient and High-Throughput DP-SGD Training for Large Language Models
ECCV
2024
Unveiling and Mitigating Memorization in Text-to-Image Diffusion Models Through Cross Attention
ICML
2023
A Simple Zero-Shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models
NeurIPSW
2022
Improving the Robustness of Conditional Language Models by Detecting and Removing Input Noise
NeurIPSW
2022
Out-of-Distribution Detection and Selective Generation for Conditional Language Models