Huang, Furong
133 publications
NeurIPS
2025
A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks
ICLRW
2025
AdvBDGen: A Robust Framework for Generating Adaptive and Stealthy Backdoors in LLM Alignment Attacks
AAAI
2025
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
ICCV
2025
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
CVPR
2025
Immune: Improving Safety Against Jailbreaks in Multi-Modal LLMs via Inference-Time Alignment
NeurIPS
2025
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
ICLR
2025
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies
ICML
2024
A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM)
NeurIPSW
2024
AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment
ICMLW
2024
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models
NeurIPS
2024
Boosting Sample Efficiency and Generalization in Multi-Agent Reinforcement Learning via Equivariance
ICLR
2024
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL
ICMLW
2024
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
NeurIPSW
2024
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
NeurIPS
2024
Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
NeurIPSW
2024
LSH-E Tells You What to Discard: An Adaptive Locality-Sensitive Strategy for KV Cache Compression
NeurIPS
2024
Make-an-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion
ICMLW
2024
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences
ICLR
2024
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
ICML
2024
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
NeurIPSW
2024
PoisonedParrot: Subtle Data Poisoning Attacks to Elicit Copyright-Infringing Content from Large Language Models
ICLR
2024
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
NeurIPS
2023
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
NeurIPS
2023
C-Disentanglement: Discovering Causally-Independent Generative Factors Under an Inductive Bias of Confounder
NeurIPSW
2023
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL
ICMLW
2023
Equal Long-Term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making
ICML
2023
Learning Unforeseen Robustness from Out-of-Distribution Data Using Equivariant Domain Translator
ICLRW
2023
Learning Unforeseen Robustness from Out-of-Distribution Data Using Equivariant Domain Translator
NeurIPSW
2023
RealFM: A Realistic Mechanism to Incentivize Data Contribution and Device Participation
NeurIPS
2022
Adversarial Auto-Augment with Label Preservation: A Representation Learning Principle Guided Approach
NeurIPSW
2022
Controllable Attack and Improved Adversarial Training in Multi-Agent Reinforcement Learning
NeurIPSW
2022
DP-InstaHide: Data Augmentations Provably Enhance Guarantees Against Dataset Manipulations
NeurIPS
2022
Efficient Adversarial Training Without Attacking: Worst-Case-Aware Robust Reinforcement Learning
NeurIPS
2022
End-to-End Algorithm Synthesis with Recurrent Networks: Extrapolation Without Overthinking
NeurIPSW
2022
Is Model Ensemble Necessary? Model-Based RL via a Single Model with Lipschitz Regularized Value Function
NeurIPSW
2022
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
ICLR
2022
Reinforcement Learning Under a Multi-Agent Predictive State Representation Model: Method and Theory
NeurIPS
2021
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks
NeurIPS
2021
VQ-GNN: A Universal Framework to Scale up Graph Neural Networks Using Vector Quantization
ICML
2020
An End-to-End Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm