Zhao, Hai
42 publications
AAAI
2025
SCANS: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering
NeurIPS
2025
SmallKV: Small Model Assisted Compensation of KV Cache Compression for Efficient LLM Inference
NeurIPS
2025
Wide-Horizon Thinking and Simulation-Based Evaluation for Real-World LLM Planning with Multifaceted Constraints
NeurIPS
2024
Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models