Huang, Fei
82 publications
NeurIPS
2025
Gated Attention for Large Language Models: Non-Linearity, Sparsity, and Attention-Sink-Free
NeurIPS
2025
Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation
NeurIPS
2025
Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding
ICML
2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
NeurIPS
2025
VLM-R³: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought
ICLR
2025
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
ICML
2024
Language Models Are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
NeurIPS
2024
MaVEn: An Effective Multi-Granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model
NeurIPS
2024
Mobile-Agent-V2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration
CVPR
2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
NeurIPS
2024
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
NeurIPSW
2024
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
ICCV
2023
BUS: Efficient and Effective Vision-Language Pre-Training with Bottom-up Patch Summarization
NeurIPS
2023
Can LLM Already Serve as a Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
NeurIPS
2023
EMMA-X: An EM-like Multilingual Pre-Training Algorithm for Cross-Lingual Representation Learning
AAAI
2023
Graphix-T5: Mixing Pre-Trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing
NeurIPS
2023
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents
AAAI
2022
From Dense to Sparse: Contrastive Pruning for Better Pre-Trained Language Model Compression
IJCAI
2022
Meta-Learning Based Knowledge Extrapolation for Knowledge Graphs in the Federated Setting
AAAI
2021
Bridging the Domain Gap: Improve Informal Language Translation via Counterfactual Domain Adaptation
AAAI
2021
Dynamic Hybrid Relation Exploration Network for Cross-Domain Context-Dependent Semantic Parsing