Zhang, Kexun
13 publications
ICLR
2025
Generalization V.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data
ICLR
2025
SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement
ICLRW
2024
DeFT: Flash Tree-Attention with IO-Awareness for Efficient Tree-Search-Based LLM Inference