Guo, Tianyu
30 publications
NeurIPS
2025
DynaPipe: Dynamic Layer Redistribution for Efficient Serving of LLMs with Pipeline Parallelism
NeurIPS
2025
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
NeurIPSW
2024
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
AAAI
2024
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation
NeurIPSW
2023
How Do Transformers Learn In-Context Beyond Simple Functions? a Case Study on Learning with Representations