Hausdörfer, Oliver

1 publications

NeurIPSW 2024 Communication Compression for Tensor Parallel LLM Inference Jan Hansen-Palmus, Michael Truong Le, Oliver Hausdörfer, Alok Verma