TransmissionOptimizing LLM Inference at ScaleJanuary 15, 20258 min readBy Alex ChenLLMInfrastructureOptimization