Guoyu Chen, Srinivasan Subramaniyan, Xiaorui Wang: Latency-Guaranteed Co-Location of Inference and Training for Reducing Data Center Expenses. ICDCS 2024: 473-484