"De-Quantization Penalties for Interactive LLM Inference on Prosumer GPUs."

Junaid Ahmad Khan (2026)

Details and statistics

DOI: 10.1109/LCA.2026.3659512

access: closed

type: Journal Article

metadata version: 2026-03-09