Local LLM Inference - Offline Scenario 24/05/2026 Testing local LLM inference speed on consumer hardware — prefill, decode, and KV cache impact across three devices and two models. Read more