System Software Engineer, LLM Inference

, and build automated performance regression suites. ▸ Implement prefix caching, KV quantization (FP8/INT4), and speculative KV... understanding of KV‑cache management algorithms: PagedAttention, prefix caching, chunked prefill, speculative decoding, and grouped...

Lugar: California | 24/03/2026 19:03:45 PM | Salario: S/. No Especificado | Empresa: International Recruiting LLC