System Software Engineer, LLM Inference

... and build automated performance regression suites.
▸ Implement prefix caching, KV quantization (FP8/INT4), and speculative KV...
... understanding of KV‑cache management algorithms: PagedAttention, prefix caching, chunked prefill, speculative decoding, and grouped...

Location: California | 25/03/2026 01:03:07 AM | Salary: S/. Not specified | Company: International Recruiting LLC