Distinguished Engineer - AI

inference engines (TensorRT, vLLM, ONNX Runtime, Triton), model optimization (quantization, pruning, distillation), and serving...

Location: San Jose, CA | 05/03/2026 02:03:02 AM | Salary: Not specified | Company: NetApp