Distinguished Engineer - AI
inference engines (TensorRT, vLLM, ONNX Runtime, Triton), model optimization (quantization, pruning, distillation), and serving...
, C++, or other relevant coding languages. Expertise in ML inference, model serving frameworks (Triton, Ray Serve, vLLM...
innovative solutions Model serving/inference (e.g., ONNX Runtime, vLLM, Triton, quantization, distillation, caching, dynamic...
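Several of the listings above name quantization among the model-optimization techniques. As a minimal sketch of the idea (not any specific library's implementation), the following pure-Python example shows post-training affine int8 quantization: floats are mapped to int8 codes plus a `(scale, zero_point)` pair, and dequantization recovers each value to within half a quantization step. Function names here are illustrative, not taken from the postings.

```python
def quantize_int8(values):
    """Map floats to int8 codes plus (scale, zero_point) for dequantization."""
    lo, hi = min(values), max(values)
    scale = (hi - lo) / 255 if hi > lo else 1.0
    zero_point = round(-128 - lo / scale)
    # Clamp to the int8 range after shifting by the zero point.
    return ([max(-128, min(127, round(v / scale) + zero_point)) for v in values],
            scale, zero_point)

def dequantize_int8(codes, scale, zero_point):
    """Invert the affine mapping; error is at most scale / 2 per value."""
    return [(c - zero_point) * scale for c in codes]

weights = [-0.8, -0.1, 0.0, 0.4, 1.2]
q, s, z = quantize_int8(weights)
recovered = dequantize_int8(q, s, z)
```

Production serving stacks (vLLM, TensorRT) apply the same idea per-channel or per-group with fused GPU kernels; the loop structure above is only the scalar reference.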
-level programming to maximize performance for AI operations, leveraging tools like Compute Kernel (CK), CUTLASS, and Triton...
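The kernel libraries named above (CK, CUTLASS, Triton) all build on tiling: computing an output matrix one block at a time so each block's operands fit in fast memory. As a hedged, CPU-only sketch of that loop structure (real kernels run this on GPU hardware with shared memory and registers), here is a tiled matrix multiply in pure Python; `block` and the function name are illustrative.

```python
def tiled_matmul(A, B, block=2):
    """Compute C = A @ B one block x block tile at a time (blocking sketch)."""
    n, k, m = len(A), len(B), len(B[0])
    C = [[0.0] * m for _ in range(n)]
    for i0 in range(0, n, block):            # tile over rows of C
        for j0 in range(0, m, block):        # tile over columns of C
            for p0 in range(0, k, block):    # march over the shared dimension
                # Inner loops touch only one tile of A, B, and C at a time,
                # which is what lets GPU kernels stage tiles in shared memory.
                for i in range(i0, min(i0 + block, n)):
                    for j in range(j0, min(j0 + block, m)):
                        for p in range(p0, min(p0 + block, k)):
                            C[i][j] += A[i][p] * B[p][j]
    return C
```

The result is identical to a naive triple loop; only the traversal order changes, which is the whole point of the optimization.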
inference scaling across multi-node clusters using Ray Serve and Triton. Experience in leading technical projects and supporting...
in software/hardware co-design. Deep understanding of GPUs and/or other AI accelerators. Experience with CUDA, Triton...
, autoscaling, service mesh, GPU operators) and LLM serving engines (e.g., vLLM, TensorRT-LLM, Triton, KServe/Seldon, Ray Serve...
with MLIR, MLIR dialects (Linalg, Affine), PyTorch 2.0, TVM, Triton, and/or LLVM. Bachelor's degree in Computer Science...
. in Triton, with a focus on generating efficient, low-level code. Experience in workload mapping strategies such as sharding...
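Sharding, mentioned above as a workload-mapping strategy, splits one large layer across devices so each computes only its slice. As a minimal sketch under stated assumptions (plain lists stand in for devices; function names are hypothetical), the example below row-shards a weight matrix and shows that concatenating the per-shard matrix-vector products reproduces the full result, which is the pattern tensor-parallel serving engines rely on.

```python
def shard_rows(matrix, num_shards):
    """Split rows of `matrix` into num_shards near-equal contiguous shards."""
    base, extra = divmod(len(matrix), num_shards)
    shards, start = [], 0
    for s in range(num_shards):
        size = base + (1 if s < extra else 0)  # first `extra` shards get one more row
        shards.append(matrix[start:start + size])
        start += size
    return shards

def sharded_matvec(shards, x):
    """Each shard computes its slice of W @ x; concatenation is the full output."""
    out = []
    for shard in shards:
        out.extend(sum(w * v for w, v in zip(row, x)) for row in shard)
    return out
```

Row sharding needs no cross-device reduction (each output element lives on one shard); column sharding would instead require an all-reduce to sum partial products, which is the usual communication trade-off in these mappings.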