ML Ops Engineer

across training and inference workloads. Configure and manage NVIDIA Triton Inference Server for multi-model serving, dynamic... planning, and failure recovery. Practical MLOps experience: model serving infrastructure (Triton or equivalent), experiment...

Location: Seattle, WA | 26/03/2026 18:03:30 | Salary: Not specified | Company: International Recruiting LLC

SR Principal Software Engineer - LLM Engineering

servers such as Triton Inference Server and vLLM for high-throughput, low-latency serving at scale. Oversees production... in large-scale environments typical of major tech firms. Hands-on experience building LLM inference engines using Triton...

Location: Palo Alto, CA | 26/03/2026 18:03:32 | Salary: Not specified | Company: JPMorgan Chase

Developer Relations Manager – AI Natives

AI workloads using NVIDIA technologies including CUDA-X libraries, TensorRT-LLM, Triton Inference Server, NVIDIA NeMo, NIM... such as TensorRT-LLM, Triton Inference Server, CUDA, RAPIDS, or similar GPU acceleration technologies. Experience building or scaling...

Location: Santa Clara, CA | 25/03/2026 21:03:40 | Salary: Not specified | Company: Nvidia

SR Principal Software Engineer - LLM Engineering

efficiency. Leads deployment and optimization using Model Inference servers such as Triton Inference Server and vLLM for high... environments typical of major tech firms. Hands-on experience building LLM inference engines using Triton Inference Server...

Location: Palo Alto, CA | 25/03/2026 21:03:22 | Salary: Not specified | Company: JPMorgan Chase

Senior Researcher - Efficient AI

) and inference serving frameworks (e.g., vLLM, Triton Inference Server, TensorRT-LLM, ONNX Runtime, Ray Serve, DeepSpeed-MII). 3...+ years of experience in GPU programming and optimization, with expert knowledge of CUDA, ROCm, Triton, PTX, CUTLASS...

Location: Redmond, WA | 25/03/2026 21:03:14 | Salary: Not specified | Company: Microsoft

Applied AI ML Director

. Foundational understanding of NVIDIA GPU Infrastructure software (e.g., NVIDIA DCGM, BCM, Triton Inference), Kubernetes and Cloud...

Location: Jersey City, NJ | 25/03/2026 18:03:01 | Salary: Not specified | Company: JPMorgan Chase