of LLM Serving packages (e.g. vLLM, SGLang, TGI, Triton-Inference server, Dynamo, LLM-d). Work closely with customers... (PyTorch, CUDA, Triton). Knowledge of torch.compile or torchDynamo PhD in Computer Science, Computer Engineering or Machine...
's or PhD in CS/EE/Math or related field. Training & Inference stacks: PyTorch, CUDA/cuDNN, Triton Inference Server, vLLM...
with one of NVIDIA Dynamo, Triton Inference Server, or TensorRT-LLM for model optimization and serving. GPU orchestration using NVIDIA...
with relevant libraries, compilers, and languages - CUDNN, CUBLAS, CUTLASS, MLIR, Triton, CUDA, OpenCL Experience with the...
Lugar:
Santa Clara, CA | 01/03/2026 00:03:00 AM | Salario: S/. $124000 - 195500 per year | Empresa:
Nvidia Certified Professional (OCP) Knowledge of Nvidia Triton servers for LLMs Familiarity with CI/CD DevOps practices and tools...
., NVIDIA DCGM, Triton Inference). Familiarity with big data processing tools and cloud data services. Advanced knowledge...
at scale Experience with one or more of: HIP, CUDA, OpenCL, Triton/Gluon, CUTLASS, CK Experience with GPU compiler toolchains... (e.g., LLVM) and intermediate representations (e.g., MLIR, LLVM IR, Triton IR) is a plus Hands-on experience contributing...
, CUDA, Triton, TensorRT, Nvidia Dynamo, and Python. Good understanding of generative model architectures, including...
Lugar:
San Jose, CA | 27/02/2026 02:02:18 AM | Salario: S/. No Especificado | Empresa:
Adobe: Familiarity with ClearML, Triton, PyTorch, and TensorFlow Familiarity with statistics and healthcare domain Proven expertise...
Lugar:
Lehi, UT | 27/02/2026 00:02:54 AM | Salario: S/. No Especificado | Empresa:
Waystar and improving high-performance kernel implementations in CUDA, TRT-LLM, and Triton. This is an exceptional opportunity... in frameworks such as CUDA, CUTLASS, or Triton. Increasingly known as “the AI computing company” and widely considered...