(Triton, cuTe) Solid grasp of AI models, parallelisms, and/or compiler technologies (e.g. torch.compile) Experience...: Training, Distributed inference, MoE, Reinforcement Learning, kernel authoring (on CUDA, Triton, cuTe, etc). Experience...
, SYCL) Experience with model deployment software libraries and stacks (e.g. NVIDIA TensorRT, Triton Inference Server...
will be responsible for operations and maintenance support of the MQ-4C Triton sensors and communications systems at a Main Operating Base...
and improving high-performance kernel implementations in CUDA, TRT-LLM, and Triton. This is an exceptional opportunity... in frameworks such as CUDA, CUTLASS, or Triton. Increasingly known as “the AI computing company” and widely considered...
generation and processing codes, e.g., APOLLO, CASMO, HELIOS, TRITON (NEWT), Polaris, GenPMAXS Experience with 3D neutronics...
) and generative AI workloads. Enhance performance tuning using TensorRT/TensorRT-LLM, vLLM, Dynamo, and Triton Inference Server... Knowledge of Inference technologies - NVIDIA NIM, TensorRT-LLM, Dynamo, Triton Inference Server, vLLM, etc Proficiency...
Lugar:
Redmond, WA | 18/01/2026 03:01:46 AM | Salario: S/. $124000 - 195500 per year | Empresa:
Nvidia knowledge of CUDA, ROCm, Triton, PTX, CUTLASS, or similar GPU programming frameworks. Experience with machine learning...
for clean extensibility. Experience writing Triton, CUDA, or similar, and an understanding of the resulting mapping of tensor...
profiling (Nsight Systems), Triton Inference Server. LLM Operations (Differentiator): Significant experience with advanced...
Manager, AdsWizz, Triton). Deep familiarity with AWS, Kubernetes, and infrastructure automation. Proficiency...