GPU Software Engineer (CUDA)

NCCL, RDMA, and high-performance networking. Implement custom operators and fused kernels in PyTorch, JAX, or Triton... Experience with Triton, CUTLASS, or other GPU kernel authoring frameworks. Familiarity with TensorRT, FasterTransformer, or vLLM...

Lugar: Bartlett, IL | 25/06/2026 01:06:59 AM | Salario: S/. $100000 - 150000 per year | Empresa: Bright Vision Technologies

Associate Director Quality Assurance

standards. This role is accountable for Quality oversight across the manufacturing process of formulation, filling, triton...

Lugar: El Paso, TX | 24/06/2026 23:06:11 PM | Salario: S/. No Especificado | Empresa: BD

Forward Deployed Architect

in one or more of: NVIDIA Stack (CUDA, NeMo, Triton, TensorRT, NIM, DGX Cloud, and the broader DSX software portfolio), Inference Systems (large...

Lugar: Santa Clara, CA | 24/06/2026 22:06:51 PM | Salario: S/. No Especificado | Empresa: Nvidia

AI Performance Engineer

, and speculative decoding for LLM serving. Drive compiler-level optimizations using Triton, XLA, TorchInductor, or TVM, working..., TensorRT-LLM, DeepSpeed, or similar projects. Familiarity with custom kernel authoring in Triton or CUTLASS. Experience...

Lugar: Newark, CA | 24/06/2026 21:06:41 PM | Salario: S/. No Especificado | Empresa: Bright Vision Technologies

AI Performance Engineer

, and speculative decoding for LLM serving. Drive compiler-level optimizations using Triton, XLA, TorchInductor, or TVM, working..., TensorRT-LLM, DeepSpeed, or similar projects. Familiarity with custom kernel authoring in Triton or CUTLASS. Experience...

Lugar: Bartlett, IL | 24/06/2026 21:06:25 PM | Salario: S/. $100000 - 150000 per year | Empresa: Bright Vision Technologies

MLOps Engineer - Fully Remote

. Experience writing or optimizing custom GPU kernels using Pallas or Triton. Demonstrable career progression. Ability to engage...

Lugar: Nueva York, NY | 24/06/2026 17:06:18 PM | Salario: S/. No Especificado | Empresa: Mercor

ML Kernel Performance Engineer, Edge AI and Science

Performance Engineer to work at the hardware-software boundary of this platform, crafting high-performance CUDA and Triton kernels... job responsibilities Design and implement high-performance CUDA and Triton kernels for quantization-aware training, sparse matrix...

Lugar: Sunnyvale, CA | 24/06/2026 17:06:24 PM | Salario: S/. No Especificado | Empresa: Amazon