the design of distributed ML serving systems optimized for generative AI workloads. Drive technical excellence..., and Triton. Develop and optimize system components for tensor/data parallelism and disaggregated serving. Implement and optimize...
and model performance for real-world use cases. Provide technical leadership and guidance in AI software best practices... AI system performance using tools such as NVIDIA Nsight, TensorRT, Triton Inference Server, or similar profiling...
We are looking for a deeply technical systems software engineer who will own AI stack readiness on DGX Station. You will profile workloads... with NVIDIA’s framework, compiler (TensorRT, NVCC, Triton), and GPU architecture teams to improve kernel fusion, graph execution...
technical feedback. Develop guidelines and detailed rubrics/evaluation frameworks to assess training pipeline design... and/or PyTorch at scale. Experience writing or optimizing custom GPU kernels using Pallas (JAX) or Triton. Demonstrable career...
Location: Phoenix, AZ | 02/06/2026 20:06:09 | Salary: $140 per hour | Company: SaidGig
technical feedback. Develop guidelines and detailed rubrics/evaluation frameworks to assess training pipeline design... and/or PyTorch at scale. Experience writing or optimizing custom GPU kernels using Pallas (JAX) or Triton. Demonstrable career...
Location: Houston, TX | 02/06/2026 20:06:02 | Salary: $140 per hour | Company: SaidGig
technical feedback. Develop guidelines and detailed rubrics/evaluation frameworks to assess training pipeline design... and/or PyTorch at scale. Experience writing or optimizing custom GPU kernels using Pallas (JAX) or Triton. Demonstrable career...