Senior Software Development Engineer – LLM Inference Framework
/ FP4 GEMM and FlashAttention / MLA Collaborate with compiler teams (Triton, LLVM, ROCm) to unblock framework-level...
/ FP4 GEMM and FlashAttention / MLA Collaborate with compiler teams (Triton, LLVM, ROCm) to unblock framework-level...
, and Triton Develop and optimize system components for tensor/data parallelism and disaggregated serving Implement and optimize...
Ground Segment team of qualified, diverse individuals in support of the MQ-4C Triton program test activities. This position... will be located at NAS (Naval Air Station) Patuxent River, MD. The Triton – MQ-4C ITT (Integrated & Test Team) is a US Navy...
-on experience with inference engines – vLLM, TensorRT-LLM, Triton, SGLang, llama.cpp and GPU profiling (Nsight, PyTorch profiler...
with NVIDIA’s framework, compiler (TensorRT, NVCC, Triton), and GPU architecture teams to improve kernel fusion, graph execution...: Validate the full NVIDIA AI software stack on DGX Station: CUDA toolkit, cuDNN, TensorRT, NCCL, Triton Inference Server, DCGM...
within our Software organization supporting MQ-4C Triton. This role is located in San Diego, CA. In this role you will be working...
debugging, performance evaluation, and testing. Experience with CUDA, OpenCL, HIP, SYCL, Mojo, Pallas, Triton, Mosaic, Halide...
, specifically JAX, PyTorch, and kernel-level programming (Pallas/Triton). Key Responsibilities Guide research and engineering teams... custom GPU kernels using Pallas (JAX) or Triton. Demonstrable career progression. Ability to engage reliably for at least 30...
, specifically JAX, PyTorch, and kernel-level programming (Pallas/Triton). Key Responsibilities Guide research and engineering teams... custom GPU kernels using Pallas (JAX) or Triton. Demonstrable career progression. Ability to engage reliably for at least 30...
, specifically JAX, PyTorch, and kernel-level programming (Pallas/Triton). Key Responsibilities Guide research and engineering teams... custom GPU kernels using Pallas (JAX) or Triton. Demonstrable career progression. Ability to engage reliably for at least 30...