Staff Software Engineer - AI/ML Platform
with inference optimization using vLLM, TensorRT-LLM, Triton Inference Server, or similar DevOps & Platform Skills Advanced...
, diverse individuals in support of the MQ-4C Triton program test activities. This position will be located at NAS (Naval Air... Station) Patuxent River, MD. The Triton – MQ-4C ITT (Integrated & Test Team) is a US Navy & Northrop Grumman combined test...
Previous Unmanned Air Vehicle system test or maintenance experience is highly desired. MQ-4 Vehicle Test Controller or Triton...
/CuTeDSL/cutlass/Triton). Prior contributions to major LLM inference frameworks (e.g. vLLM) or prior experience with graph...
GenAI models Experience with low-level GPU programming (CUDA, Triton, NCCL) and frameworks such as PyTorch or JAX...
performance for AI operations, leveraging tools like Compute Kernel (CK), CUTLASS, and Triton for multi-GPU and multi-platform...
is a plus Hands-on experience with Kubernetes, Docker, and ML-Ops platforms (e.g., MLflow, KServe, Triton) Familiarity with CUDA...
-performance inference framework (e.g. Triton and TensorRT) Deep understanding of Diffusion Architecture Experience profiling...
, Triton, TVM, or similar systems Exposure to: Neural networks Tree-based models (e.g., LightGBM) State space...
(Microsoft Access, SQL, Python, Power BI, etc.) Experience with actuarial software (ALFA, Triton, and/or CASE preferred). Self...