Senior Compiler Engineer, AI Inference Platforms
, Triton, etc.). Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis...
in production (Ray Serve, vLLM, Ollama, TorchServe, Triton, etc.). Understanding of RAG (Retrieval-Augmented Generation...
, SGLang, PyTorch, Triton, NCCL, or related GPU/inference infrastructure; prior maintainer experience is a plus. Shipped...
Previous Unmanned Air Vehicle system test or maintenance experience is highly desired. MQ-4 Vehicle Test Controller or Triton...
in Fairfax, VA. As a Senior Software Engineer on the Triton team, you will have the following responsibilities: Develop...
, CUDA, Triton, TensorRT, Nvidia Dynamo, and Python. Good understanding of generative model architectures, including...
Proficiency with core technologies such as: Python, PyTorch, TensorFlow, NVIDIA Triton, TorchServe, ONNX, AIT, AOT, CUDA...
with GPU scheduling, Ray, Kubeflow, MLflow, or Triton Inference Server. Experience with storage backends (Ceph, vSAN, ZFS...
with low-level performance optimization, such as custom CUDA kernel development or using Triton Language, is a plus. Experience...