NCCL, RDMA, and high-performance networking. Implement custom operators and fused kernels in PyTorch, JAX, or Triton... Experience with Triton, CUTLASS, or other GPU kernel authoring frameworks. Familiarity with TensorRT, FasterTransformer, or vLLM...
with large-scale data quality pipelines, distributed vector databases, or specialized AI inference engines (e.g., Triton, Ray...
NCCL, RDMA, and high-performance networking. Implement custom operators and fused kernels in PyTorch, JAX, or Triton... Experience with Triton, CUTLASS, or other GPU kernel authoring frameworks. Familiarity with TensorRT, FasterTransformer, or vLLM...
meetings Proficiency in MATLAB and R programming languages, including MATLAB-based acoustic analysis software Triton Current...
optimization, continuous batching, and speculative decoding for LLM serving. Drive compiler-level optimizations using Triton, XLA... in Triton or CUTLASS. Experience with FinOps for AI workloads. Publications or talks on AI systems performance...
-level optimizations using Triton, XLA, TorchInductor, or TVM, working with the broader ML framework community to land.... Familiarity with custom kernel authoring in Triton or CUTLASS. Experience with FinOps for AI workloads. Publications or talks...
at scientific meetings Proficiency in MATLAB and R programming languages, including MATLAB-based acoustic analysis software Triton...
-level optimizations using Triton, XLA, Torch Inductor, or TVM, working with the broader ML framework community to land.... Familiarity with custom kernel authoring in Triton or CUTLASS. Experience with FinOps for AI workloads. Publications or talks...
analysis effort within the Sustainment Systems Engineering team on the Triton Enterprise. You'll be expected to maintain...
within the Sustainment Systems Engineering team on the Triton Enterprise. You'll be expected to maintain, develop, and routinely...