Sr Staff Machine Learning Platform Engineer (Prisma AIRS)
plus. Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton Language...
plus. Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton Language...
and sophisticated AI platforms (CUDA, Triton, NeMo, DOCA, etc.). A track record of building systems for real-time processing and low...
Position Overview Full Time- Triton College - River Grove, IL About Us: At Athletico, we believe in the power...
Looking For Deep, hands-on experience owning production ML infrastructure: inference serving, model optimization (e.g., vLLM, Triton...
of inference servers (vLLM, Triton) using KServe, KubeRay, or Knative to ensure serverless-style scaling for AI workloads...., vLLM, ONNX, TorchServe, Triton). Familiarity with quantization techniques (AWQ, GPTQ) to optimize model size/speed...
, etc.) in a production environment Experience with one or more inference serving frameworks (e.g., NVIDIA Triton, TorchServe, BentoML...
large‑scale GPU workloads on Kubernetes using tools like Ray and Kubeflow. Knowledge of CUDA programming, Triton kernels...
and libraries (e.g., Pytorch, CUDA, vLLM, Triton, NCCL, oneCCL, oneAPI) Experience with GPU performance profiling (e.g., Unitrace...
of AI workloads across customer production environments, spanning AI frameworks (PyTorch, Triton, Megatron-LM, vLLM), ROCm runtime...
Jr Testing & Service Technician 1 Company Summary Triton combines 30+ years of experience with exciting growth...-impact results on time, on budget and have fun while doing it. Triton's Ocean Systems is a fast-paced, interdisciplinary...