AI models. This role offers the unique opportunity to leverage your expertise in PyTorch and kernel-level programming (Triton... kernels using Triton or Pallas. Demonstrable career progression. Ability to engage reliably for at least 40 hours/week...
Lugar:
Seattle, WA | 23/06/2026 17:06:44 PM | Salario: S/. No Especificado
using vLLM, SGLang, TensorRT-LLM, or Triton Knowledge of GPU networking and performance optimisation, including InfiniBand...
We are hiring immediately for full time and part time FOOD SERVICE WORKER/CASHIER positions. Location: Triton Central...
and build end-to-end pipelines, including high-performance GPU kernels (CUDA/Triton), efficient data I/O in Rust, and latency...
Kubernetes (K8s) skills, specifically with KubeFlow, Ray, or NVIDIA Triton. -CI/CD &IaC: Expertise in GitHub Actions...
Kubernetes (K8s) skills, specifically with KubeFlow, Ray, or NVIDIA Triton. -CI/CD &IaC: Expertise in GitHub Actions...
Kubernetes (K8s) skills, specifically with KubeFlow, Ray, or NVIDIA Triton. -CI/CD &IaC: Expertise in GitHub Actions...
Kubernetes (K8s) skills, specifically with KubeFlow, Ray, or NVIDIA Triton. -CI/CD &IaC: Expertise in GitHub Actions...
experience with JAX and/or PyTorch at scale. Experience writing or optimizing custom GPU kernels using Pallas (JAX) or Triton...
, and tuning LLMs using TensorRT-LLM and Triton Inference server. Managing MLOps/LLMOps pipelines, using TensorRT-LLM and Triton... balancing, etc. Operation and support of MLOps/LLMOps pipelines, using TensorRT-LLM and Triton Inference server to deploy...