, and efficient serving techniques (vLLM, TensorRT-LLM, Triton) - Help Go-To-Market Specialist define and drive strategy on assets... frameworks (vLLM, llm-d, Triton, SGlang, etc), and optimization techniques (quantization, speculative decoding, KV-cache...
high-performance inference engines such as vLLM, NVIDIA Triton, or TorchServe, and working with cloud-native deployment...
with modern inference platforms and frameworks (vLLM, TensorRT-LLM, SGLang, Triton Inference Server) and the architectural trade...
Triton, or TorchServe, and working with cloud-native deployment, Docker, Kubernetes, MLOps pipelines, and major cloud...
with LLM inference engines and serving frameworks (e.g., SGLang, vLLM, Triton, TensorRT-LLM) Deep low-level systems...
Lugar:
Estados Unidos | 19/06/2026 17:06:04 PM | Salario: S/. $135000 - 160000 per year | Empresa:
SpaceX Experience with audio or podcast programmatic preferred (AudioMax, Adrise, Megaphone, Triton) Proficient in Excel, Salesforce...
Lugar:
Nueva York, NY | 19/06/2026 01:06:58 AM | Salario: S/. $74000 - 120000 per year | Empresa:
FOX using CUDA, Triton, and related tools - Explore AI-driven infrastructure for inference systems, including AI Agents...-level optimization using CUDA, Triton, Cutlass, or related tools - Experience with LLM inference optimization, such as PTQ...
Lugar:
San Jose, CA | 18/06/2026 23:06:18 PM | Salario: S/. No Especificado | Empresa:
TikTok using CUDA, Triton, and related tools - Explore AI-driven infrastructure for inference systems, including AI Agents...-level optimization using CUDA, Triton, Cutlass, or related tools - Experience with LLM inference optimization, such as PTQ...
Lugar:
San Jose, CA | 18/06/2026 22:06:44 PM | Salario: S/. No Especificado | Empresa:
TikTok (e.g. Triton, kserve, etc.) Experience with production ML and High Performance Compute development systems Strong problem...
servers such as vLLM, Triton. Experience with low-level GPU kernel development and writing custom kernels (e.g., CUDA, ROCm...