Serve and Triton. Develop and maintain robust serving infrastructure specialized for serving large language models (LLMs...) in-house. Develop efficient ML serving platform using Ray Serve and Triton. Build the foundation of Tinder's feature store...
synchronous and asynchronous web services. Develop real-time online inference for highly complex models using Triton, TensorRT... or equivalent experience in ML. Experience working with ML technologies (PyTorch, SageMaker, Triton, TensorRT, etc.). Experience...
and written communication skills. Experience with MLIR, MLIR dialects (LinAlg, Affine), PyTorch 2.0, TVM, Triton, and/or LLVM...
Location: San Diego, CA | 03/03/2026 03:03:22 AM | Salary: $158,400 - $237,600 per year | Company: Qualcomm
). Hands-on experience with ML model serving frameworks (TorchServe, TensorFlow Serving, Triton Inference Server). Experience...
, capabilities, and skills Foundational understanding of NVIDIA GPU infrastructure software (e.g., DCGM, BCM, Triton Inference...
, Triton Inference Server, or Ray Serve (nice to have). Subsidiary: PayPal Travel Percent: 0 - The base pay...
Location: San Jose, CA | 20/02/2026 18:02:44 | Salary: $143,500 - $212,850 per year | Company: PayPal
, Python. Utilize model inference engines including TensorFlow Serving, Triton, or other AI accelerators. Build large scale...; Model inference engines including TensorFlow Serving, Triton, or other AI accelerators; Building large scale distributed...
Location: Austin, TX | 29/01/2026 22:01:07 | Salary: $130,700 - $202,300 per year | Company: Visa
and optimize GPU kernels using tools like CUDA or Triton, and leverage techniques such as FlashAttention and Tensor Cores...
(Triton, TensorRT, vLLM, TGI, etc.). Familiarity with MLOps practices for CI/CD, continuous training, observability...