Identity Security Architect
protecting live AI inference pipelines built on TensorRT-LLM and Triton Inference Server. Cloud-Native Governance: Extensive...
on TensorRT-LLM and Triton Inference Server. Cloud-Native Governance: Extensive familiarity managing Kubernetes-native security...
. AI Pipeline Security: Hardened experience protecting live AI inference pipelines built on TensorRT-LLM and Triton Inference Server...
for GenAI and LLM workloads. Design and optimize inference solutions using vLLM, TensorRT-LLM, Triton Inference Server..., Triton, SGLang. GPU & Distributed Systems: CUDA, NCCL, MIG, Tensor Parallelism. Platforms: Kubernetes, OpenShift AI, KServe...
: Hardened experience protecting live AI inference pipelines built on TensorRT-LLM and Triton Inference Server. Cloud-Native...
experience protecting live AI inference pipelines built on TensorRT-LLM and Triton Inference Server. Cloud-Native Governance...
PyTorch with Triton Performance Engineer. Bellevue, WA (Onsite). Contract: Design and implement high-intensity stress... workloads using PyTorch and Triton to identify performance bottlenecks and improve platform stability and maturity. Design...
platforms. Preferred Qualifications: Experience with Client technologies such as NIM, NeMo, TensorRT-LLM, Triton Inference...