comprehensive, real-time monitoring, alerting, and logging solutions focused on deep operational health, model performance analysis..., TorchServe, Triton Inference Server/TIS). Workflow Orchestration: (Airflow, Kubeflow, MLflow, Ray, Vertex AI, SageMaker...
Implement AI/ML solutions including model inferencing, RAG architecture, and LLM integration. Utilize NVIDIA Triton Server APIs... solutions and AI-driven architectures leveraging modern technologies. Collaborate across global teams, ensure...
such as vLLM, TensorRT-LLM, Triton Inference Server, or similar. Deep understanding of GPU and distributed systems performance... AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the...
your career. THE ROLE: AMD’s Software and Solutions Team is seeking a Principal ML/AI Solutions Engineer to empower customers... development, integration, deployment, and performance analysis, ensuring innovative, efficient, and scalable AI solutions on AMD...
your career. THE ROLE: AMD’s Software and Solutions Team is seeking a Senior GPU Kubernetes Engineer to lead GPU operator..., Triton, KServe, or Ray, along with experience deploying LLM workloads, is highly desirable. Knowledge of ROCm, AMD MI300...
of ideas and perspectives at AHEAD. The AHEAD Senior Modern Datacenter Specialist Solutions Engineer (SSE) is a presales... technical leader focused on designing, positioning, and enabling enterprise-scale AI infrastructure solutions. This role owns...
Location: USA | 31/01/2026 23:01:18 PM | Salary: Not specified | Company: AHEAD
and AI deployment frameworks (Triton, MLIR, IREE, ONNX, OpenVINO, TVM, XLA). 5+ years in software architecture and design... and transformative solutions. Your focus will be on leveraging technology innovation and incubation to drive commercial success, ensuring...
Location: Hillsboro, OR | 31/01/2026 19:01:13 PM | Salary: Not specified | Company: Intel
with compiler engineers and runtime engineers to create, build and tune distributed inference solutions with Trainium and Inferentia... of Neuron's inference stack across Amazon and the Open Source Community. As you design and code solutions to help our team drive...
will have a profound and lasting impact on our ability to deliver cutting-edge AI security solutions at a massive scale. Your Impact... are a significant plus. Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton...