logging, and cost attribution Operational Excellence Own uptime, reliability, and performance of ML/LLM services (SLIs... & Infrastructure: Strong experience with major cloud providers (AWS, GCP, or Azure) and ML-specific services (SageMaker, Vertex...
Lugar:
USA | 05/02/2026 00:02:20 AM | Salario: S/. No Especificado | Empresa:
Sumo Logic our next generation airline services. You will be part of the team that is building a globally deployed platform for our customers using... that design, develop, test, debug and document software, by providing high quality technical solutions and services that deliver...
Lugar:
Chicago, IL | 04/02/2026 18:02:15 PM | Salario: S/. No Especificado | Empresa:
SES, and implement cloud-native automation and remediation services across AWS, Azure, and GCP platforms Build and maintain highly... Apply Site Reliability Engineering principles including SLIs, SLOs, SLAs, and error budgets to cloud services Design...
Lugar:
San Jose, CA | 04/02/2026 02:02:03 AM | Salario: S/. No Especificado | Empresa:
F5 applications to Kubernetes. Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes..., Prometheus, ELK. Monitor, triage and respond to alerts in our high availability environments. Participate in design and code...
applications to Kubernetes. Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes..., Prometheus, ELK. Monitor, triage and respond to alerts in our high availability environments. Participate in design and code...
using tools such as Prometheus, Grafana, OpenTelemetry, InfluxDB, and AWS CloudWatch. Design and implement real-time... metrics pipelines and time-series data processing systems. Develop scalable APIs and services to expose observability data...
. We offer our customers the flexibility to use their accounts to purchase and receive payments for goods and services, as well as the ability.... Familiarity with metrics-driven development and monitoring using tools like DataDog, Grafana, Prometheus, or New Relic...
Lugar:
USA | 31/01/2026 18:01:29 PM | Salario: S/. No Especificado | Empresa:
PayPal architectures and RESTful API development. Experience with AWS or other public clouds, distributed systems, and scaling services... (Terraform), and monitoring/observability (Prometheus, Datadog, ELK, Splunk, PagerDuty). Strong Git practices (branching, code...
Lugar:
Columbia, MD | 30/01/2026 21:01:27 PM | Salario: S/. $137500 - 183500 per year | Empresa:
Tenable using tools such as Prometheus, Grafana, OpenTelemetry, InfluxDB, and AWS CloudWatch. Design and implement real-time... metrics pipelines and time-series data processing systems. Develop scalable APIs and services to expose observability data...
Reliability Engineer, you will be responsible for working with program development teams, infrastructure and platform services...-scale software systems and services while minimizing downtime and mitigating potential failures. Qualifications...