Senior Site Reliability Engineer
/SLOs, implement real-time observability (Datadog, Prometheus, Grafana, ...) and proactively identify risks before it...
, batching, etc.). Data Infrastructure Ownership Select and maintain tooling (e.g., Prometheus/Grafana, Datadog... analytics, observability, or monitoring Hands-on with metrics/logging/tracing frameworks (Prometheus, Grafana, Datadog, New...
as a Service Some technologies that we use: AWS & Azure platforms, Terraform, HCP Vault, Datadog, GitHub, Keycloak...
, metrics, distributed tracing, and alerting across tools such as New Relic, Datadog, or CloudWatch. Collaborate closely...
with monitoring tools (Prometheus, Grafana, Datadog, OpenTelemetry) Automate ingestion and cleanup tasks Build internal libraries...
and understanding of service monitoring tools (Instana, Datadog, Grafana, etc. are beneficial) Behaviours & Competencies: Data...
. Implement and manage observability using OpenTelemetry for logs and traces, integrating with Datadog. Ensure security, high.... Hands-on experience with AWS services Familiarity with OpenTelemetry, Datadog, and observability tooling. Solid coding...
workloads Experience with nginx or similar technologies Experience with Datadog, Prometheus or similar tools A positive...
via Datadog and we increasingly orchestrate our workloads using K8s. Skilled in agile delivery, incident management...
, ElasticSearch, log management systems, and Datadog APM, to name but a few API specifications, conforming to the OpenAPI (Swagger...