Senior Site Reliability Engineer
of observability principles and tools like (Prometheus, Datadog, OpenTelemetry). Experience with leading incident management...
of observability principles and tools like (Prometheus, Datadog, OpenTelemetry). Experience with leading incident management...
workflows. Familiarity with application performance monitoring and telemetry tools (Sentry, Datadog, or similar APM systems...
, or similar). Experience with monitoring and observability tools (DataDog, Prometheus, Grafana). Knowledge of database...
, Prometheus, DataDog, Azure Monitor). Respond quickly to alerts, triage incidents, and coordinate with SRE and Platform teams...). Experienced with monitoring and alerting solutions (Prometheus, Grafana, DataDog, Elastic, Azure Monitor). Strong analytical...
to have Understanding of the Investment Data Domain. • Familiarity with Dynatrace or Datadog for system observability and monitoring...
or SIEMs (e.g., Splunk, Datadog, Sumo Logic) and storage destinations (e.g., S3, R2, GCS) is a plus. Experience...
stack including JIRA & Confluence o DataDog, BigPanda, Service Now o Test Driven Development · Demonstrable experience...
/data fabric architectures, and observability tooling (e.g., Monte Carlo, DataDog, OpenLineage). Deep understanding of the...
., Prometheus, Grafana, Datadog). Proficiency in scripting and automation (Python, Bash, PowerShell). Familiarity...
We manage our infrastructure with Terraform, Kubernetes CRDs, ArgoCD and DataDog. “As an engineer, I follow a product through...