Lead Site Reliability Engineer
. Expertise with observability stacks: Datadog, Prometheus, Grafana, OpenTelemetry. Deep AWS experience across EC2, EKS, Lambda...
. Expertise with observability stacks: Datadog, Prometheus, Grafana, OpenTelemetry. Deep AWS experience across EC2, EKS, Lambda...
/ Kubernetes Azure DevOps, Octopus Deploy, TeamCity Datadog, Azure Monitor MSSQL, MySQL What we're looking for Deep hands...
and other products Monitoring: Datadog, Bugsnag, Kibana (OpenSearch), AWS console.... What we're looking for 5+ years...
, Honeycomb, or Datadog). Solid understanding of event-driven systems, RESTful APIs, and caching solutions like Redis...
across a broad technical landscape, including: Linux, Unix, Solaris Cloud technologies (GCP, AWS) RESTful APIs Datadog or similar...
, Datadog, or Nagios Good understanding of infrastructure security concepts and best practices Experience using Infrastructure...
, Honeycomb, or Datadog). Solid understanding of event-driven systems, RESTful APIs, and caching solutions like Redis...
and logging systems (Prometheus, Grafana, Datadog) to define alerts, dashboards, and log-based metrics that improve application...
. Familiar with observability, alerting, and incident management (DataDog, Grafana) and collaborative, agile team environments...
) for deployments. Observability Mindset - You believe in measuring everything. You've worked with DataDog (or similar) to ensure...