Sr. Platform Ops Engineer
, Prometheus, Grafana, CloudWatch, ELK stacks. Incident management experience - PagerDuty, on-call rotations, post-mortem culture...
, Prometheus, Grafana, CloudWatch, ELK stacks. Incident management experience - PagerDuty, on-call rotations, post-mortem culture...
and Kafka Monitoring: Datadog, PagerDuty, Sentry Version Control: Github, PagerDuty Projects we're working on: At Headway...
investors including top executives at Atlassian, Okta, Qualtrics, Zoom, and PagerDuty. Front has received numerous Great Place...
Storage, and Veeam while ensuring smooth operations of Atlassian, Zabbix, and PagerDuty platforms through sprint-based project... Storage systems (array operations, snapshots, replication). Experience with Atlassian, Zabbix, and PagerDuty or equivalent...
Observability: Prometheus, Grafana, DataDog, PagerDuty What Success Looks Like Production systems that measurably improve...
: PostgreSQL, Redis, Kafka, Snowflake/BigQuery, dbt Observability: Prometheus, Grafana, DataDog, PagerDuty What Success...
handling (PagerDuty, Datadog) and post-incident evaluations. Demonstrated success in mentoring and developing junior/mid-level...
with incident management tools (PagerDuty, OpsGenie, ServiceNow). Strong scripting skills (Python, Bash, or similar). Excellent...
, and infrastructure health using PRTG, PagerDuty, and related tooling Lead root cause analysis on complex incidents and implement...
/GitHub, Docker, EC2, S3, SNS, SQS, DataDog, and PagerDuty RESPONSIBILITIES: Manage the performance, development...