MES, DNS, and authentication services. Participate in an on-call rotation and provide after-hours support... skills with Bash, PowerShell, or Python. Familiarity with infrastructure monitoring and alerting tools such as Prometheus...
and cloud-native services. Develop and maintain systems that ensure rigorous follow-through on action items, remediation plans... Experience with observability platforms (Prometheus, Grafana, Datadog, Splunk, ELK) and distributed systems monitoring, logging...
Location:
Bethesda, MD | 24/03/2026 18:03:15 PM | Salary: S/. Not specified | Company:
GEICO
and rollback safety. Manages Terraform-based infrastructure, certificates, and secrets. Implements observability using Prometheus... production-ready services and operational excellence. Drives incident resolution, root cause analysis, and continuous improvement...
of our AWS Aurora PostgreSQL and MySQL databases along with other AWS services. If you thrive in a dynamic environment and love..., Prometheus). 4+ years of scripting and automation skills (Python, Bash). Advanced level of proficiency with communication...
AI Systems (agentic services, APIs/SDKs, RAG/CAG, E2E pipelines). Ensure Model Quality, Evaluation, Security & Safety. Implement... (OpenTelemetry style); alerting (e.g., Prometheus/Grafana equivalents). 3+ years in Programming & Packaging Python (typing, pytest...
AI Systems (agentic services, APIs/SDKs, RAG/CAG, E2E pipelines). Ensure Model Quality, Evaluation, Security & Safety. Implement... (OpenTelemetry style); alerting (e.g., Prometheus/Grafana equivalents). 5+ years in Programming & Packaging Python (typing, pytest...
of those we are honored to serve. Senior Software Engineer - TypeScript and AWS services. Requisition number: 2348829. Job category...: Design and implement scalable, secure, and resilient cloud-native applications using AWS services (e.g., Lambda, ECS, RDS...
-production environments. Manage user accounts, access controls, sudo policies, and authentication services (LDAP, AD integration... and troubleshoot system services (systemd, SELinux, firewalld, NTP, DNS, DHCP). Monitor system health, performance, and capacity using...
dashboards, alerts, and reliability improvements using Prometheus and Grafana. Partner with development teams to automate...+ years of experience troubleshooting distributed systems and working with observability platforms such as Prometheus, Grafana...
architecture, TypeScript, GraphQL, Bootstrap.js, HTML5, XML, CSS3, Java, JavaScript, REST services, NoSQL technologies (Cassandra.../MongoDB), Spring Boot, Kafka/MQ, Redis, Splunk, Azure/AWS, Prometheus/Grafana, Git, Jira, Jenkins, Docker, Kubernetes...
Location:
Dallas, TX | 22/03/2026 20:03:26 PM | Salary: S/. Not specified | Company:
AT&T