As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the Infrastructure Platforms and Foundational Services (IPFS... services in your application and product lines. You will ensure those NFRs are accounted for in your products' design and test...
‑on experience with monitoring tools (Prometheus, Grafana, Splunk, ELK, Datadog, etc.). Familiarity with Kubernetes, container... to start, and ability to attain Top Secret/SCI Preferred Experience supporting DoD or other federal programs...
Project with a Major Prime) for the following position for a federal agency. Job Title: Cloud Developer - Senior. Location... expertise in AWS cloud services, Terraform infrastructure as code, Linux system administration, and Agile/Scrum practices...
, and the discipline to design, automate, and operate complex services so that reliability becomes a first-class engineering... objectives (SLOs), service-level indicators (SLIs), and error budgets for critical services, and use those measures to drive...
Company: Federal Reserve Bank of Boston. Federal Reserve Financial Services (FRFS) delivers a suite of payments... services to financial institutions via FedLine® Solutions, FedNow℠, Fedwire®, National Settlement Service (NSS), FedCash...
, and connector configurations using GitOps patterns. Implement comprehensive observability using Prometheus, Grafana, Datadog.... Experience operating Kafka on Kubernetes (Strimzi, Confluent Operator). Exposure to managed Kafka services (AWS MSK, Azure Event...
large-scale, secure, and highly available cloud platforms on Amazon Web Services. This is a deeply hands-on engineering role... services, with strong attention to scalability, reliability, and security. Author and maintain production-quality...
configurations using GitOps patterns. Implement comprehensive observability using Prometheus, Grafana, Datadog, or Confluent Control.... Experience operating Kafka on Kubernetes (Strimzi, Confluent Operator). Exposure to managed Kafka services (AWS MSK, Azure Event...
metrics (TPS, latency drift) using Prometheus or similar tools to identify bottlenecks before they impact production. Own... federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified...