and Kubernetes platforms supporting AI-enabled applications for the federal civilian sector. You’ll have the opportunity to shape...-grade systems 2+ years of experience with AWS, including services such as EKS, EC2, RDS, Lambda, S3, IAM, VPC, Route 53...
that the infrastructure components integrate seamlessly with platform services and developer workflows. Key Responsibilities.... Observability & Reliability Define observability frameworks (Prometheus, Grafana, ELK, CloudWatch) for infrastructure...
Lugar:
Portland, OR | 09/01/2026 18:01:27 PM | Salario: S/. No Especificado | Empresa:
ApTask to help build a next-generation cloud platform that powers V1G (Vehicle-to-Grid) services. This platform is essential... Development & Integration: Build APIs, scripts, and secure authentication for V1G services and DERMS/OEM systems. · Testing...
needs of our business. This includes working with application teams in designing and deploying services in the cloud... with monitoring and logging tools such as Datadog, Grafana and Prometheus. Strong communication, analytical, and technical...
, governance, and model risk management across ML services. Lead design and implementation of models across classical ML and deep... tracking, A/B testing and online evaluation frameworks. Observability: Prometheus/Grafana, OpenTelemetry;SLO-driven...
About NDi: Network Designs, Inc. (NDi) is a leading Federal contractor that specializes in designing, developing..., VxRail Monitoring Frameworks: Prometheus, Nagios, Grafana Security & Networking: SSH, SSL/TLS, Key Vaults Server...
patterns that evolve into standards across the enterprise. Our Federal Government faces complex missions ranging... from enabling services for citizens to defending against our adversaries on the battlefield. The Platform Engineer...
Lugar:
McLean, VA | 08/01/2026 18:01:57 PM | Salario: S/. No Especificado | Empresa:
Kentro objects (namespaces/cells, Deployments/Jobs, Services, policies, gateways) Integrate network policy (eBPF/Cilium), secrets... design/versioning Observability: Prometheus, OpenTelemetry, Fluent Bit/OpenSearch;incident response and performance tuning...
Lugar:
Beavercreek, OH | 08/01/2026 18:01:15 PM | Salario: S/. No Especificado | Empresa:
KBRservices;Kubernetes/EKS, serverless;Kafka/Kinesis;lakehouse (Delta/Iceberg);vector DBs;feature stores;model registries... (MLflow/SageMaker/Databricks);orchestration (Airflow/Ray);observability (OpenTelemetry/Prometheus). LLM-specific tooling...
requirements and to define, plan, and implement requisite solutions. Experience using tools such as Prometheus, Nagios.../services. Experience with Site Reliability Engineering for Kubernetes infrastructure and application deployments. Security...