) along with monitoring tools (e.g., Prometheus and Grafana); monitor message queues and services to ensure optimal operations and identify... in a DoD, federal, or large‑scale enterprise environment. Experience with SIEM/SOAR platforms (Splunk, Elastic, Azure Sentinel...
Overview: AMERICAN SYSTEMS is an employee-owned federal government contractor supporting national priority programs..., CIS Benchmarks). Manage system services, networking, access controls, logging, and system monitoring on Linux platforms...
Connected Vehicle Services) are leading a new era of audio entertainment and services by delivering the most compelling... deployment, operation, and refinement, focusing on reliability and scalability. Monitor and Maintain Services: Ensure live...
to encompass multiple interdependent services both on premises and within SaaS or cloud infrastructure and effectively design..., team-oriented environment Experience with Observability tools such as Datadog, Dynatrace, Prometheus, Grafana, Splunk...
foundation for the customer's AI capabilities, focusing on inference services while supporting the broader ecosystem... inference at scale. Support the development and maintenance of production AI services and applications, including retrieval...
, security, governance, and model risk management across ML services. Lead design and implementation of models across classical... evaluation frameworks. Observability: Prometheus/Grafana, OpenTelemetry; SLO-driven operations and incident management. Model...
Overview: AMERICAN SYSTEMS is an employee-owned federal government contractor supporting national priority programs..., CIS Benchmarks). Manage system services, networking, access controls, logging, and system monitoring on Linux platforms...
, monitor, and continuously improve SLAs, SLOs, and SLIs across critical services. Develop and maintain robust observability... tooling including logging, metrics, and tracing (e.g., Azure Monitor, OpenTelemetry, Prometheus). Proactively conduct...
, web services, application observability and/or messaging/stream architecture 5+ years of IT full-stack engineering...: Instrumenting apps with Prometheus/Grafana, and creating effective alarms and dashboards Log indexing tools (e.g. ELK stack...