. Their infrastructure doubles in size every year. We’re looking for engineers who love getting deep into Linux systems, pushing hardware...-scale, distributed systems Occasionally assist customers in optimizing workloads Your profile Key requirements (non...
, Bloomberg, NVIDIA, Microsoft, and Salesforce - trust Grafana Labs to ensure reliability of their applications and systems.... You'll operate with one foot in the code and the other in the customer's world. Whether it's understanding container...
as code, and optimization across public and hybrid cloud platforms, helping clients improve performance, reliability... design, deployment, and optimisation of high performance AI cloud systems Perform hardware R and D testing and experiments...
organization. This role will focus on evolving our lakehouse architecture, data replication systems, and orchestration frameworks... Web Services), container orchestration (e.g., Kubernetes), and distributed systems Extensive experience with end-to-end...
, prototyping, and production implementation of Web3 systems and secure, intuitive crypto wallet experiences natively on the Android... platform. Code Craftsmanship: Maintain, advocate for, and extend a clean, highly testable, and robust codebase, utilizing...
. We'd love to talk if: You've built AI systems that run in the real world: You have experience across the full lifecycle... - from early experimentation to deployment and monitoring - and understand what it takes to make AI systems reliable in production...
and resolve issues. Performing root cause analyses independently and implementing sustainable fixes. Ensuring unit testing, code.... Strongproficiencywith Git, GitLab andGitOpsworkflows. Solid experience with Bash, Terraform, Helm, and other Infrastructure as Code tooling...
Are you excited by the opportunity to design advanced AI systems that accelerate scientific discovery and unlock... and apply modern machine learning and LLM-based approaches to build scalable, reliable systems with real user impact...
Are you excited by the opportunity to design advanced AI systems that accelerate scientific discovery and unlock... and apply modern machine learning and LLM-based approaches to build scalable, reliable systems with real user impact...
deployments (GCP and/or AWS) and infrastructure-as-code (Terraform, Pulumi, etc.). Expertise in debugging distributed systems... with POS systems, aggregators, and restaurant infrastructure, making ordering faster, more accurate, and easier to manage...