Description Do you want to be part of AI revolution? At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to AI hardware and software infrastructure. In order to deliver on that vision, we...
Description The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at ...
Description The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainiu...
Description The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainiu...
Description Lead multi-person projects end-to-end - from design documentation and architecture reviews through to delivery Design container platform integrations - device plugins, DRA drivers, and operator development for ML accelerator ...
Description AWS's Trainium and Inferentia chips power the world's largest machine learning clusters. Our team builds virtual platforms - full-system C++ and SystemC models of these custom SoCs - that let software teams start development m...
Description AWS designs some of the most complex custom SoCs in the world - Trainium chips that power massive machine learning training clusters. Our team builds models of these SoCs that are used across the chip development lifecycle: ar...
Description The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at ...
Description The AWS Neuron Compiler team is actively seeking skilled compiler engineers to join our efforts in developing a state-of-the-art deep learning compiler stack. This stack is designed to optimize application models across divers...
Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most...