Principal HPC Architect
. Operations & Reliability: Maintain and monitor HPC clusters, job schedulers (Slurm, PBS, LSF), and distributed filesystems...
. Operations & Reliability: Maintain and monitor HPC clusters, job schedulers (Slurm, PBS, LSF), and distributed filesystems...
Engineering toolchain comfort — Linux, GIT, LSF — you live in these environments and don't slow down because of them Leadership...
., SLURM, PBS, LSF) and parallel file systems (Lustre, GPFS/Spectrum Scale). Experience implementing and managing automation...
of HPC or large-scale distributed computing systems. Expertise with batch schedulers (SLURM, PBS, LSF) and parallel file...
Background with distributed schedulers and shared compute environments such as LSF, Slurm, Grid Engine, or similar systems...
maintenance;two years rotary or fixed wing aircraft inspector under the Army Maintenance Management System (LSF only) APSD...
of HPC or large-scale distributed computing systems. Expertise with batch schedulers (SLURM, PBS, LSF) and parallel file...
., SLURM, PBS, LSF) and parallel file systems (Lustre, GPFS/Spectrum Scale). Experience implementing and managing automation...
(Ray, Slurm, LSF, or similar) Experience with bare-metal provisioning and lifecycle management at datacenter scale...
, Engineering, or Physics Nice If You Have: Experience with HPC job schedulers such as Slurm, PBS, and LSF, and submitting batch...