Slurm

From CAC Documentation wiki
Revision as of 17:16, 31 May 2019 by Pzv2 (talk | contribs) (Added intro)
Jump to navigation Jump to search

Some of the CAC's Private Clusters are managed with OpenHPC, which includes the Slurm Workload Manager (Slurm for short). Slurm (Simple Linux Utility for Resource Management) is a group of utilities used for managing workloads on compute clusters.

This page is intended to give users an overview of Slurm. Some of the information on this page has been adapted from the Cornell Virtual Workshop topics on the Stampede2 Environment and Advanced Slurm. For a more in-depth tutorial, please review these topics directly.

Common Commands

A few Slurm commands to initially get familiar with:

scontrol show nodes
scontrol show partition

Submit a job: sbatch testjob.sh

Interactive Job: srun -p short --pty /bin/bash

scontrol show job [job id]
scancel [job id]
sinfo -l

References