Difference between revisions of "Slurm"
Jump to navigation
Jump to search
(Moved links to a References section (new), moved commands to Common Commands (updated title) section, deleted Slurm HELP section) |
(Added intro) |
||
Line 1: | Line 1: | ||
− | + | Some of the CAC's [[Private Clusters]] are managed with [https://github.com/openhpc/ohpc/wiki OpenHPC], which includes the [https://slurm.schedmd.com/ Slurm Workload Manager] (Slurm for short). Slurm (Simple Linux Utility for Resource Management) is a group of utilities used for managing workloads on compute clusters. | |
+ | |||
+ | This page is intended to give users an overview of Slurm. Some of the information on this page has been adapted from the [https://cvw.cac.cornell.edu/ Cornell Virtual Workshop] topics on the [https://cvw.cac.cornell.edu/Environment Stampede2 Environment] and [https://cvw.cac.cornell.edu/slurm/ Advanced Slurm]. For a more in-depth tutorial, please review these topics directly. | ||
__TOC__ | __TOC__ | ||
Line 23: | Line 25: | ||
== References == | == References == | ||
+ | :* [https://slurm.schedmd.com/ Slurm Docs] | ||
:* [https://slurm.schedmd.com/quickstart.html Quick Start User Guide] - this page lists all of the available Slurm commands | :* [https://slurm.schedmd.com/quickstart.html Quick Start User Guide] - this page lists all of the available Slurm commands | ||
:* [https://slurm.schedmd.com/pdfs/summary.pdf Command/Option Summary] (two page PDF) | :* [https://slurm.schedmd.com/pdfs/summary.pdf Command/Option Summary] (two page PDF) |
Revision as of 17:16, 31 May 2019
Some of the CAC's Private Clusters are managed with OpenHPC, which includes the Slurm Workload Manager (Slurm for short). Slurm (Simple Linux Utility for Resource Management) is a group of utilities used for managing workloads on compute clusters.
This page is intended to give users an overview of Slurm. Some of the information on this page has been adapted from the Cornell Virtual Workshop topics on the Stampede2 Environment and Advanced Slurm. For a more in-depth tutorial, please review these topics directly.
Common Commands
A few Slurm commands to initially get familiar with:
scontrol show nodes scontrol show partition
Submit a job: sbatch testjob.sh
Interactive Job: srun -p short --pty /bin/bash
scontrol show job [job id] scancel [job id] sinfo -l
References
- Slurm Docs
- Quick Start User Guide - this page lists all of the available Slurm commands
- Command/Option Summary (two page PDF)
- Frequently Asked Questions includes FAQs for Management, Users and Administrators
- Convenient SLURM Commands has examples for getting information on jobs and controlling jobs
- sbatch - used to submit a job script for later execution. The script will typically contain one or more srun commands to launch parallel tasks.