User Tools

Site Tools


slurm_tutorial

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
slurm_tutorial [2014/09/08 15:50] – created wsislurm_tutorial [2022/07/21 06:59] (current) – external edit 127.0.0.1
Line 1: Line 1:
-====== Using Slurm ======+====== Using SLURM ====== 
 +===== What is SLURM? ===== 
 +SLURM is an open-source resource manager designed for Linux clusters of all sizes. It provides three key functions. First it allocates exclusive and/or non-exclusive access to resources (computer nodes) to users for some duration of time so they can perform work. Second, it provides a framework for starting, executing, and monitoring work (typically a parallel job) on a set of allocated nodes. Finally, it arbitrates contention for resources by managing a queue of pending work. 
 {{:images:arch.gif|}} {{:images:arch.gif|}}
  
Line 36: Line 39:
 ===== Starting a Job with srun ===== ===== Starting a Job with srun =====
  
 +  $ srun -n4 --time=0:30 -o myjob.out ./myprogram
  
 ===== Starting a Job with sbatch ===== ===== Starting a Job with sbatch =====
 +Setting up all of your options everytime you use srun can be repetitive. Use a batch file and submit that instead!
 +
 +Example file 'mybatch':
 +  #!/bin/sh
 +  #SBATCH -n 4
 +  #SBATCH --time=0:60
 +  #SBATCH --output=mybatch.out
 +  srun ./my_program
 +
 +  $ sbatch mybatch
 +  Submitted batch job 47
  
  
Line 51: Line 65:
      
 Cancel a job with scancel, then the JobID of the program you want to cancel. You can view the jobID with squeue. Cancel a job with scancel, then the JobID of the program you want to cancel. You can view the jobID with squeue.
 +
 +===== More information =====
 +[[https://computing.llnl.gov/linux/slurm/quickstart.html|Slurm Quickstart]]
 +
  
slurm_tutorial.1410213051.txt.gz · Last modified: 2022/07/21 06:59 (external edit)