Sample Batch Scripts for Running Jobs on the Eagle System#

For a walkthrough of the elements of an sbatch script, please see Submitting Batch Jobs.

Many more examples of sbatch scripts are available in the HPC Repository Slurm Directory on Github. You may also check the individual Systems pages for details related to the cluster you're working on.

Sample batch script for a serial job in the debug queue

#!/bin/bash 
#SBATCH --ntasks=4 # Tasks to be run 
#SBATCH --nodes=1  # Run the tasks on the same node 
#SBATCH --time=5   # Required, estimate 5 minutes 
#SBATCH --account=<project_handle> # Required 
#SBATCH --partition=debug 

cd /scratch/$USER 

srun $HOME/hpcapp -options

Sample serial batch script with GPU and memory request

#!/bin/bash
#SBATCH --nodes=2          # Use 2 nodes
#SBATCH --time 00:20:00    # Set a 20 minute time limit
#SBATCH --ntasks 2         # Maximum CPU cores for job 
#SBATCH --gres=gpu:2       # GPU request 
#SBATCH --mem=184000       # Standard partition (192GB nodes) 

cd /scratch/$USER 
srun my_graphics_intensive_scripting

Sample batch script for a serial job in default (standard) queue

#!/bin/bash 
#SBATCH --partition=standard       # Name of Partition 
#SBATCH --ntasks=12                # CPU cores requested for job 
#SBATCH --nodes=1                  # Keeep all cores on the same node
#SBATCH --time=02-00:00:00         # Job should run for up to 2 days (for example) 

cd /scratch/<userid>/mydir

srun hpcapp -options /home/hpcuser/app/parameters  # use your application's commands

For best scheduling functionality, it is not recommended to select a partition.

Sample batch script to utilize Local Disk (/tmp/scratch)

#!/bin/bash 
#SBATCH --ntasks=36                # CPU cores requested for job 
#SBATCH --nodes=1                  # Keeep all cores on the same node 
#SBATCH --time=01-00               # Job should run for up to 1 day (for example) 
#SBATCH --tmp=20TB                 # Request minimum 20TB local disk 

export TMPDIR=$LOCAL_SCRATCH 
cp /scratch/<userid>/myfiles* $TMPDIR 

srun ./my_parallel_readwrite_program -input-options $TMPDIR/myfiles  # use your application's commands

If you or your application has a need for large local disk, please use /tmp/scratch. In the example above, environment variable $LOCAL_SCRATCH can be used in place of the size limited /tmp.

Sample batch script for an MPI job

Eagle MPI (intel-mpi, hpe-mpi): 

#!/bin/bash 
#SBATCH --nodes=4                   # Number of nodes 
#SBATCH --ntasks=100                # Request 100 CPU cores 
#SBATCH --time=06:00:00             # Job should run for up to 6 hours 
#SBATCH --account=<project handle>  # Where to charge NREL Hours 

module purge
module load mpi/intelmpi/18.0.3.222 
srun ./compiled_mpi_binary          # srun will infer which mpirun to use

For best scheduling functionality, it is not recommended to select a partition.

Sample batch script for high-priority job

#!/bin/sh
#SBATCH --job-name=job_monitor
#SBATCH -A <account>
#SBATCH --time=00:05:00
#SBATCH --qos=high
#SBATCH --ntasks=2
#SBATCH -N 2 
#SBATCH --output=job_monitor.out 
#SBATCH --exclusive

srun ./my_job_monitoring.sh