Preparing, submitting and managing jobs with Slurm

19 Oct 2023, 09:30
2h 30m
Maxwell/Shannon (first floor) (Louvain-La-Neuve)

Maxwell/Shannon (first floor)

Louvain-La-Neuve

Place du Levant 3 1348 Louvain-la-Neuve Belgium

Speaker

Damien François (UCLouvain/CISM)

Description

Slurm is the job manager installed on all CÉCI clusters. The session teaches attendees how to prepare a submission script, how to submit, monitor, and manage jobs on the clusters.

 

Contents:

  • Role of a job scheduler 
  • Creating and submitting a job 
  • Setting job constraints and parameters 
  • Managing and monitoring jobs 
  • Working interactively 
  • Getting accounting information 
  • How priorities are computed 
  • Creating basic submission scripts
  • Best practice

Prerequisite:

  • Being able to use SSH with private keys 
  • Being familiar with a text editor 
  • Mastering the Linux command line and the GNU utilities

Follow-up:

  • Checkpointing to make jobs fit maximum allowed time
  • Workflows to organise jobs and experiments
  • Advanced Slurm to write more complex parallel submission scripts.

Type: Hands-on
Target audience: Everyone
Must: This session is mandatory.

Presentation materials