Learning how to use HPC infrastructure (part II)
Thursday, 20 October 2022 -
09:00
Monday, 17 October 2022
Tuesday, 18 October 2022
Wednesday, 19 October 2022
Thursday, 20 October 2022
09:00
Choosing and activating software with system modules on CECI clusters
-
Bernard Van Renterghem
(
UCL CISM
)
Choosing and activating software with system modules on CECI clusters
Bernard Van Renterghem
(
UCL CISM
)
09:00 - 10:30
Room: Maxwell/Shannon (first floor)
<table border="0" cellpadding="10px"> <tbody> <tr> <td colspan="2"> <p>Software installed on the clusters is organised and managed with environment modules that allow choosing a specific version of a software package compiled with a given compiler, linked to chosen libraries, etc. This session explains how modules are used on the clusters.</p> </td> </tr> <tr> <td rowspan="2"> <p><strong>Contents:</strong></p> <ul> <li>The installed software</li> <li>The modules command</li> <li>What is Easybuild</li> <li>What are the different toolchains</li> <li>how to install software by yourself</li> </ul> </td> <td> <p><strong>Prerequisite:</strong></p> <ul> <li>Being able to use SSH with private keys </li> <li>Being familiar with a text editor </li> <li>Mastering the Linux command line and the GNU utilities (mkdir, cp, scp, etc.)</li> </ul> </td> </tr> <tr> <td> <p><strong>Type</strong>: Hands-on<br /> <strong>Target audience</strong>: Everyone<br /> <strong>Must: </strong>This session is a must for anyone.</p> </td> </tr> </tbody> </table>
10:45
Writing and editing text files with Vim
-
Jérôme de Favereau
(
UCLouvain/IRMP/CP3
)
Writing and editing text files with Vim
Jérôme de Favereau
(
UCLouvain/IRMP/CP3
)
10:45 - 12:15
Room: Maxwell/Shannon (first floor)
<table border="0" cellpadding="10px"> <tbody> <tr> <td colspan="2"> <p>Vim is a very powerful text editor installed on all Unix systems, including Mac OSX, and used by many programs as default text editor. Knowing the basics is crucial on HPC. Mastering it will dramatically speed up tasks like the edition of submition scripts, configuration files, code, ... </p> </td> </tr> <tr> <td rowspan="2"> <p><strong>Contents:</strong></p> <ul> <li> Why use VIM on user interfaces ? </li> <li> VIM modes </li> <li> Movement and action commands</li> <li> Macros </li> <li> Plugins </li> </ul> </td> <td> <p><strong>Prerequisite:</strong></p> <ul> <li>Being able to use SSH with private keys </li> </ul> </td> </tr> <tr> <td> <p><strong>Type</strong>: Hands-on<br /> <strong>Target audience</strong>: Everyone<br /> <strong>Must: </strong>This session is a must for anyone.</p> </td> </tr> </tbody> </table>
13:00
Preparing, submitting and managing jobs with Slurm
-
Damien François
(
UCLouvain/CISM
)
Preparing, submitting and managing jobs with Slurm
Damien François
(
UCLouvain/CISM
)
13:00 - 15:30
Room: Maxwell/Shannon (first floor)
<p>Slurm is the job manager installed on all CÉCI clusters. The session teaches attendees how to prepare a submission script, how to submit, monitor, and manage jobs on the clusters.</p> <p> </p> <table border="0" cellpadding="10px"> <tbody> <tr> <td rowspan="2"> <p><strong>Contents:</strong></p> <ul> <li>Role and duties of a job scheduler/resource manager </li> <li>Creating and submitting a job </li> <li>Setting job constraints and parameters </li> <li>Managing and monitoring jobs </li> <li>Working interactively </li> <li>Getting accounting information for the jobs </li> <li>How priorities are computed </li> <li>Creating parallel jobs with shared-memory software</li> <li>Creating parallel jobs with message passing software</li> <li>Creating parallel jobs with master/slave software </li> </ul> </td> <td> <p><strong>Prerequisite:</strong></p> <ul> <li>Being able to use SSH with private keys </li> <li>Being familiar with a text editor </li> <li>Passive knowledge of parallelisation techniques (OpenMP, MPI)</li> <li>Mastering the Linux command line and the GNU utilities (mkdir, cp, scp, etc.)</li> </ul> </td> </tr> <tr> <td> <p><strong>Type:</strong> Hands-on<br /> <strong>Target audience</strong>: Everyone<br /> <strong>Must: </strong>This session is mandatory.</p> </td> </tr> </tbody> </table>
15:40
Using a Checkpoint/restart program to overcome time limits
-
Olivier Mattelaer
(
UCLouvain/CISM
)
Using a Checkpoint/restart program to overcome time limits
Olivier Mattelaer
(
UCLouvain/CISM
)
15:40 - 16:25
Room: Maxwell/Shannon (first floor)
<table border="0" cellpadding="10px"> <tbody> <tr> <td colspan="2"> <p>Checkpointing and Restarting, or the art of stopping some computations to continue them later, or on another computer, is a very convenient way to get past time limits set on the clusters, and to protect against hardware or software failure on the compute nodes. </p> </td> </tr> <tr> <td rowspan="2"> <p><strong>Contents:</strong></p> <ul> <li>Use and challenges of checkpointing</li> <li>The different approaches</li> <li>Checkpointing in Slurm</li> <li>Using DMTCP for checkpointing</li> </ul> </td> <td> <p><strong>Prerequisite:</strong></p> <ul> <li>Being able to use SSH with private keys </li> <li>Being familiar with a text editor </li> <li>Mastering the Linux command line and the GNU utilities (mkdir, cp, scp, etc.)</li> <li>Passive knowledge of either C, Fortran, Octave, Python or R</li> </ul> </td> </tr> <tr> <td> <p><strong>Type:</strong> Hands-on<br /> <strong>Target audience</strong>: Everyone<br /> <strong>Must: </strong>This session is a must-have for anyone feeling oppressed by time limits.</p> </td> </tr> </tbody> </table>