2018
Using a Checkpoint/restart program to overcome time limits
by
→
Europe/Brussels
DAO (Vinci building)
DAO
Vinci building
Vinci building, room A-182, Bâtiment Vinci, Place du Levant 1. More info on http://www.ceci-hpc.be/training.html#practicalinfo
Description
|
Checkpointing and Restarting, or the art of stopping some computations to continue them later, or on another computer, is a very convenient way to get past time limits set on the clusters, and to protect against hardware or software failure on the compute nodes. |
|
|
Contents:
|
Prerequisite:
|
|
Type: Hands-on |
|
Organised by
UCLouvain/CISM
Registration
Participants
Surveys
Session quality survey