Introduction to BigData tools ~~cancelled ~~

by Michael Waumans (ULB)

bibliotheque des sciences: salle pasteur (comodal (louvain-la-neuve or remote))

bibliotheque des sciences: salle pasteur

comodal (louvain-la-neuve or remote)

Join Zoom Meeting https://cern.zoom.us/j/68165517034?pwd=bllaeUZlNFlZZmh5RVYrVytudnJTQT09 Meeting ID: 681 6551 7034 Passcode: 257128 Join by SIP 68165517034@ 68165517034@ Join by H.323 Meeting ID: 681 6551 7034 Passcode: 257128

High-performance compute and BigData are two disciplines that used to be higly compartmented, but are not anymore. More and more HPC users get interested in BigData software, and more and more BigData users get interested in HPC hardware. This session will broadly present the tools and concepts associated with BigData and explain how they can be used in the CÉCI context.


  • MapReduce and Spark
  • Zeppelin and Jupyter
  • BigData on HPC


  • Being able to use SSH with private keys 
  • Being familiar with a text editor 
  • Mastering the Linux command line and the GNU utilities (mkdir, cp, scp, etc.)

Type: Hands-on
Target audience: Beginner
Must: This session is a must-have for anyone who would like to go see what exist behind the buzzword.

Organized by