Summer School 2024: 18 – 24 August 2024, save the date!

News from Tuesday 5 March 2024
This year's Summer School is scheduled to take place from 18 to 24 August in Ostrava, Czech Republic. Hosted by IT4Innovations, the focus of the event will be HPC in Data Science.

EUMaster4HPC Summer School 2024 will take place in Ostrava, Czech Republic from 18 to 24 August 2024. The Summer School is organised in collaboration with IT4Innovations National Supercomputing Center, which is part of the VSB – Technical University of Ostrava, TU Wien (TUW), and MathWorks. These entities will contribute expertise and resources to enrich the experience for all participants.

 

 

Registration

Registration is open now until 31 May 2024.

To register for the event, please use the registration link based on your affiliation. The participation conditions and respective fees are below.

 

Overview

The Summer School will offer students practical insights into HPC and data analytics, addressing the growing demand for expertise in these areas. Participants will gain competencies in various tools and techniques, empowering them to tackle real-world challenges effectively.

In this summer school, students will learn how to prepare data and understand its characteristics to create meaningful machine learning models on an HPC architecture. For this purpose, we will use open-source programming languages, such as R and Python. The supercomputing infrastructure of IT4Innovations will be used for hands-on exercises, which will be an integral part of all lectures. At the end of the summer school, the student will gain competencies in the following:

  • Using a Linux-based HPC environment.
  • Understanding the theoretical background of exploratory data analysis and modeling.
  • Scale data analysis for Big Data in R and Python.
  • Creating basic Machine and Deep Learning models in R and Python.
  • Deciding whether to use Machine or Deep Learning methods.
  • Building data processing pipelines for Machine or Deep Learning tasks.
  • Knowing how to set up and run data analysis in parallel on an HPC cluster with R and Python.
  • Parallelization of Machine and Deep Learning tasks to use multiple compute nodes and/or multiple accelerators (GPUs).
  • Using MATLAB tools for HPC and data analysis.

 

Detailed agenda

To foster community spirit, we have planned several social events throughout the programme, including a Welcome Party, a visit to the Planetarium, and a day trip to the picturesque mountains of Beskydy.

Saturday 17 August: Arrival of students and guests and check-in to the hotel.

 

Sunday 18 August: Summer School Welcome event at IT4Innovations.

12:00 - 13:30   Lunch

13:30 - 14:00   Introduction of the school programme, practical information

14:00 - 15:00   Introduction of the organisers

  • IT4I
  • VSC
  • MathWorks
  • EUMaster4HPC

15:00 - 15:30.  Coffee Break

15:30 - 16:30   Guided tours around IT4I’s Infrastructure

16:30 - 18:00   Teambuilding activities

18:00 - 21:00   Welcome reception

 

Monday 19 August

9:00     Accessing and using IT4I clusters

  • First login.
  • How to get your data to the cluster.
  • How to log in to the cluster and prepare a computation environment.
  • How to submit computational jobs.

10:30   Coffee Break

11:00   Introduction to Data Science

12:30   Lunch Break

13:30   Coding Challenge Part 1

15:00   Coffee Break

15:30   Coding Challenge Part 2

 

Tuesday 20 August

9:00     Introduction to R

  • What is R and when to consider using it?
  • Basic data types
  • Programming styles in R
  • Very short introduction into tidyverse universe

10:30   Coffee Break

10:45   Exploratory Data Analysis with R

  • How to get basic understanding of data
  • Explore and handle missing values and outliers
  • Clean up messy data
  • Visualisation of basic relationships

12:15   Lunch Break

13:15   Modelling with R

  • Introduction to modelling with tidy models packages
  • Creation of basic ML pipeline
  • An end-to-end example with XGBoost

15:00   Coffee Break

15:15   Parallelisation in R

  • Local machine parallelisation
  • Differences of parallelisation on Windows and UNIX OS
  • Multi-node parallelisation
  • Simple multi-node example in data science workflow

 

Wednesday 21 August

09:00   Challenge Reports: 1st cohort of EUMaster4HPC students

10:30   Coffee break

10:45   Dask, Numba, Ray: Parallelise the lazy way

11:30   Fast, faster, NumPy: Why is the popular library hard to beat?

12:00   Numerical computations on a GPU: Which tool does the best job?

12:30   Lunch break

13:30   Data analysis in Python: Pandas, Polars and the rest of the zoo.

15:00   Coffee break

15:15   Data visualisation: Insightful and pretty?!

16:30   Quiz & Recap

17:30   Leaving from the hotel to the Planetarium

18:00   Social event at planetarium

 

Thursday 22 August

09:00   ML intro: Welcome to weight watching.

09:30   Scikit-Learn: Get to know a living fossil.

09:45   Regression vs Classification: What’s your problem?

10:00   Data pre-processing: Visualise, clean, transform.

10:30   Coffee break

10:45   Prominent ML algorithms: SVMs, Decision Trees, K-nearest neighbors & ensemble methods.

11:00   Evaluation: Which model performed best?

11:30   Hyperparameters: Twiddle the knobs and dials.

12:00   Scaling Scikit-Learn: Dask and RAPIDS to the rescue.

12:30   Lunch break

13:30   Neural Networks: Dive in at the deep (learning) end.

14:15   Tensorflow & Keras: The easy way to become an architect.

15:00   Coffee break

15:15   Convolutional Neural Networks: Give your computer a vision.

16:15   Distributed Training: Sharing the burden.

16:45   Outlook on Transformers: Welcome to the future.

 

Friday 23 August

9:00     Data Analysis with MATLAB

  • Reading in data into MATLAB (including data from cloud, big data, datastores)
  • Datatypes in MATLAB
  • Data visualization in MATLAB (low code apps)
  • Interoperability between MATLAB and Pandas

10:30   Coffee Break

10:45   Harnessing AI with MATLAB

  • Machine learning apps, including code generation in MATLAB
  • Deep learning in MATLAB - model explorer
  • Importing and exporting models from MATLAB

12:15   Lunch Break

13:15   Speeding up your MATLAB code

  • User errors and effects on speed
  • Code Profiler and best practices
  • Parallelizing MATLAB code

15:00   Coffee Break

15:15   From coding to cluster – scaling up MATLAB on HPC

  • Sending jobs to a remote HPC cluster from the MATLAB environment
  • Training AI model on a GPU without learning CUDA
  • NEW! MATLAB and Quantum Computing

Saturday 24 August: Trip to Pustevny

 

Conditions and Fees

Eligible EUMaster4HPC programme students need to cover their travel costs to Ostrava and back. Their accommodation and attendance at the summer school are arranged and fully covered directly by the organisers. It includes attendance at the lessons, access to the HPC infrastructure, catering during the lessons (coffee breaks and lunches), two social events, and a field trip to Pustevny.

External students must cover all expenses related to travel, accommodation, and attendance at the summer school. In case of a higher demand than available spots for external students, the applications will be reviewed, and selected students will be informed by the end of June about their acceptance.

The attendance fee is 250 €

It includes attendance at the lessons, access to the HPC infrastructure, catering during the lessons (coffee breaks and lunches), two social events, and a field trip to Pustevny.

External students need to arrange for their travel to Ostrava.

Organizers can help book accommodation at the same hotel as the EUMaster4HPC students and tutors, but the external students must pay for their stay. The prices vary based on occupancy and room standards and will be communicated to accepted students.

Venue and Travel

Summer School 2024 will be held in Ostrava, Czech Republic. Below, you can find detailed information about the different venues, hotel, and a guide for travel to Ostrava. The venues are within walkable distance from the hotel, except for the Pustevny. For the field trip, a bus transport will be organized.

The welcome reception on Sunday will be held at IT4Innovations National Supercomputing Center, Studentská 6231/1B, Ostrava—Poruba, Czech Republic. View Map

The venue of the school is VŠB – Technical University of Ostrava, New Auditorium, 17. listopadu 15, Ostrava – Poruba. View Map

The social event venue is Planetarium Ostrava, K Planetáriu 502, 725 26 Ostrava 26. View Map

The field trip is organized to Pustevny in the Beskydy Mountains. View Map

The accommodation for EUMaster4HPC students and tutors is booked in Hotel Garni at the university dormitories. View Map

Travel guide including options of flights to Ostrava is available here.

Any questions?

If you have any questions, please do not hesitate to contact us at training@it4i.cz

Access and location