About
This class is STAT 5526. This series is about scaling your compute. The point being, sooner or later, you will find a data set too large to compute on your local computer/laptop. At that point, you have three options:
- Buy a larger computer
- Scale via Cloud
- Scale via HPC
I would argue these last two platforms/methods are merging. Here, we discuss computing on the high performance computing (HPC) clusters managed by Advanced Research Compting at Virginia Tech. Broad topic areas are listed below:
Topics
- HPC
- Cluster organization and access (Lecture 1A)
- Interacting with a scheduler (Lecture 1B)
- Software (Lecture 1B)
- Storage (Lecture 1B)
- Containers (Lecture 1C)
- R (Lecture 1C)
- Parallelization (Lecture 1D)
- MKL/OpenBLAS vectorization
- OpenMP
- MPI
- Monte Carlo (Lecture 2A)
- Neural Networks (Lecture 2B)
- R and Keras (Lecture 2C)
- Homework