About

This class is STAT 5526. This series is about scaling your compute. The point being, sooner or later, you will find a data set too large to compute on your local computer/laptop. At that point, you have three options:

  1. Buy a larger computer
  2. Scale via Cloud
  3. Scale via HPC

I would argue these last two platforms/methods are merging. Here, we discuss computing on the high performance computing (HPC) clusters managed by Advanced Research Compting at Virginia Tech. Broad topic areas are listed below:

Topics

  • HPC
    • Cluster organization and access (Lecture 1A)
    • Interacting with a scheduler (Lecture 1B)
    • Software (Lecture 1B)
    • Storage (Lecture 1B)
  • Containers (Lecture 1C)
  • R (Lecture 1C)
  • Parallelization (Lecture 1D)
    • MKL/OpenBLAS vectorization
    • OpenMP
    • MPI
  • Monte Carlo (Lecture 2A)
  • Neural Networks (Lecture 2B)
  • R and Keras (Lecture 2C)
  • Homework