Source repo: ciml-summer-institute-2024 | Branch:
main| Last synced: 2026-04-24 10:27:17.425 UTC
Session 3.3 Practical Guidelines for Training Deep Learning on HPC
Date: Wednesday, June 26, 2024
Summary: Guildelines on running deep networks on Expanse, such as using tensorboard, notebooks, and batch jobs; also some discussion of multinode execution.
Presented by: Paul Rodriguez (p4rodriguez at ucsd.edu)
Reading and Presentations:
- Presentation slides: https://github.com/ciml-org/ciml-summer-institute-2024/blob/main/3.3_practical_guidelines_for_training_deep_learning_on_hpc/C24_PracticalGuilde_Multinode_v4.pdf