Skip to main content

4.3b Scalable Machine Learning

Source repo: sdsc-summer-institute-2024 | Branch: main | Last synced: 2026-04-24 10:27:17.425 UTC

SDSC Summer Institute 2024

Session 4.3b Scalable Machine Learning

Date: Wednesday, August 7, 2024

Summary: Machine learning is an integral part of knowledge discovery in a wide variety of applications. From scientific domains to social media analytics, the data that needs to be analyzed has become massive and complex.

This session introduces approaches that can be used to perform machine learning at scale. Tools and procedures for executing machine learning techniques on HPC will be presented. Spark will also be covered for scalable data analytics and machine learning. Please note: Knowledge of fundamental machine learning algorithms and techniques is required.

Presented by: Mai Nguyen (mhnguyen @ucsd.edu) and Paul Rodriguez (p4rodriguez @ucsd.edu)

Reading and Presentations:

  • Lecture material:
    • Presentation Slides: will be made available closer to the session
  • Source Code/Examples: N/A

TASKS: None at this time.