Python Data Analysis Library

SDSC Expanse Notebook: Python_Data_Analysis_Library

This README file provides instructions for Expanse users to run the Python_Data_Analysis_Library notebook on Expanse CPU nodes. pandas is a fast, powerful, flexible, and easy-to-use open source data analysis and manipulation tool built on top of the Python programming language.

This notebook gives you an introduction to pandas. Enjoy!

List of Contents

Import Module:

  • Image
  • pandas
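As a rough illustration of what working with pandas on CSV data looks like, here is a minimal sketch. It is not taken from the PandasCSV notebook: the sample data, column names, and variable names are all hypothetical, and an in-memory string stands in for a CSV file on disk.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for a CSV file on disk.
CSV_TEXT = """name,cores,memory
node-a,2,4
node-b,4,8
node-c,8,12
"""

# read_csv accepts a file path or a file-like object; StringIO keeps
# this example self-contained.
df = pd.read_csv(io.StringIO(CSV_TEXT))

print(df.head())    # first rows of the table
print(df.dtypes)    # column types inferred by pandas

# Simple column aggregations.
total_cores = int(df["cores"].sum())
mean_memory = float(df["memory"].mean())

# Boolean indexing: keep only the rows with at least 4 cores.
large = df[df["cores"] >= 4]
print(large)
```

To read a real file, pass its path to `pd.read_csv` in place of the `StringIO` object.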

Launch Galyleo

For specific information about launching Galyleo, please refer to this GitHub repository.

Environment Modules

The --env-modules option loads software modules installed on Expanse into the Jupyter session. For instance, the following command loads the CPU module and Anaconda3.

  • CPU: --env-modules cpu/0.17.3b,anaconda3/2021.05
    galyleo launch --account abc123 --partition shared --cpus 2 --memory 4 --time-limit 00:30:00 --env-modules cpu/0.17.3b,anaconda3/2021.05
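If you launch sessions often with different resources, the command can be assembled programmatically. The sketch below is only a convenience wrapper: `build_galyleo_command` is a hypothetical helper, not part of Galyleo. It uses only the flags and values shown in the command above, and `abc123` remains a placeholder account name.

```python
import shlex


def build_galyleo_command(account, partition="shared", cpus=2, memory=4,
                          time_limit="00:30:00",
                          env_modules=("cpu/0.17.3b", "anaconda3/2021.05")):
    """Return the galyleo launch command as one shell-safe string.

    Hypothetical helper: defaults mirror the example command in this README.
    """
    args = [
        "galyleo", "launch",
        "--account", account,
        "--partition", partition,
        "--cpus", str(cpus),
        "--memory", str(memory),
        "--time-limit", time_limit,
        # Modules are passed as a single comma-separated value.
        "--env-modules", ",".join(env_modules),
    ]
    return shlex.join(args)


command = build_galyleo_command("abc123")
print(command)
```

The function only builds the string; paste the printed command into an Expanse login shell to launch the session.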
    

Install Modules

To run the PandasCSV notebook, no additional packages need to be installed.
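To confirm from inside the Jupyter session that the loaded environment already provides what the notebook needs, a quick check along these lines may help. `is_installed` is a hypothetical helper name, and the check assumes pandas is the only third-party package of interest.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be found on the import path."""
    return importlib.util.find_spec(module_name) is not None


# Report whether each required package is importable in this session.
for name in ("pandas",):
    status = "available" if is_installed(name) else "missing"
    print(f"{name}: {status}")
```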

Location

Python_Data_Analysis_Library
├── PandasCSV.ipynb
└── README.md

Submit Ticket

This notebook was last tested on 3/31/25. If you find anything that needs to be changed, or if you would like to provide feedback or contribute to the notebook, please submit a ticket by contacting us at:

Email: consult@sdsc.edu

We appreciate your input and will review your suggestions promptly!
