Geoscientific data analysis with Python

This course is designed to introduce students to (1) programming with the Python language and (2) common data analysis methods in Python such as:

  • Reading in, cleaning, and basic plotting of spreadsheet data from Excel, CSV files, etc.

  • Batch processing of data files

  • Interpolation of values in 1D and 2D using SciPy

  • Contour plotting in 2D and 3D

  • Fitting lines to scatter plot data and calculating goodness-of-fit statistics

  • Best practices for data visualisation (making accessible plots for colourblind viewers, etc.)

The first part of the course is an introduction to programming in Python, taken as a semi-independent study of the existing Geo-Python course materials, with weekly meetings with a course assistant who can answer questions and provide learning support. You only need to complete this part if you do not already have basic Python programming skills, including the use of libraries such as pandas and matplotlib.

In the second part of the course, we will hold weekly meetings to discuss topics from the bulleted list above, informally and according to the needs of the group. As far as possible, we will work with the kinds of datasets you already use and discuss how your existing data workflows can be migrated to Python to make your work more efficient and reproducible.