Data Science: Data Processing with Python Pandas

Key Information

Tutor: Dr John Pinney
Course Level: Level 2
Course Credit: 1 credit
Prerequisites: Familiarity with Python is necessary
Course Duration: 2 x 2.5-hour sessions
Format: Microsoft Teams with live teaching and hands-on practice

Course Resources

Pre-Course setup & Materials

Pandas is a popular software library for Python that includes a range of useful data processing tools. It is free and relatively easy to learn, and its community provides support and documentation. For many data processing tasks it is more powerful than Excel, and it can produce high-quality figures.

This course teaches the beginnings of data processing using the Pandas library. You will learn the basic commands of the package with hands-on examples using the Jupyter Notebook environment.

Syllabus:

Why you might choose to use Pandas for data processing
Selecting rows of data
Selecting columns of data
Cutting and joining data
Reading and writing data
Producing graphs from the data
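
To give a flavour of the syllabus topics above, here is a minimal sketch of reading, selecting, joining, grouping and writing data with Pandas. It is not taken from the course materials: the data, the column names (`sample`, `group`, `value`, `dose_mg`) and the variable names are all hypothetical and chosen only for illustration.

```python
import io

import pandas as pd

# Hypothetical example data, made up for this sketch.
csv_text = """sample,group,value
a,control,1.0
b,control,3.0
c,treated,5.0
d,treated,7.0
"""

# Reading data: read_csv accepts a file path or a file-like object.
df = pd.read_csv(io.StringIO(csv_text))

# Selecting columns: one column gives a Series, a list of names gives a DataFrame.
values = df["value"]
subset = df[["sample", "value"]]

# Selecting rows: keep only the rows where a condition is true.
treated = df[df["group"] == "treated"]

# Cutting and joining: slice the table by position, then stack the pieces again.
top = df.iloc[:2]
bottom = df.iloc[2:]
rejoined = pd.concat([top, bottom])

# Joining on a shared column: attach a (hypothetical) lookup table to each row.
labels = pd.DataFrame({"group": ["control", "treated"], "dose_mg": [0, 10]})
merged = df.merge(labels, on="group", how="left")

# Grouping and simple statistics: mean of "value" within each group.
group_means = df.groupby("group")["value"].mean()

# Writing data: to_csv accepts a file path or a file-like object.
out = io.StringIO()
merged.to_csv(out, index=False)
```

Graphs are left out of the sketch to keep it free of extra dependencies. Assuming Matplotlib is installed, a call such as `group_means.plot(kind="bar")` would be one way to draw the grouped means.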

Learning Outcomes:

On completion of this workshop you will be able to:

Use Jupyter notebooks to perform simple Pandas data analysis
Apply fundamental components of Pandas syntax including data selection and grouping
Create programs designed to process example data and display simple statistics
Interpret common errors and use these to help debug a program

Dates & Booking Information

Date	Time	Platform/Venue
Monday 13 June 2022 (Part 1)	10:00-12:30	Microsoft Teams
Wednesday 15 June 2022 (Part 2)	10:00-12:30	Microsoft Teams

Please select a date and book via Inkpath using your Imperial Single Sign-On.

Imperial College London
