Data Science: Data Processing with Python Pandas
Key Information
Tutor: Dr John Pinney
Course Level: Level 2
Course Credit: 1 credit
Prerequisites: Familiarity with Python is necessary
Course Duration: 2 x 2.5-hour sessions
Format: Microsoft Teams with live teaching and hands-on practice
Course Resources
Pandas is a popular software library for Python which includes a number of useful data processing tools. It is free and relatively easy to learn, with an active community that provides support and documentation. It can handle larger and more complex datasets than Excel and can produce high-quality figures.
This course introduces the basics of data processing using the Pandas library. You will learn the basic commands of the package through hands-on examples in the Jupyter Notebook environment.
Syllabus:
- Why you might choose to use Pandas for data processing
- Selecting rows of data
- Selecting columns of data
- Cutting and joining data
- Reading and writing data
- Producing graphs from the data
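As a rough illustration of the syllabus topics above, the following minimal sketch selects rows and columns, splits and rejoins a table, and writes and re-reads the data as CSV. The data, column names and variable names are invented for illustration and are not taken from the course materials; plotting is left out to keep the example free of extra dependencies.

```python
import io

import pandas as pd

# Hypothetical example data, invented for this sketch
df = pd.DataFrame({
    "sample": ["a", "b", "c", "d"],
    "group": ["control", "treated", "control", "treated"],
    "value": [1.5, 2.0, 3.5, 4.0],
})

# Selecting rows: keep only the rows where a condition is true
treated = df[df["group"] == "treated"]

# Selecting columns: pass a list of column names
values = df[["sample", "value"]]

# Cutting and joining: split the table by position, then stack the pieces
top = df.iloc[:2]
bottom = df.iloc[2:]
rejoined = pd.concat([top, bottom], ignore_index=True)

# Writing and reading: round-trip through CSV using an in-memory buffer
# (a file path such as "data.csv" could be used instead)
buffer = io.StringIO()
df.to_csv(buffer, index=False)
buffer.seek(0)
reloaded = pd.read_csv(buffer)
```

In a Jupyter notebook, typing a variable name such as `treated` on the last line of a cell displays the resulting table.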
Learning Outcomes:
On completion of this workshop you will be able to:
- Use Jupyter notebooks to perform simple Pandas data analysis
- Apply fundamental components of Pandas syntax including data selection and grouping
- Create programs designed to process example data and display simple statistics
- Interpret common errors and use these to help debug a program
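The kind of skills listed in these outcomes might look like the sketch below, which groups some data, computes simple statistics and reads a common error message. The table and its column names are hypothetical, chosen only to make the example self-contained.

```python
import pandas as pd

# Hypothetical example data, invented for this sketch
df = pd.DataFrame({
    "group": ["control", "treated", "control", "treated"],
    "value": [1.5, 2.0, 3.5, 4.0],
})

# Grouping: mean of "value" within each group
group_means = df.groupby("group")["value"].mean()

# Simple statistics: count, mean, std, min, quartiles and max in one call
overall = df["value"].describe()

# Interpreting a common error: column names are case-sensitive,
# so asking for "Value" instead of "value" raises a KeyError
try:
    df["Value"]
except KeyError as err:
    message = f"Column not found: {err}"
```

The `KeyError` names the missing label, which is usually enough to spot a misspelled or wrongly capitalised column name.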
Dates & Booking Information
Date | Time | Platform/Venue |
---|---|---|
Monday 13 June 2022 (Part 1) | 10:00-12:30 | Microsoft Teams |
Wednesday 15 June 2022 (Part 2) | 10:00-12:30 | Microsoft Teams |
Please select a date and book via Inkpath using your Imperial Single Sign-On.