Overview
What you will study
The Python Pandas for Data Manipulation course offered by University of Oxford starts by teaching you how to read and save data from and to different file formats, such as CSV files, Excel sheets, and JSON files. You will also learn how to clean up data by dealing with missing values, duplicate values, sorting based on specific columns, and replacing specific values. In addition, you will gain an understanding of index, multi- and hierarchical index as well as multi-row headers.
The day also covers different ways to select data based on row or column values, which is equivalent to SQL select statements with various filtering conditions. We will explore how to transpose, join, concatenate, merge and reshape tables, with various important concepts and configurations to perform these operations. Additionally, you will learn how to create pivot tables and apply the GroupBy operator, explaining what these concepts are, why they are useful and how to apply them and obtain their results.
Furthermore, you will understand how to create summaries, binning and aggregations of data by applying existing or user-defined functions. Finally, the day will cover how to generate basic plots and visualisations. By the end of the day, you will have gained the necessary skills to work with data using Pandas, a widely used library in the field of data science.
Programme Structure
The program focuses on:
- Data Cleaning and Preprocessing
- Reading and saving data
- Dealing with missing values and duplicates
- Sorting data
- Replacing specific values in data
- Data transformation
- Data Selection and Manipulation
- Understanding different types of index and hierarchical index
- Selecting data based on rows or columns
- Transposing, joining, concatenating, and reshaping tables
- Using apply, transform and filter functions
- Data Aggregation and Summarization.
- Creating pivot tables and applying the GroupBy operator
- Creating summaries, binning, and aggregations of data using built-in and user-defined functions
- Exploring different types of table join and merge operations
- Dealing with time-series data
- Data Visualisation
- Generating basic plots and visualisations using Pandas and Matplotlib
- Understanding different types of charts and their applications
- Creating custom visualisations using Seaborn and Plotly
- Exporting plots and visualisations in different file formats
Key information
Duration
- Part-time
- 1 days
Start dates & application deadlines
- Starting
- Apply before
-
Language
Delivered
Disciplines
Data Science & Big Data Software Engineering View 43 other Short Courses in Software Engineering in United KingdomAcademic requirements
We are not aware of any specific GRE, GMAT or GPA grading score requirements for this programme.
English requirements
We are not aware of any English requirements for this programme.
Other requirements
General requirements
- Basic knowledge of Python programming and familiarity with Python data types and data structures, such as dictionaries and lists, is expected to benefit from this day.
Technological requirements
- If you are using a laptop or desktop computer, you will also be offered the option of connecting using a web browser.
- If you connect via a web browser, Chrome is recommended.
Tuition Fee
-
International
115 GBP/fullTuition FeeBased on the tuition of 115 GBP for the full programme during 1 days. -
National
115 GBP/fullTuition FeeBased on the tuition of 115 GBP for the full programme during 1 days.