Analyzing Data with Python, Certificate | Part time online | edX - online learning platform | United States
35 days
Duration
Free
Free
Unknown
Tuition fee
Anytime
Unknown
Apply date
Anytime
Unknown
Start date

About

EdX is an online learning platform trusted by over 12 million users offering the Analyzing Data with Python in collaboration with IBMx. In this course, you will learn how to analyze data in Python using multi-dimensional arrays in numpy, manipulate DataFrames in pandas, use SciPy library of mathematical routines, and perform machine learning using scikit-learn!

Visit the official programme website for more information

Overview

Learn how to analyze data using Python. This Analyzing Data with Python course at IBMx will take you from the basics of Python to exploring many different types of data. You will learn how to prepare data for analysis, perform simple statistical analyses, create meaningful data visualizations, predict future trends from data, and more!

What you will learn

You will learn how to:

  • Import data sets

  • Clean and prepare data for analysis

  • Manipulate pandas DataFrame

  • Summarize data

  • Build machine learning models using scikit-learn

  • Build data pipelines

  • Data Analysis with Python is delivered through lecture, hands-on labs, and assignment.

It includes following parts:

Data Analysis libraries: will learn to use Pandas DataFrames, Numpy multi-dimentional arrays, and SciPy libraries to work with a various datasets. We will introduce you to pandas, an open-source library, and we will use it to load, manipulate, analyze, and visualize cool datasets. Then we will introduce you to another open-source library, scikit-learn, and we will use some of its machine learning algorithms to build smart models and make cool predictions.

Programme Structure

Courses include:

Module 1 - Importing Datasets

  • Learning Objectives
  • Understanding the Domain
  • Understanding the Dataset
  • Python package for data science
  • Importing and Exporting Data in Python
  • Basic Insights from Datasets

Module 2 - Cleaning and Preparing the Data

  • Identify and Handle Missing Values
  • Data Formatting
  • Data Normalization Sets
  • Binning
  • Indicator variables

Module 3 - Summarizing the Data Frame

  • Descriptive Statistics
  • Basic of Grouping
  • ANOVA
  • Correlation
  • More on Correlation

Module 4 - Model Development

  • Simple and Multiple Linear Regression
  • Model Evaluation Using Visualization
  • Polynomial Regression and Pipelines
  • R-squared and MSE for In-Sample Evaluation
  • Prediction and Decision Making

Module 5 - Model Evaluation

  • Model  Evaluation
  • Over-fitting, Under-fitting and Model Selection
  • Ridge Regression
  • Grid Search
  • Model Refinement

Key information

Duration

  • Part-time
    • 35 days
    • 2 hrs/week

Start dates & application deadlines

You can apply for and start this programme anytime.

Language

English

Delivered

Online
  • Self-paced

Academic requirements

We are not aware of any specific GRE, GMAT or GPA grading score requirements for this programme.

English requirements

We are not aware of any English requirements for this programme.

Other requirements

General requirements

Prerequisites

  • Some Python Experience

Tuition Fee

To always see correct tuition fees
  • International

    Free
    Tuition Fee
    Based on the tuition of 0 USD for the full programme during 35 days.
  • National

    Free
    Tuition Fee
    Based on the tuition of 0 USD for the full programme during 35 days.
  • Add a Verified Certificate for $99 USD
  • Limited access:free

Funding

Studyportals Tip: Students can search online for independent or external scholarships that can help fund their studies. Check the scholarships to see whether you are eligible to apply. Many scholarships are either merit-based or needs-based.

Our partners

Analyzing Data with Python
-
edX - online learning platform

Wishlist

Go to your profile page to get personalised recommendations!