Big Data Analytics Using Spark, Certificate | Part time online | edX - online learning platform | United States
70 days
Duration
350 USD/full
350 USD/full
Unknown
Tuition fee
Anytime
Unknown
Apply date
Anytime
Unknown
Start date

About

EdX is an online learning platform trusted by over 12 million users offering the Big Data Analytics Using Spark Certificate in collaboration with University of California, San Diego - UC San DiegoX. Learn how to analyze large datasets using Jupyter notebooks, MapReduce and Spark as a platform.

Visit the Visit programme website for more information

Overview

Key Facts

In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation.

The analysis of big datasets requires using a cluster of tens, hundreds or thousands of computers. Effectively using such clusters requires the use of distributed files systems, such as the Hadoop Distributed File System (HDFS) and corresponding computational models, such as Hadoop, MapReduce and Spark.

In the Big Data Analytics Using Spark Certificate, part of the Data Science MicroMasters Program from EdX in partnership with University of California, San Diego - UC San DiegoX, you will learn what the bottlenecks are in massive parallel computation and how to use spark to minimize these bottlenecks.

You will learn how to perform supervised an unsupervised machine learning on massive datasets using the Machine Learning Library (MLlib).

In this course, as in the other ones in this MicroMasters program, you will gain hands-on experience using PySpark within the Jupyter notebooks environment.

Programme Structure

What you'll learn

  • Programming Spark using Pyspark

  • Identifying the computational tradeoffs in a Spark application

  • Performing data loading and cleaning using Spark and Parquet

  • Modeling data through statistical and machine learning methods

Key information

Duration

  • Part-time
    • 70 days
    • 9 hrs/week

Start dates & application deadlines

You can apply for and start this programme anytime.

Language

English

Delivered

Online

Academic requirements

We are not aware of any specific GRE, GMAT or GPA grading score requirements for this programme.

English requirements

We are not aware of any English requirements for this programme.

Other requirements

General requirements

Prerequisites

  • The previous courses in the MicroMasters program: Python for Data Science, Probability and Statistics in Data Science using Python, Machine Learning Fundamentals.

Tuition Fee

To always see correct tuition fees
  • International

    350 USD/full
    Tuition Fee
    Based on the tuition of 350 USD for the full programme during 70 days.
  • National

    350 USD/full
    Tuition Fee
    Based on the tuition of 350 USD for the full programme during 70 days.
  • Unlimited access + verified certificate: $350
  • Limited access: free

Funding

Other interesting programmes for you

Our partners

Big Data Analytics Using Spark
-
edX - online learning platform

Wishlist

Go to your profile page to get personalised recommendations!