Big Data Analytics Using Spark, Certificate | Part time online | edX - online learning platform | United States
70 days
Tuition fee
Apply date
Start date


EdX is an online learning platform trusted by over 12 million users offering the Big Data Analytics Using Spark Certificate in collaboration with University of California, San Diego - UC San DiegoX. Learn how to analyze large datasets using Jupyter notebooks, MapReduce and Spark as a platform.

Visit the Visit university website for more information


In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation.


The analysis of big datasets requires using a cluster of tens, hundreds or thousands of computers. Effectively using such clusters requires the use of distributed files systems, such as the Hadoop Distributed File System (HDFS) and corresponding computational models, such as Hadoop, MapReduce and Spark.

In the Big Data Analytics Using Spark Certificate, part of the Data Science MicroMasters Program from EdX in partnership with University of California, San Diego - UC San DiegoX, you will learn what the bottlenecks are in massive parallel computation and how to use spark to minimize these bottlenecks.

You will learn how to perform supervised an unsupervised machine learning on massive datasets using the Machine Learning Library (MLlib).

In this course, as in the other ones in this MicroMasters program, you will gain hands-on experience using PySpark within the Jupyter notebooks environment.

Programme Structure

What you'll learn

  • Programming Spark using Pyspark

  • Identifying the computational tradeoffs in a Spark application

  • Performing data loading and cleaning using Spark and Parquet

  • Modeling data through statistical and machine learning methods

Key information


  • Part-time
    • 70 days
    • 9 hrs/week

Start dates & application deadlines

You can apply for and start this programme anytime.




  • Self-paced

Academic requirements

We are not aware of any specific GRE, GMAT or GPA grading score requirements for this programme.

English requirements

We are not aware of any English requirements for this programme.

Other requirements

General requirements


  • The previous courses in the MicroMasters program: Python for Data Science, Probability and Statistics in Data Science using Python, Machine Learning Fundamentals.

Tuition Fee

To always see correct tuition fees
  • International

    Tuition Fee
    Based on the tuition of 0 USD for the full programme during 70 days.
  • National

    Tuition Fee
    Based on the tuition of 0 USD for the full programme during 70 days.
  • Unlimited access to self-paced, in-demand courses and professional certificates
  • Starting at $349 per learner/year.


Our partners

Big Data Analytics Using Spark
edX - online learning platform


Go to your profile page to get personalised recommendations!