• Anytime
    Application Deadline
  • 70 days
    Duration

About

In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation.

The analysis of big datasets requires using a cluster of tens, hundreds or thousands of computers. Effectively using such clusters requires the use of distributed files systems, such as the Hadoop Distributed File System (HDFS) and corresponding computational models, such as Hadoop, MapReduce and Spark.

In the Big Data Analytics Using Spark Certificate, part of the Data Science MicroMasters Program from EdX in partnership with University of California, San Diego - UC San DiegoX, you will learn what the bottlenecks are in massive parallel computation and how to use spark to minimize these bottlenecks.

You will learn how to perform supervised an unsupervised machine learning on massive datasets using the Machine Learning Library (MLlib).

In this course, as in the other ones in this MicroMasters program, you will gain hands-on experience using PySpark within the Jupyter notebooks environment.

Detailed Programme Facts

  • Programme intensity Part-time
    • Average part-time duration 70 days
    • Intensity 10 hrs/week
    • Part-time variant
      Flexible
    • Duration description
      10 weeks, 10 hours per week
  • Languages
    • English
  • Delivery mode
    Online
  • More information Go to the programme website

Programme Structure

What you'll learn

  • Programming Spark using Pyspark
  • Identifying the computational tradeoffs in a Spark application
  • Performing data loading and cleaning using Spark and Parquet
  • Modeling data through statistical and machine learning methods

Lecturers

  • Yoav Freund - Professor of Computer Science and Engineering, UC San Diego

English Language Requirements

This programme may require students to demonstrate proficiency in English.

Academic Requirements

Prerequisites

  • The previous courses in the MicroMasters program: Python for Data Science, Statistics and Probability in Data Science using Python and Machine Learning Fundamentals

Tuition Fee

  • International

    350 USD/full
    Tuition Fee
    Based on the original amount of 350 USD for the full programme and a duration of 70 days.
  • National

    350 USD/full
    Tuition Fee
    Based on the original amount of 350 USD for the full programme and a duration of 70 days.
We've labeled the tuition fee that applies to you because we think you are from and prefer over other currencies.
5% discount coupon: Z7LZNQ4TN3B2JTWU valid for any free course + certificate upgrade

Funding

Check the programme website for information about funding options.

StudyPortals Tip: Students can search online for independent or external scholarships that can help fund their studies. Check the scholarships to see whether you are eligible to apply. Many scholarships are either merit-based or needs-based.

The Global Study Awards: get funded with up to £10,000 to study abroad

Together with the ISIC Association and British Council IELTS, Studyportals offers you the chance to receive up to £10000 to expand your horizon and study abroad. We want to ultimately encourage you to study abroad in order to experience and explore new countries, cultures and languages.