Studyportals
Certificate Online

Big Data with PySpark Data Camp

Highlights
Tuition fee
Free
Free
Free
Unknown
Tuition fee
Free
Free
Free
Unknown
Duration
3 days
Duration
3 days
Apply date
Anytime
Unknown
Apply date
Anytime
Unknown
Start date
Anytime
Unknown
Start date
Anytime
Unknown
Taught in
English
Taught in
English

About

In this Big Data with PySpark course offered by Data Camp you will master how to process big data and leverage it efficiently with Apache Spark using the PySpark API.

Overview

Context

Advance your data skills by mastering Apache Spark. Using the Spark Python API, PySpark, you will leverage parallel computation with large datasets, and get ready for high-performance machine learning. 

From cleaning data to creating features and implementing machine learning models, you'll execute end-to-end workflows with Spark. The Big Data with PySpark course offered by Data Camp ends with building a recommendation engine using the popular MovieLens dataset and the Million Songs dataset.

What you will do during this course:

  • Master PySpark to handle big data with ease—learn to process, query, and optimize massive datasets for powerful analytics
  • Learn the fundamentals of working with big data with PySpark.
  • Learn how to clean data with Apache Spark in Python.
  • Learn the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering.
  • Learn how to make predictions from data with Apache Spark, using decision trees, logistic regression, linear regression, ensembles, and pipelines.
  • Learn tools and techniques to leverage your own big data to facilitate positive experiences for your users.

Programme Structure

Courses

  • PySpark

  • Big Data with PySpark
  • Cleaning Data with PySpark
  • Feature Engineering with PySpark
  • Machine Learning with PySpark
  • Building Recommendation Engines with PySpark
  • Bonus: Building a Demand Forecasting Model

Key information

Duration

  • Part-time
    • 3 days

Start dates & application deadlines

You can apply for and start this programme anytime.

Language

English

Delivered

Online

Campus Location

  • New York City, United States

What students do after studying

Join for free or log in to access our complete career info list.

Academic requirements

We are not aware of any specific GRE, GMAT or GPA grading score requirements for this programme.

English requirements

We are not aware of any English requirements for this programme.

Other requirements

General requirements

  • There are no prerequisites for this track
  • Prior knowledge of machine learning and Python is assumed if you start this track.
  • Data analysts, data engineers, and machine learning engineers will benefit from this Track.

Tuition Fees

Tuition fees are shown in and the most likely applicable fee is shown based on your nationality.
  • International

    Non-residents
    Free
  • Out-of-State
    Free
  • Domestic

    In-State
    Free

Additional Details

  • This course can be accessed for free with the Data Camp Premium or Teams subscriptions

Funding

Other interesting programmes for you

Our partners

Big Data with PySpark
Data Camp
Big Data with PySpark
-
Data Camp

Wishlist

Go to your profile page to get personalised recommendations!