Overview
Context
Advance your data skills by mastering Apache Spark. Using the Spark Python API, PySpark, you will leverage parallel computation with large datasets, and get ready for high-performance machine learning.
From cleaning data to creating features and implementing machine learning models, you'll execute end-to-end workflows with Spark. The Big Data with PySpark course offered by DataCamp ends with building a recommendation engine using the popular MovieLens dataset and the Million Songs dataset.
What you will do during this course:
- Master PySpark to handle big data with ease: learn to process, query, and optimize massive datasets for powerful analytics.
- Learn the fundamentals of working with big data with PySpark.
- Learn how to clean data with Apache Spark in Python.
- Learn the gritty details that data scientists spend 70-80% of their time on: data wrangling and feature engineering.
- Learn how to make predictions from data with Apache Spark, using decision trees, logistic regression, linear regression, ensembles, and pipelines.
- Learn tools and techniques to leverage your own big data to facilitate positive experiences for your users.
Programme Structure
Courses
PySpark
- Big Data with PySpark
- Cleaning Data with PySpark
- Feature Engineering with PySpark
- Machine Learning with PySpark
- Building Recommendation Engines with PySpark
- Bonus: Building a Demand Forecasting Model
Key information
Duration
- Part-time
- 3 days
Campus Location
- New York City, United States
Disciplines
- Data Science & Big Data
Academic requirements
We are not aware of any specific GRE, GMAT or GPA grading score requirements for this programme.
English requirements
We are not aware of any English requirements for this programme.
Other requirements
General requirements
- There are no formal prerequisites for this track, but prior knowledge of machine learning and Python is assumed.
- Data analysts, data engineers, and machine learning engineers will benefit from this track.
Tuition Fees
- International: Free (non-residents and out-of-state)
- Domestic: Free (in-state)
Additional Details
- This course can be accessed for free with a DataCamp Premium or Teams subscription.