Studyportals
Certificate Online

Introduction to Spark with sparklyr in R Data Camp

Highlights
Tuition fee
Free
Duration
1 day
Apply date
Anytime
Start date
Anytime
Taught in
English

About

In this Introduction to Spark with sparklyr in R course, offered by Data Camp, you will learn how to run big data analysis using Spark and the sparklyr package in R.

Overview

Context

R is optimized to help you write data analysis code quickly and readably, while Apache Spark is designed to analyze huge datasets quickly. The sparklyr package lets you write dplyr R code that runs on a Spark cluster, giving you the best of both worlds. This Introduction to Spark with sparklyr in R course at Data Camp teaches you how to manipulate Spark DataFrames using both the dplyr interface and Spark's native interface, and lets you try out machine learning techniques.

Load Data into Spark and Manipulate Spark DataFrames

You’ll start this Spark course by investigating how Spark and R work together and by practicing loading data, ready for cleaning, transformation, and analysis. You’ll use Spark DataFrames and dplyr syntax to manipulate your data by filtering and arranging rows, and by mutating and summarizing columns.
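As a rough illustration of the row and column operations described above, the sketch below applies filter, mutate, summarise, and arrange to R's built-in mtcars dataset. It is not taken from the course materials: the helper names summarise_mpg and run_on_spark, the table name "mtcars", and the choice of columns are assumptions made for this example, and the Spark part assumes sparklyr and a local Spark installation are available.

```r
library(dplyr)

# Hypothetical helper: the same dplyr verbs work on a local data frame
# and on a Spark DataFrame reference created by sparklyr.
summarise_mpg <- function(tbl) {
  tbl %>%
    filter(hp > 100) %>%                            # keep rows with more than 100 hp
    mutate(kpl = mpg * 0.425) %>%                   # add an approximate km-per-litre column
    group_by(cyl) %>%
    summarise(mean_kpl = mean(kpl, na.rm = TRUE)) %>% # one row per cylinder count
    arrange(cyl)                                    # order the result by cylinder count
}

# Runs locally, without Spark
local_result <- summarise_mpg(mtcars)

# Assumption: sparklyr and a local Spark installation are available.
# Defined but not called here, so the block loads without Spark.
run_on_spark <- function() {
  sc <- sparklyr::spark_connect(master = "local")
  on.exit(sparklyr::spark_disconnect(sc))
  cars_tbl <- copy_to(sc, mtcars, "mtcars", overwrite = TRUE) # load data into Spark
  summarise_mpg(cars_tbl) %>% collect()            # compute in Spark, pull result into R
}
```

Calling run_on_spark() would send the same pipeline to Spark and return an ordinary R data frame; only the final collect() moves data back into R.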

Delve into Big Data Analysis with Spark MLlib

This course focuses on building your skills and confidence in analyzing huge datasets. The final chapters take you through Spark’s machine learning data transformation features and offer you the chance to practice sparklyr’s machine learning routines by using them to make predictions with gradient boosted trees and random forests.
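To give a sense of what such a prediction workflow might look like, here is a minimal sketch using sparklyr's random forest routine on the built-in mtcars dataset. It is not taken from the course: the function name fit_and_predict, the table name "mtcars_ml", the formula, and the 70/30 split are all assumptions for illustration, and running it requires sparklyr, dplyr, and a Spark connection.

```r
# Hypothetical helper; `sc` is assumed to be a connection created with
# sparklyr::spark_connect(). Nothing runs until the function is called.
fit_and_predict <- function(sc) {
  # Copy a small local dataset into Spark (table name is an assumption)
  cars_tbl <- dplyr::copy_to(sc, mtcars, "mtcars_ml", overwrite = TRUE)

  # Split the Spark DataFrame into training and test sets
  splits <- sparklyr::sdf_random_split(
    cars_tbl, training = 0.7, test = 0.3, seed = 42
  )

  # Fit a random forest regression model in Spark
  model <- sparklyr::ml_random_forest(
    splits$training, mpg ~ wt + hp + cyl, type = "regression"
  )

  # Score the test set and bring actual and predicted values back into R
  scored <- sparklyr::ml_predict(model, splits$test)
  dplyr::collect(dplyr::select(scored, mpg, prediction))
}
```

Swapping sparklyr::ml_random_forest for sparklyr::ml_gradient_boosted_trees, which accepts the same data and formula arguments, would fit a gradient boosted trees model instead.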

Programme Structure

Chapters

  • Light My Fire: Starting To Use Spark With dplyr Syntax
  • Tools of the Trade: Advanced dplyr Usage
  • Going Native: Use The Native Interface to Manipulate Spark DataFrames
  • Case Study: Learning to be a Machine: Running Machine Learning Models on Spark

Key information

Duration

  • Part-time
    • 1 day

Start dates & application deadlines

You can apply for and start this programme anytime.

Language

English

Delivered

Online

Campus Location

  • New York City, United States

Academic requirements

We are not aware of any specific GRE, GMAT or GPA grading score requirements for this programme.

English requirements

We are not aware of any English requirements for this programme.

Other requirements

General requirements

  • Prerequisite: Supervised Learning in R: Regression
  • No prior knowledge of Apache Spark is required; this course introduces learners to the basics of Apache Spark and how to use Spark with the sparklyr package in R.
  • This course can be beneficial for anyone interested in learning how to manipulate large datasets quickly using Apache Spark and the sparklyr package in R. From data engineers to data scientists to analytics professionals and software developers, anyone working with large datasets would benefit from this course.

Tuition Fees

The most likely applicable tuition fee is shown based on your nationality.
  • International (non-residents): Free
  • Out-of-State: Free
  • Domestic (in-state): Free

Additional Details

This course can be accessed for free with a Data Camp Premium or Teams subscription.
