Overview
Manipulating big data distributed over a cluster using functional concepts is common in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop and, more recently, Apache Spark, a fast, in-memory distributed collections framework written in Scala. In this course, we'll see how the data-parallel paradigm can be extended to the distributed case, using Spark throughout.
Key facts
- We'll cover Spark's programming model in detail, being careful to understand how and when it differs from familiar programming models, like shared-memory parallel collections or sequential Scala collections.
- Through hands-on examples in Spark and Scala, we'll learn when important issues related to distribution, such as latency and network communication, should be considered and how they can be addressed effectively for improved performance.
Learning Outcomes
By the end of the Big Data Analysis with Scala and Spark course, offered by Coursera in partnership with Ecole Polytechnique Fédérale de Lausanne, you will be able to:
- read data from persistent storage and load it into Apache Spark,
- manipulate data with Spark and Scala,
- express algorithms for data analysis in a functional style,
- recognize how to avoid shuffles and recomputation in Spark.
Programme Structure
Courses include:
- Getting Started + Spark Basics
- Reduction Operations & Distributed Key-Value Pairs
- Partitioning and Shuffling
- Structured data: SQL, DataFrames, and Datasets
Key information
Duration
- Part-time
- 2 days
Start dates & application deadlines
Language
Delivered
Campus Location
- Mountain View, United States
Disciplines
- Data Science & Big Data
- Data Analytics
Academic requirements
We are not aware of any specific GRE, GMAT or GPA grading score requirements for this programme.
English requirements
We are not aware of any English requirements for this programme.
Other requirements
General requirements
- Intermediate Level
- Some related experience required
Tuition Fees
International
- Non-residents: Free
- Out-of-State: Free
Additional Details
- Coursera Plus: Subscribe to build job-ready skills from world-class institutions.
- $59/month (cancel anytime) or $399/year with a 14-day money-back guarantee
Funding
Coursera provides financial aid to learners who cannot afford the fee. Apply for it by clicking on the Financial Aid link beneath the "Enroll" button on the left. You'll be prompted to complete an application and will be notified if you are approved. You'll need to complete this step for each course in the Specialization, including the Capstone Project.