Learn everything about Analytics

Introduction to Big Data with Apache Spark – UC BerkeleyX- EDX

0-6 Month Online
Beginner 1-Jun-2015
Online Big Data
Online Self Paced 2456


[su_tab title = “Description”]

This course will attempt to articulate the expected output of Data Scientists and then teach students how to use PySpark (part of Apache Spark)  to deliver against these expectations. The course assignments include Log Mining, Textual Entity Recognition, Collaborative Filtering exercises that teach students how to manipulate data sets using parallel processing with PySpark.


[su_tab title = “Program Structure”]

This course covers advanced undergraduate-level material.

Important Date:

Starts June 1, 2015


5 week

5 – 7 hours per week

Full time/Part time:

Part time


[su_tab title = “Eligibility”]

  • Programming background and experience with Python required. All exercises will use PySpark (part of Apache Spark), but previous experience with Spark or distributed computing is NOT required.


[su_tab title =”Tools”]

  • Python
  • Apache Spark
  • PySpark


[su_tab title = “Faculty”]

  • Anthony D. Joseph


[su_tab title = “Contact”]

Name :
Email :
Contact Number :
Message :
Code :



This article is quite old and you might not get a prompt response from the author. We request you to post this comment on Analytics Vidhya's Discussion portal to get your queries resolved