Join below Google Group, its free and access all below modules
Syllabus of Free Training Module 1: Introduction to Apache Spark (Available Length 48 Minutes) - Introduction to Apache Spark
- Features of Apache Spark
- Apache Spark Stack
- Introduction to RDD's
- RDD's Transformation
- What is good and bad In MapReduce?
- Why to use Apache Spark
Module 2: Cloudera QuickStart VM Installation (Hands-on Lab + PDF Download) (Available Length 34 Minutes) - Include Hadoop
- Include Apache Spark
- Include Hive
- Include Sqoop
- Include Hue
Module 3: Deep Dive in HDFS: (Available Length 48 Minutes) - HDFS Design
- Fundamental of HDFS (Blocks, NameNode, DataNode, Secondary Name Node)
- Rack Awareness
- Read/Write from HDFS
- HDFS Federation and High Availability (Hadoop 2.x.x)
- HDFS Command Line Interface
Module 4: Spark Shell Hands On Using HDFS (Hands-on Lab + PDF Download) (Available Length 34 Minutes) - Spark Shell Introduction
- Create file using Hue
- Spark Shell extracting file from HDFS
- Create RDD from HDFS file
Module 5: Programming with RDD Part-1 (Hands-on Lab + PDF Download) (Available Length 28 Minutes) - Creating new RDD
- Transformations on RDD
- Lineage Graph
- Actions on RDD
- RDD Concepts on Persist and Cache
- Lazy evaluation of RDD
|
|