Developing Solutions Using Apache Hadoop: Online Training Classes and Courses | BISP

BIG DATA

Introduction to Big Data & its impact

HADOOP

Apache Hadoop 2.x Overview

Introduction to Apache Hadoop and the Hadoop Ecosystem

Apache Hadoop 2.x Cluster Components

HADOOP DISTRIBUTED FILE SYSTEM

HDFS Architecture

NameNode

DataNode

Secondary NameNode

Read & write operations

HDFS high availability

HDFS Federation

DISTRIBUTED PROCESSING IN HADOOP CLUSTER

MapReduce workflow

YARN architecture

SINGLE NODE HADOOP SETUP

Different ways of setting up Hadoop

Activity: Step-by-step installation & configuration guide

Activity: Executing HDFS commands to understand the HDFS file system

MULTINODE HADOOP SETUP

Prerequisites for multinode setup

Activity: Step-by-step installation & configuration guide

HADOOP ADMINISTRATION ACTIVITIES

Activity: Adding & removing nodes from a cluster

Activity: Modifying Hadoop configuration parameters

Checking the health of a cluster

SPARK

Introduction to Spark

Components of Spark

Activity: Scala Crash course

Activity: Spark installation and setup (Eclipse & CLI)

RDD

Introduction to RDD

Transformations & actions of RDD

Activity: Practice programs to understand RDDs (with real data)

SPARK SQL

Importance of Spark SQL

Datasets & DataFrames

Data sources and different file formats

Activity: Running Spark SQL programmatically

Demo project to showcase all the components learned

Course ID: HDP001
Course Fees: 301 USD