Big Data Hadoop

Introduction

Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software Foundation.

Syllabus

Hadoop Architecture and Design

• Introduction to Hadoop Ecosystem.
• NameNode and DataNodes.
• Data Replication.
• The Persistence of File System Metadata.
• File System Robustness.
• Data Organization.
• HDFS Shell Commands

Map Reduce

• Map Reduce Components.
• Keys and Values in MapReduce.
• Job and Task Tracker.

PIG

• PIG and its features.
• Data Types.
• PIG scripting.

HIVE

• History of HIVE.
• Hive Architecture.
• Hive Components.
• Hive SQL Scripting.

Hadoop other tools

• Sqoop.
• Oozie.
• Flume.
• Zookeeper.
• Introduction to HBASE.

Duration:

45 hrs

Program Structure:

The course will have 30 days Activity based classroom Training,
Seminars, Tutorials, Case Studies, Assignments and Exams.

Eligibility:

• Graduates and Post-Graduates
• Job aspirants looking to make career in Big data

Share This Post:

iTpreneu