Subscribe
Price : $220
Start Date : 10/06/2017
End Date : TBD
Author : Rahul Anand
Tecwallet Funds available: $0
LogIn Forgot password? Create account
OR
To explore the fundamental concepts of big data analytics
To develop in-depth knowledge and understanding of the big data analytic domain.
To learn to analyze the big data using intelligent techniques.
To use advanced analytical tools/ decision-making tools/ operation research techniques to analyze the complex problems and get ready to develop such new techniques for the future..
Master of understanding the concept of the Big Data & Hadoop framework.
Acquire in-depth to understanding with several other types of data which store in Big Data & Hadoop.
Understanding the methods on how to Big Data & Hadoop deployment in a cluster environment and infrastructure.
Master the core and advanced concepts of the Hadoop Ecosystem, including HDFS & Map- Reduce frameworks. Get hands-on experience in setting up a single node Hadoop cluster.
Master with distinct other components of Hadoop Ecosystem.
Performer Data Analytics using PIG & HIVE.
Best implementation of Hadoop project.
Real-life working experience on an industry based project on Big Data Analytics using the Hadoop Ecosystem and much more.
With the number of Big Data & Hadoop careers are on the rise, this course is fast becoming the must-know technology for the following professionals:
Data Architects
Data Engineer
Technical Engineer
Data Analyst
Data Integration Architects
Tech Managers
Decision Makers
Database Administrators
Java Developers/ Any other developers
Technical Infrastructure Team
Any working professional interested in knowing Hadoop
Any graduate/post-graduate with an urge to learn Hadoop
Familiarity with core java will be an advantage, but is not mandatory.
Familiarity with any database will be an advantage, but is not mandatory.
Module-1
Introduction to Big Data
Big Data Definition
Significance -Why Big Data?
At what rate data is moving towards BigData?
How single person contributing towards BigData?
Role of BigData in day today life
Why RDBMS is not suited for BigData
Drawbacks of RDBMS
Case Study description for BigData Giant companies like Facebook, Google, Amazon, uber
Module-2
Introduction to Hadoop
Hadoop -History
Hadoop Architecture
Why is Hadoop Important?
How are files stored in Hadoop?
Hadoop Components
Hadoop Ecosystem
Block Allocation in HDFS
HDFS Architecture
HDFS Read Operation
HDFS Write Operation
When to use and not use HDFS
Advantages of Hadoop
Drawbacks in Hadoop 1.0
Introduction to Hadoop 2.0
Yarn Architecture
Yarn Components
YARN Ecosystem
Difference between Hadoop 1.0 and 2.0
Hands-on Exercises on Cloudera 5.10 and Software Installation
Module-3
MapReduce Concept
MapReduce Components
MapReduce Architecture
MapReduce Internals
Mapper, Reducer, Driver
Understanding Mapper
Understanding Reducer
Shuffler, Sort
Practitioner
Combiner
Running a MapReduce Job
Hands-on Exercises on word count Job using Jar files
Module-4
Introduction to Pig
Pig History
Pig Architecture
Pig Components
Pig Latin Basics
Data Loading and Storing in Pig
Filtering in Pig
Data Transformation in Pig
Grouping and Sorting in Pig
Advanced Features
Joins in Pig &User Defined Functions
Hands-on Exercises on different data-set and basic loading operation with complex approach algorithms
Module-5
Introduction to Hive
Hive History
Hive Architecture
Hive Components
Data Storage in Hive
Data Types in Hive
Hive Query Language Features
Partitions in Hive
Joins in Hive
Hands-on Exercises on different data-set and basic loading operation with complex algorithm approach.
Module-6
Introduction to Sqoop
Sqoop Overview
Data Import and Export
Need for Sqoop
Uses for Sqoop
Advantages for Sqoop
Hands-on Exercises to load data from MySQL to HDFS and HDFS to MySQL using different attributes.
Module-7
Introduction to Apache Impala
Apache Impala Architecture
Apache Impala Features
Uses Apache Impala
Advantages Apache Impala
Comparing Apache Impala and Apache Hive
Hands-on Exercises on different Dataset and building up solution for certain business problem
Module-8
Introduction to Apache Oozie
Apache Oozie Architecture
Apache Oozie workflow
Understating use case for Apache Oozie
Hands-on Exercises and understand to schedule job in Hive using Apache Oozie
Module-9
Understanding Apache Flume
Architecture Apache Flume
Source Concept
Sink Concept
Channel Concept
Extract Data from Twitter
Understanding Cloudera Manager
Comments ( To Post Your Comment Please Login First! )