Online Training, IT Technology Business Certification Courses - Busygamers.com/Academy

BECOME A BIG DATA & HADOOP DEVELOPER IN
12 WEEKS

View Session Recording

$100K

Avg salary of Big Data Developer is $82-100K

10%

Big Data industry is growing at almost 10% every year.

1.5 Million

Big Data needs 1.5 mn managers by 2018

  330+  REVIEWS
17899+  LEARNERS
"Once a new technology rolls over you, if you`re not part of the steamroller, you`re part of the road."  - Stewart Brand
17,999

OUR MENTOR-DRIVEN PROGRAM
Learn the fundamentals
Learn intermediate concepts
Learn advanced concepts

Getting Started

  • Know history and mission of BGAcademy.
  • Get a walkthrough on the Course plan.
  • Set up Eclipse and get familiar with it.
  • Install and get familiar with common Git commands.

Start with Basics

  • Define and describe Big Data with examples
  • Get introduced to Hadoop along with a walkthrough of Hadoop architecture
  • Get an overview of Linux and basic commands for Big Data development
  • Install Hadoop and create your first MapReduce program
COURSE SYLLABUS
How to Solve Big Data Problem

  • Why is Data So Important?
  • Pre-requisite – Data Scale
  • What is Big Data?
  • Big Bank: Big Challenge
  • Customer Churn Analysis
  • Point-of-Sale Transaction Analysis
  • Common Problems
  • 3 Vs of Big Data
  • Defining Big Data
  • Sources of Data Flood
  • Exploding Data Problem
  • Redefining the Challenges of Big Data
  • Possible Solutions
  • Scaling Up Vs. Scaling Out
  • Challenges of Scaling Out
  • Solution for Data Explosion-Hadoop
  • Hadoop: Introduction
  • Hadoop in Layman's Term
  • Hadoop Ecosystem
  • Evolutionary Features of Hadoop
  • Big Data Benchmarks
  • Hadoop Timeline
  • Why Learn Big Data Technologies?
  • Who is Using Big Data?
  • Yearly Salaries in Big Data World
  • Job Trends in Big Data

Hadoop Framework Description

  • HDFS: Introduction
  • Design of HDFS
  • Why Hadoop Cluster?
  • HDFS Blocks
  • Components of Hadoop 1.x
  • NameNode and Hadoop Cluster
  • Arrangement of Racks
  • Arrangement of Machines and Racks
  • Local FS and HDFS
  • NameNode
  • Checkpointing
  • Replica Placement
  • Benefits-Replica Placement and Rack Awareness
  • URI
  • URL and URN
  • HDFS Commands

HDFS

  • Problems with HDFS in Hadoop 1.x
  • HDFS Federation (Included in Hadoop 2.x)
  • HDFS Federation
  • High Availability
  • Configuration Files in Hadoop
  • HDFS Configurations
  • Core Configurations
  • Configuration Files in Hadoop
  • Java API to Read HDFS File
  • Java API to Write HDFS File
  • Java API - Listing of File in HDFS
  • Important Java Classes to Read From HDFS
  • Anatomy of File Read From HDFS
  • Data Read Steps
  • Checksum and Data Integrity
  • Data Read from HDFS: Additional Points
  • Important Java Classes to Write Data to HDFS
  • Anatomy of File Write to HDFS
  • Writing File to HDFS: Steps
  • Handling Failures During Writing a File

Exploring MapReduce 1

  • Building Principles
  • Introduction to MapReduce
  • Some More Real-World Examples
  • Broad Steps
  • Finding Out Maximum Temperature
  • Pseudo Code
  • Mapper Class
  • Reducer Class
  • Driver Code
  • Exploring Methods of Mapper
  • Exploring Methods of Reducer
  • Serialization
  • Deserialization
  • Serialization Classes in Hadoop

Exploring MapReduce 2

  • InputSplit
  • InputSplit and Data Blocks – Difference
  • Why Is The Block Size 128 MB?
  • RecordReader
  • InputFormat
  • Default Inputformat: TextInputFormat
  • MapReduce Example
  • OutputFormat
  • Using a Different OutputFormat
  • Important Points
  • Important Points
  • Data Locality
  • JobTracker and TaskTracker
  • Speculative Execution
  • Combiner
  • Using Combiner
  • Partitioner
  • Using Partitioner
  • Map Only Job
  • Flow of Operations in MapReduce

Schedulers in Yarn

  • "Serialization in MapReduce
  • Custom Writable in MapReduce
  • Custom Writable in MapReduce
  • Custom WritableComparable in MapReduce
  • Overview
  • Schedulers in YARN
  • FIFO Scheduler
  • Capacity Scheduler
  • Fair Scheduler
  • Differences between Hadoop 1.x and Hadoop 2.x "

Introduction to Pig

  • Apache Pig: a) Introduction
  • b) Adages
  • c) Advantages
  • d) Basics
  • e) Why Pig?
  • f) Pig Deployment
  • g) Pig Terminology
  • h) Samples
  • Data Types & Handling
  • Apach Pig Architecture
  • Installation
  • Execution - Running Modes
  • Running Pig
  • Relation Operators

Exploring Pig

  • Hands-On: a) Pig Latin Commands
  • b) Use Case with YouTube Data
  • Sentiment Analysis on Twitter data using Apache Pig

Advanced Pig

  • Hands-On: Writing Pig UDF
  • Execution of xml file Using Pig
  • Advanced Joins Using Pig
  • Ebooks on real time case studies on Pig

Project I

  • Mini Project Discussion
  • Flume Introduction
  • Flume Use Case

Project I cont...

  • Mini Project Discussion contd.
  • Best Practices in Big Data Project Deployment

Advanced MapReduce - I

  • What are Joins?
  • When do we need to use Joins?
  • Map Side Joins:a) Introduction
  • b) What is Distributed Cache?
  • c) Map Side Join Process
  • d) Illustration to Use Distributed Cache?
  • e) Example for MAP-SIDE JOIN Using MapReduce
  • f) Hands-On
  • Reducer Side Join: a) Introduction
  • b) How it Works?
  • c) Illustration to Use Reducer Side Join
  • d) Example for Reducer Side Join using MapReduce
  • e) Hands-On

Advanced MapReduce - II

  • Pre-requisites to Understand Custom Input Format
  • a) RecordReader
  • b) WritableComparable
  • c) FileInputFormat
  • CustomerInputFormat Demo
  • Pre-requisites to Understand Sequence File Format
  • Sequence File Format Demo

Hive Introduction

  • Indroduction
  • Installing Hive
  • Execution Engines of Hive
  • Architecture
  • Services
  • Clients
  • The Metastore
  • Function
  • Query Lifecycle on Hadoop
  • Advantages & Limitations
  • A Walkthrough of Hive Components

Hive Operations

  • Hive Data Definitions
  • Hive Data Manipulations
  • HiveQL Operations
  • Hands-On Titanic Dataset
  • Partitioning & Bucketing

Advanced Hive

  • Complex Data Types Demo
  • Hive UDF Demo
  • Thrift Server Demo
  • Join optimization aspects like Map joins
  • Sort Merge Bucket (SMB) joins
  • Real time use cases implementing join optimzations

Overview of HBase

  • NoSQL Databases
  • Types of NoSQL
  • CAP Theorem
  • Introduction to HBase
  • HBase Architecture
  • HBase vs RDBMS
  • HBase Scalable Deployment
  • HBase Data Model

Hbase Data Model and CRUD Operations

  • Data Model Components
  • Row Example
  • What is a Column Family?
  • Column Family Concepts
  • Hbase Shell Command – Demo
  • Hbase Java API - Demo

Hbase Thrift and Rest Server

  • Hbase Thrift Server: a) What is Hbase Thrift Server?
  • b) Integrating Hbase with your Application
  • c) Sending Request and Response from Thrift Server
  • d) Example
  • HBAse Rest Server
  • Hive HBAse Integration
  • Hands-On

Oozie and Sqoop

  • Oozie
  • Oozie Architecture
  • Oozie Workflow Nodes
  • Oozie Server
  • Oozie Workflow
  • Sqoop Hands On

Project II

  • Major Project Discussion
  • Getting started with Spark - Part 1
  • Discussing Ebook-1 on Spark

Project II cont...

  • Major Project Discussion contd.
  • Getting started with Spark - Part 2
  • Discussing Ebook-2 on Spark

Project II cont...

  • Major Project Discussion contd.
  • Getting started with Business Analytics With R - Part 1
  • Discussing Ebook-1 on Business Analytics With R

Project II cont...

  • Major Project Discussion contd.
  • Getting started with Business Analytics With R - Part 2
  • Discussing Ebook-2 on Business Analytics With R

INTERESTED IN CORPORATE TRAINING?
12
WEEKS
COURSE DURATION
2
SESSIONS
EVERY WEEK
1.5
HOURS
EACH SESSION
200
HOURS
PROJECTS & ASSIGNMENTS
GREAT
CAREER

COURSE EXTRACT

Level
Intermediate
Pre-Requisites
Basic knowledge of Java & SQL
Technologies/Tools
Map reduce, Pig, Hive, Hbase, Oozie,Sqoop, Flume and Spark.
Projects
2 (Fully Functional)
Batch Size
8 - 10 Students
Job Preparation
Yes
Certification
Yes
24*7 Support
Yes

WE HELP YOU GET A GREAT JOB


ADDITIONAL BONUS WEEK


On course completion, you are ready to enter the exciting world of corporates . We show you how to make the right impression at job interviews.


INTENSIVE JOB INTERVIEW PRACTICE

Congratulations, you're now a programmer with awesome skills. In this bonus week, we'll help you prepare for your job interviews so that you can make a great first impression. This includes:

Job Preparation
2 in-depth mock technical interviews


Job Preparation
Guidance on how to answer real interview questions from top employers


Job Preparation
Assistance with creating a resume that stands out


BUILDING AN ONLINE REPUTATION

We can help you build that online presence on sites such as Git, StackOverflow, Quora, and LinkedIn.

Oracle
IBM
Microsoft
CISCO

PROJECTS WHICH STUDENTS WILL DEVELOP

1
State-Wise Development Analysis In India

Aim of this project is to analyze how various state governments have performed in different developmental schemes. This analysis will be helpful in finding out how successful the government has been in implementing various projects.

2
Titanic Data Analysis

Aim of this project is to analyze the casualty details like average age of the passenger who survived and died, number of females survived, details of passengers travelling in different classes etc.

3
USA Consumer Forum Data Analysis

Aim of this project is to analyze performance of various companies on aspects like customer query resolution time, customer satisfaction rate, etc. and determine which of them is more customer centric.

4
Twitter Sentiment Analysis

Aim of this project is to perform Sentiment analysis on Twitter data to analyze the sentiments related to a particular aspect.

5
USA Crime Analysis

Aim of this project is to analyze which area in USA is more crime prone and what type of crime is more prominent in different areas of USA. This analysis will help in understanding the efficiency of USA police in solving criminal cases.

6
Youtube Data Analysis

Aim of the project is to analyze which category of videos are trending among the users, determine rating of videos under various categories/genres, number of views for various videos, etc.

CUSTOMER FEEDBACK


PRICING

17,999



Refund


MONEY BACK GUARANTEE

If you are unhappy with the course and opt out in the first week, you get a complete refund.



Scholarship


SCHOLARSHIPS

We subsidize our fees by 10% for military personnel, and college students with exceptional records. To apply for a scholarship, email enquiry@acadgild.com.



Discount


GROUP DISCOUNT

More than 3 people in a group or from the same Company can avail special discount. For more information contact us at enquiry@acadgild.com

FAQ'S

What is Big Data and Hadoop Development course?

Hadoop is an open source software framework for storing and processing Big Data. This course focuses on improved performance in terms of data processing by emphasizing on implementing the real time case studies within the stipulated duration of the course. This course will enable trainees to take real time big data projects after successful completion of the course.

Who should do this course?

Any graduate aiming to successfully build the career around Big Data can do this course. This course will be beneficial for:

  • Software Developers and Architects
  • Professionals with analytics and data management profile
  • Business Intelligence Professionals
  • Project Managers
  • Data Scientists
  • Professionals with Business Intelligence, ETL and data warehousing background
  • Professionals from testing and mainframes background.

How is this training going to help me get a job in this domain? OR How is this training going to enhance my career?

After getting trained in this course you will be equipped with the skills that would help you in handling Big Data projects in various companies. We provide real time case studies, projects and assignments which span for around 200 hours to equip our trainees with required skills to excel in Big Data industry. Extra assistance like mock interviews sessions, building the resume, career guidance related to openings in various companies would help you to land your dream job in Big Data.

Can only Java developers do Big Data training or developers from other platforms as well as from other domains like testing/mainframe/business analytics/database/freshers also benefit from this course? OR Is Java a pre-requisite to Learn Big Data and Hadoop?

Big Data and Hadoop have many components like Pig, Hive and Hbase where Java is not a pre-requisite. People from various domains are successfully getting trained and are successful in Big Data industry but knowledge of core Java is an added advantage as it is the main component of Hadoop (MapReduce is implemented in Java).

How is Data Scientist different from Big Data Developer?

A Data Scientist takes the business need and as per the need prepares the plan to implement the analytics project. Data scientist possesses both the skills of a software engineer and an applied scientist.

A Big Data developer on the other hand is responsible for the design and implementation of the application to perform analysis on huge data to uncover the insights and to make the business intelligent by analyzing data from various sources.

What are the pre-requisites for this course?

Although basic knowledge of Java and SQL will be an advantage but anyone irrespective of their domain can master this course. To cater to the need for primary Java skills we provide free core Java training before actual training classes.

Difference between Hadoop and Big Data?

Big Data is huge collection of data which can be referred as an asset. Big Data can include many different types of data in different formats. Hadoop is essentially a programming framework which stores and processes huge data. It is basically a tool to handle Big Data to get business insights.

Technologies that will be used for this course.

Extensive training will be given on MapReduce, Pig, Hive, HBase, Oozie, Sqoop, Flume and Spark.

Chat History
Copyright © Busy Gamers LLP 2016. All Rights Reserved

BGAcademy is an online training academy which teaches web development, mobile application development and big data courses. BGAcademy provides mentor driven online courses in frontend web development, Android app development, big data development, hadoop development, big data administration and robotics. With BGAcademy, you can learn how to build great responsive websites using latest technologies like Angular and Node. You can also learn to develop Android applications from the comfort of your home. Want to learn how to build fully functional and complex websites ? Or want to learn Android application development courses? Looking to create beautiful mobile applications? Looking for mentor driven web development Android programming course at affordable price? Trying to find best online classes for frontend development and Android development? Looking for summer programming camp for children at affordable price? Looking for Android programming for children? Looking for an expert to teach you big data development or big data administration? Want your child to become a robotics engineer? Look no further. Our expert mentors can teach to beginners as well as expert programmers. Our web development, Android app programming courses, big data courses and robotics course are tailored to your need. If you do not have any prior programming knowledge or skills, we will teach you from the basics of programming. With our comprehensive web development and Android app development courses, we are sure BGAcademy is the best online coding bootcamp for Android programming courses.