Apache Hadoop Administration Training: 3 Days

Apache Hadoop Administration Training Course Apache Hadoop Administration Training
By TED Integrated
Apache Series:

Introduction to Apache Hadoop Administration

“Learn to install, configure, and maintain Hadoop on Linux in various compute environments.”

By TED Integrated


PSMB Certified Training Provider
Request for In-house TrainingRequest for In-house Training
Register for Public Training Public program not yet available
Email Customer Service [email protected] Contact Customer Service +603-2386-7788
Course Title:
Introduction to Apache Hadoop Administration
Training Category:
Information Technology
Target Audience:
One should have a basic knowledge of Linux Administration, Fundamentals of Java programming, and Knowledge of Hadoop.
Duration:
3 Days
Public Training Events
Nov 2014 ›
Venue:
Melia Hotel, Kuala Lumpur
Schedule:
Wed 19 Nov 2014 - Thu 20 Nov 2014
9:00AM - 5:00PM

Fee Per Person:
RM1,500.00
Promotions:
  • RM1,200 per person for group registration of 5 paxs or more from a same organization.
Register Now!

* Other terms & conditions apply.
For In-house Training
Request for Quotation Request for Quotation
For Other Inquiries
Contact TED Integrated
Contact customer service +603-2386-7788
Email customer service [email protected]
Delivery Methods
  • Language: English
  • PowerPoint Presentation
  • Workshop
  • Presentation Handouts
  • Computer Lab Work
  • Lecture
  • Certificate of Participation
Ad by Google
*Terms & Conditions

Course Introduction ›

Apache Hadoop is an open source framework for creating reliable and distributable compute clusters. Credited with the IBM Watson Jeopardy win in 2011, Hadoop can be used (with other related frameworks) to process large unstructured or semi-structured data sets from multiple sources to dissect, classify, learn from, and make suggestions for business analytics, decision support, and other advanced forms of machine intelligence.

As administrators, you will need to install, configure, and maintain Hadoop on Linux in various compute environments.

Course Objectives ›

This introductory-level, hands-on lab-intensive course is geared for the administrator who is new to Hadoop and responsible for maintaining a Hadoop cluster and its related components. Hadoop is a system designed for massive scalability; it's extremely fault-tolerant compared to other cluster architectures.

This course agenda may be easily customized for addressing areas of specific interest to your team. There are lab variations that support Cloudera and Hortonworks distributions as well.

Course Outline ›

DAY 1

1. Hadoop Overview

  • Map/Reduce
  • Hadoop, YARN, and Spark
  • Mahout and MLib
  • Alternate Frameworks

2. Hadoop Architecture

  • Hadoop Map/Reduce
  • YARN
  • HDFS
  • Spark
  • Cassandra
  • HBase
  • Hive
  • Pig
DAY 2

3. Installing Hadoop

  • Linux Considerations
  • SSH Configuration
  • Hadoop Installation
  • OS Security
  • NamedNodes
  • Job Trackers

4. Test-Running Hadoop Programs

  • Simple MapReduce Test
  • Spark Test
  • Pig Test
DAY 3

5. Cloud Installations

  • Amazon EC2
  • Amazon Elastic MapReduce
  • Rackspace
  • Installing with Docker

6. Optimization and Tuning

  • Performance Metrics
  • Node Sizing
  • Kernel Tuning

7. Installing HBase

  • HBase Installation
  • ZooKeeper

Contact us now ›

  • Course content customization
  • In-house training request
  • Available public program
  • Consultation services
  • Other inquiries
Contact Customer ServiceCall : +603-2386-7788
Email Customer Service E-mail : [email protected]

Related Courses ›

Published by: ,