Class Details

Price: $2,995

4-Day Course Includes:

  • Class exercises in addition to training instruction
  • Courseware books, notepads, pens, highlighters and other materials
  • Free subscription to Cloudera's practice exam questions
  • Full breakfast with a variety of bagels, fruits, yogurt, doughnuts and juice
  • Tea, coffee, and soda available all day
  • Freshly baked cookies every afternoon (only at participating locations)

For group training options, please call us at (240) 667-7757 or email promo@phoenixts.com. 

Course Outline

Choosing Apache Hadoop

  • Overview and History of Hadoop
  • Core Components
  • Fundamental Concepts of Hadoop

HDFS

  • Common Features of HDFS
  • File Writes and Reads
  • Considerations for NameNode
  • HDFS Security
  • NameNode Web UI
  • File Shell

HDFS – Data Ingestion

  • External Sources with Flume for Data Ingestion
  • Relational Databases with Sqoop for Data Ingestion
  • REST Interfaces
  • Common Best Practices for Data Imports

MapReduce

  • MapReduce Basic Features and Concepts
  • Overview of the Architecture
  • MapReduce v2
  • Failure Recovery
  • JobTracker Web UI

Hadoop Cluster Planning

  • General Considerations for Planning
  • Correct Hardware Selection
  • Considerations for the Network
  • Nodes Configuration
  • Cluster Management Planning

Installation and Initial Configuration

  • Types of Deployment
  • Hadoop Installation and Configuration Specification
  • Performing the Initial Configuration of HDFS
  • Performing the Initial Configuration of MapReduce
  • Location of Log Files

Hive, Pig and Impala Installation and Configuration

  • Apache Hive
  • Apache Pig
  • Cloudera Impala

Hadoop Clients

  • Hadoop Client Installation and Configuration
  • Hue Installation and Configuration
  • Authenticating and Configuring Hue

Cloudera Manager

  • Cloudera Manager – Motivation
  • Common Features
  • Versions of Cloudera Manager – Standard & Enterprise
  • Topology
  • Cloudera Manager Installation
  • Hadoop Installation with Cloudera Manager
  • Basic Administration Tasks
  • Utilizing Cloudera Manager

Configuring Advanced Clusters

  • Advanced Parameters for Configuration
  • Hadoop Port Configuration
  • Explicit Host Inclusion and Exclusion
  • HDFS Configuration - Rack Awareness
  • HDFS Configuration - High Availability

Securing Apache Hadoop

  • Importance of Hadoop Security
  • Concepts of Hadoop's Security System
  • Overview of Kerberos for Security
  • Hadoop Cluster Security with Kerberos

Job Schedules and Management

  • Running Job Management
  • Hadoop Job Scheduling
  • FairScheduler Configuration Process

Maintaining Clusters

  • HDFS Status Checks
  • Copying Data Between Clusters
  • Cluster Node Additions and Removals
  • Cluster Rebalancing
  • NameNode Metadata Backup
  • Upgrading Clusters

Monitoring and Troubleshooting Clusters

  • General System Monitoring Techniques
  • Log File Management
  • Cluster Monitoring
  • Troubleshooting Issues

Objectives

  • Choosing Apache Hadoop
  • HDFS
  • HDFS and Data Ingestion
  • MapReduce
  • Planning for Hadoop Clusters
  • Installation and Initial Configuration
  • Hive, Pig and Impala Installation and Configuration
  • Hadoop Clients
  • Cloudera Manager
  • Configuration for Advanced Clusters
  • Apache Hadoop and Security
  • Job Schedules and Management
  • Maintaining, Monitoring and Troubleshooting Clusters

Class Exam

CCAH Certification Exam

Details:

  • Exam Code: CCA-410
  • Number of Questions: 60
  • Duration: 90 minutes
  • Passing Score: 70%
  • Delivery: Pearson VUE
  • Language: English, Japanese

Objectives:

CCA-410

  • HDFS (38%)
  • MapReduce (10%)
  • Hadoop Cluster Planning (12%)
  • Hadoop Cluster Installation and Administration (17%)
  • Resource Management (6%)
  • Logging and Monitoring (12%)
  • Hadoop's Ecosystem (5%)

CCA-500 and CCA-505

  • HDFS (17%)
  • YARN and MapReduce version 2 (17%)
  • Hadoop Cluster Planning (16%)
  • Hadoop Cluster Installation and Administration (25%)
  • Resource Management (10%)
  • Logging and Monitoring (15%)

Register for Class

Date                                             Location
01/08/19 - 01/11/19, 4 days, 10:00AM – 6:00PM    Online
01/29/19 - 02/01/19, 4 days, 10:00AM – 6:00PM    Online