Class Details

Price: $0

3 Day Course Includes:

  • Class exercises in addition to training instruction
  • Courseware books, notepads, pens, highlighters and other materials
  • Free subscription to Cloudera's practice exam questions
  • Full breakfast with variety of bagels, fruits, yogurt, doughnuts and juice
  • Tea, coffee, and soda available all day
  • Freshly baked cookies every afternoon - * only at participating locations

For group training options, please call us at (240) 667-7757 or email 

Course Outline

Module 1: Introduction

Module 2: HDFS

Module 3: MapReduce

Module 4: Planning a Hadoop Cluster

Module 5: Installation and Configuration

Module 6: Identity, Authentication, and Authorization

Module 7: Resource Management

Module 8: Cluster Maintenance

Module 9: Troubleshooting

Module 10: Monitoring

Module 11: Backup and Recovery


At the conclusion of this course, students will be able to do the following;

·          Explore MapReduce in depth, including steps for developing applications with it

·          Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN

·          Learn two data formats: Avro for data serialization and Parquet for nested data

·          Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)

·          Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop