Class Details

Price: $2,295

3-Day Course Includes:

  • Class exercises in addition to training instruction
  • Courseware books, notepads, pens, highlighters and other materials
  • Free subscription to Cloudera's practice exam questions
  • Full breakfast with variety of bagels, fruits, yogurt, doughnuts and juice
  • Tea, coffee, and soda available all day
  • Freshly baked cookies every afternoon - * only at participating locations

For group training options, please call us at (240) 667-7757 or email 

Course Outline

Data Science

  • Intro to Data Science
  • Data Science Growing Need
  • Data Scientist's Role in Business

Evaluating Use Cases

  • Finance, Retail, and Advertising
  • Telecommunications and Utilities
  • Healthcare and Pharmaceuticals
  • Defense and Intelligence

Understanding the Project Lifecycle

  • Project Lifecycle Steps
  • Lab Scenarios

Data Acquisition

  • Sourcing Data
  • Acquisition Methods

Reviewing Input Data

  • Data Quality and Quantity
  • Data Formats

Data Formation

  • File Format Conversion
  • Anonymization
  • Datasets

Analysis and Statistical Techniques

  • Statistics and Probability
  • Descriptive and Inferential Statistics

Machine Learning Fundamentals

  • Three C's of Machine Learning
  • Naive Bayes Classifiers
  • Algorithms and Data


  • Recommender Systems
  • Collaborative Filtering
  • Recommender System Limitations
  • Important Core Concepts

Apache Mahout

  • Mahout Capabilities and Purpose
  • Mahout History
  • Installing Mahout
  • Learning How to Utilize Item-Based Recommenders

Mahout and Recommender Implementation

  • Binary Preference Metrics
  • Numeric Preference Metrics
  • Understanding Scoring

Conducting and Evaluating Experiments

  • How to Measure Recommender Success
  • Design for Successful and Effective Experiments
  • UIs for Recommenders

Production Deployment

  • Production Deployment Overview
  • Developing Conclusions and Creating Visual Results
  • Performance Optimization Considerations


  • Data Scientists in the Modern World
  • Data Science Business Applications
  • Where are Data Applications Implemented
  • Acquiring Data, Source Data Evaluation, and Data Transformation
  • Machine Learning, Algorithms, Data Platforms
  • Implementing and Managing Recommenders with Apache Mahout
  • Production Deployment for Data Analytics Projects

Class Exam

Cloudera Certified Professional: Data Scientist (CCP:DS) Certification Exam

To earn this accredidation, individuals must pass the Data Science Essentials (DS-200) exam and complete the Data Science Challenge.


  • Exam Code - DS-200
  • Questions: 60 Questions with 6-10 extra beta questions
  • Types of Questions - Multiple choice, reading passages and matching
  • Time Limit - 90 minutes
  • Passing Score - 500 on scale of 0-700
  • Language - English


  • Data Acquisition
  • Data Evaluation
  • Data Transformation
  • Machine Learning Basics
  • Clustering
  • Classification
  • Collaborative Filtering
  • Model/Feature Selection
  • Probability
  • Visualization
  • Optimization

Phoenix TS is an authorized testing center for Pearson VUE and Prometric exams. To register for exams contact us or visit the Pearson VUE or Prometric websites.