Class Details

Implementing Machine Learning with R Price: $2,195

Training promotions may be available, contact a training consultant at 240-667-7507 for more information!

This three-day training will turn attendees into savvy data analysts with a solid foundation of unsupervised and supervised machine learning techniques. Students will use R to build data models and evaluate them. They will be able to find new patterns in data and predict behavior of new data points. The workshop optimizes learning by integrating practice time and discussion time in class to improve retention and provide individualized support throughout the session.

Price Match Guarantee Phoenix TS

Implementing Machine Learning with R Course Includes:

  • 3 days of instructor-led training
  • Class exercises in addition to training instruction
  • Courseware books, notepads, pens, highlighters and other materials
  • Course retake option
  • A second voucher included with full purchase 
  • Full breakfast with variety of bagels, fruits, yogurt, doughnuts and juice
  • Tea, coffee, and soda available throughout the day
  • Freshly baked cookies every afternoon - *only at participating locations

Course Outline

Introduction to foundational statistics:

  • Basic statistics
  • Expected value / standard deviation / variance / covariance
  • Statistical tests and significance
  • Linear regression
  • Single variable regression
  • Multiple regression

Best practices for model building:

  • Introduction to the model building process
  • Splitting data into train/test/validation sets
  • Multiple regression – dealing with correlated predictors
  • Commercial applications of regression

Regression model evaluation:

  • Distribution of errors: Q-Q plot, heteroscedasticity
  • Multiple regression
  • R2 and adjusted R2
  • p-values and t-test
  • F-test and F-distribution

Implementing clustering and unsupervised machine learning

  • What is unsupervised machine learning?
  • Introduction to clustering
  • k-means clustering on multi-dimensional data
  • Evaluating the quality of clustering Pitfalls of clustering

Introduction to classification:

  • k-Nearest Neighbors
  • Decision trees: gini coefficient and information gain
  • Introduction to random forests
  • Confusion matrices, misclassification rates
  • Base line errors and lift

Pitfalls and best practices of data science:

  • Understanding the limits of your data
  • Checking data validity
  • Ethical considerations
  • Best practices for data analysts


Implementing Machine Learning with R Price Training Objectives: 

  • Understand how data science can be used effectively in industry
  • Program confidently in R
  • Build data models and find patterns in data