Data Science with R: Machine Learning

Data Science with R: Machine Learning

Data Science with R: Machine Learning

Intermediate

This 35-hour Machine Learning with R course introduces both the theoretical foundation of machine learning algorithms as well as their practical applications in R. It will introduce you to data mining, performance measures and dimension reduction, regression models, both linear and generalized, KNN and Naïve Bayes models, tree models, and SVMs as well as the Association Rule for analysis. After successfully completing of this course, you will be able to break down the mathematics behind major machine learning algorithms, explain the principles of machine learning algorithms, and implement these methods to solve real-world problems.

Course Overview
Intermediate

This 35-hour Machine Learning with R course introduces both the theoretical foundation of machine learning algorithms as well as their practical applications in R. It will introduce you to data mining, performance measures and dimension reduction, regression models, both linear and generalized, KNN and Naïve Bayes models, tree models, and SVMs as well as the Association Rule for analysis. After successfully completing of this course, you will be able to break down the mathematics behind major machine learning algorithms, explain the principles of machine learning algorithms, and implement these methods to solve real-world problems.

January Session
$2990.00
Early bird pricing
$2840.50
January Session
Jan 20 - Feb 24, 2018, 10:00am-5:00pm
June Session
$2990.00
Early bird pricing
$2840.50
June Session
Jun 9 - Jul 21, 2018, 10:00am-5:00pm

Date and Time

January Session Early-bird Pricing!

Jan 20 - Feb 24, 2018, 10:00am-5:00pm
Day 1: January 20, 2018
Day 2: January 27, 2018
Day 3: February 3, 2018
Day 4: February 10, 2018
Day 5: February 24, 2018
$2990.00$2840.50
Add to Cart

June Session Early-bird Pricing!

Jun 9 - Jul 21, 2018, 10:00am-5:00pm
Day 1: June 9, 2018
Day 2: June 23, 2018
Day 3: June 30, 2018
Day 4: July 14, 2018
Day 5: July 21, 2018
$2990.00$2840.50
Add to Cart

September Session Early-bird Pricing!

Sep 8 - Oct 13, 2018, 10:00am-5:00pm
Day 1: September 8, 2018
Day 2: September 15, 2018
Day 3: September 22, 2018
Day 4: September 29, 2018
Day 5: October 13, 2018
$2990.00$2840.50
Add to Cart

Instructors

David Romoff
David Romoff
David Romoff is a risk management consultant with 10 years of experience modeling market and credit risk using the latest methods and technologies. David's recent work includes serving as Manager of Risk Management at On Deck Capital, a business lending company in the FinTech space that uses machine learning models to underwrite loans. David was responsible for estimating and reporting losses on the book of loans. Previously, David worked in Enterprise Risk Management at AIG for five years where he designed and supported models on insurance risk, credit risk, and capital allocation. Before AIG, he worked at Bear Stearns in counterparty credit risk. David has an MBA from the Zicklin School of Business in New York City and a Master of Science in Actuarial Science from Columbia University. His undergraduate degree is from the State University of New York at Albany, where he studied psychology and philosophy.

Product Description


Overview

 

This 35-hour Machine Learning with R course introduces both the theoretical foundation of machine learning algorithms as well as their practical applications in R. It will introduce you to data mining, performance measures and dimension reduction, regression models, both linear and generalized, KNN and Naïve Bayes models, tree models, and SVMs as well as the Association Rule for analysis. After successfully completing of this course, you will be able to break down the mathematics behind major machine learning algorithms, explain the principles of machine learning algorithms, and implement these methods to solve real-world problems.

Details

 


Prerequisites

 

  • Knowledge of R programming
  • Able to munge, analyze, and visualize data in R

Syllabus

Unit 1: Foundations of Statistics and Simple Linear Regression

  • Understand your data
  • Statistical inference
  • Introduction to machine learning
  • Simple linear regression
  • Diagnostics and transformations
  • The coefficient of determination

Unit 2: Multiple Linear Regression and Generalized Linear Model

  • Multiple linear regression
  • Assumptions and diagnostics
  • Extending model flexibility
  • Generalized linear models
  • Logistic regression
  • Maximum likelihood estimation
  • Model interpretation
  • Assessing model fit

Unit 3: kNN and Naive Bayes, the Curse of Dimensionality

  • The K-Nearest Neighbors Algorithm
  • The choice of K and distance measure
  • Conditional probability: Bayes’ Theorem
  • The Naive Bayes’ Algorithm
  • The Laplace estimator
  • Dimension reduction
  • The PCA procedure
  • Ridge and Lasso regression
  • Cross-validation

Unit 4: Tree Models and SVMs

  • Decision trees
  • Bagging
  • Random forests
  • Boosting
  • Variable Importance
  • Hyperplanes and maximal margin classifier
  • Sort margin and support vector classifier
  • Kernels and support vector machines

Unit 5: Cluster Analysis and Neural Networks

  • Cluster analysis
  • K-means clustering
  • Hierarchical clustering
  • Neural networks and perceptrons
  • Sigmoid neurons
  • Network topology and hidden features
  • Back propagation learning with gradient descent

Final Project

After 35 hours of structured lectures, students are encouraged to work on an exploratory data analysis project based on their own interests. A project presentation demo will be arranged afterwards.


Recommended Readings

 

  • An Introduction to Statistical Learning with Applications in R, by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani
  • Applied Predictive Modeling, by Max Kuhn and Kjell Johnson
  • Data Mining with R, by Luis Torgo
  • Machine Learning with R, by Brett Lantz

Reviews

There are no reviews yet.

Be the first to review “Data Science with R: Machine Learning”

Your email address will not be published. Required fields are marked *

Instructors

David Romoff
David Romoff
David Romoff is a risk management consultant with 10 years of experience modeling market and credit risk using the latest methods and technologies. David's recent work includes serving as Manager of Risk Management at On Deck Capital, a business lending company in the FinTech space that uses machine learning models to underwrite loans. David was responsible for estimating and reporting losses on the book of loans. Previously, David worked in Enterprise Risk Management at AIG for five years where he designed and supported models on insurance risk, credit risk, and capital allocation. Before AIG, he worked at Bear Stearns in counterparty credit risk. David has an MBA from the Zicklin School of Business in New York City and a Master of Science in Actuarial Science from Columbia University. His undergraduate degree is from the State University of New York at Albany, where he studied psychology and philosophy.

Product Description


Overview

 

This 35-hour Machine Learning with R course introduces both the theoretical foundation of machine learning algorithms as well as their practical applications in R. It will introduce you to data mining, performance measures and dimension reduction, regression models, both linear and generalized, KNN and Naïve Bayes models, tree models, and SVMs as well as the Association Rule for analysis. After successfully completing of this course, you will be able to break down the mathematics behind major machine learning algorithms, explain the principles of machine learning algorithms, and implement these methods to solve real-world problems.

Details

 


Prerequisites

 

  • Knowledge of R programming
  • Able to munge, analyze, and visualize data in R

Syllabus

Unit 1: Foundations of Statistics and Simple Linear Regression

  • Understand your data
  • Statistical inference
  • Introduction to machine learning
  • Simple linear regression
  • Diagnostics and transformations
  • The coefficient of determination

Unit 2: Multiple Linear Regression and Generalized Linear Model

  • Multiple linear regression
  • Assumptions and diagnostics
  • Extending model flexibility
  • Generalized linear models
  • Logistic regression
  • Maximum likelihood estimation
  • Model interpretation
  • Assessing model fit

Unit 3: kNN and Naive Bayes, the Curse of Dimensionality

  • The K-Nearest Neighbors Algorithm
  • The choice of K and distance measure
  • Conditional probability: Bayes’ Theorem
  • The Naive Bayes’ Algorithm
  • The Laplace estimator
  • Dimension reduction
  • The PCA procedure
  • Ridge and Lasso regression
  • Cross-validation

Unit 4: Tree Models and SVMs

  • Decision trees
  • Bagging
  • Random forests
  • Boosting
  • Variable Importance
  • Hyperplanes and maximal margin classifier
  • Sort margin and support vector classifier
  • Kernels and support vector machines

Unit 5: Cluster Analysis and Neural Networks

  • Cluster analysis
  • K-means clustering
  • Hierarchical clustering
  • Neural networks and perceptrons
  • Sigmoid neurons
  • Network topology and hidden features
  • Back propagation learning with gradient descent

Final Project

After 35 hours of structured lectures, students are encouraged to work on an exploratory data analysis project based on their own interests. A project presentation demo will be arranged afterwards.


Recommended Readings

 

  • An Introduction to Statistical Learning with Applications in R, by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani
  • Applied Predictive Modeling, by Max Kuhn and Kjell Johnson
  • Data Mining with R, by Luis Torgo
  • Machine Learning with R, by Brett Lantz

Reviews

There are no reviews yet.

Be the first to review “Data Science with R: Machine Learning”

Your email address will not be published. Required fields are marked *

Testimonials View All Student Testimonials

Rahul Bhat
Rahul Bhat
Took the weekend course for Machine Learning with R. Course was very helpful in helping me understand the basics of Machine Learning, different models. My instructor was Luke. He was very helpful and would spend enough time covering each topic. He even took an additional class because he didn't want to rush through the material. Overall I am quite satisfied with the results. Would recommend Luke to anyone else who is interested to venture into Machine Learning field.
Lukasz
Lukasz
I studied mechanical engineering and physics for my undergrad at a top university and work in product management with a focus on search. I took this class to satisfy a personal interest in the subject matter and familiarize myself enough with the fundamentals of machine learning to be able to explore the field more deeply on my own. I was also motivated by a career interest: the subject matter is highly relevant to my domain, and I feel that developing an understanding of the concepts and how to deploy them myself will make me better at my job long-term. Prior to enrolling in the class, I spent roughly 8-10 hours learning R and felt sufficiently prepared (I had some previous programming experience). In the end I was extremely happy with this class (Machine Learning in R on Saturdays, 8 hrs at a time). The curriculum and content were excellent, the instructor, Luke, was fantastic and the assignments were challenging and informative. I felt the course did a really great job of driving home the core fundamentals of each subject with a focus on statistics, mathematical theory, derivations and best practices. We covered a LOT of material, yet the material had a lot of depth. I thought the sequencing of the subject matter was very well thought out as well. The class was demanding and had the caliber of a graduate-level course. The course also struck a very nice balance between theory and implementation. After learning about a new model, we would immediately implement it in class using R on our own machines. Luke did a particularly great job at relating the implementation back to the concepts and teaching us how to interpret outcomes of our analyses (I can’t stress enough how important this latter point was for me). He has a really strong grasp of the subject matter, he’s very patient and responsive to questions, offers a lot of insightful commentary on the theory, implementations, and best practices, and he cares about his students a lot. The homework assignments complement the class nicely as well, helping to drive home the methods taught in class and how to interpret your work. If you’re interested in developing a strong understanding of the fundamentals of machine learning in a rigorous format, this class is for you. I also couldn’t recommend Luke as an instructor more. He’s awesome! I was also was very pleased with my choice of the R class. R reduces a lot of the friction in model implementation, which allowed me to focus on developing an understanding of the concepts and interpreting results.
read more
Tingyan Zheng
Tingyan Zheng
Big Data Analyst at
GroupM
This course covers major R machine learning topics; it is intense, and you will learn a lot if you keep up with the pace. Instructor Shu Yan is great at explaining complicated statistical concepts/formulas and translate them into R coding techniques. Course materials, in-class practices, and homework assignment are helpful regarding learning and future references. I would recommend this course to anyone who is interested in data science/machine learning but doesn't know much about this field. It will be a good start for you if you plan to work in this area. It certainly helped me understand a lot about data science and improved my R coding skills. What I learned from this course is worth the money I paid and the effort I put in.
Mark Li
Mark Li
Quantitative Researcher at
Twitter

I took Machine Learning with R and Hadoop data engineering classes in 2015. They are all well-structured classes with extensive information coverage and concrete learning process design. All the techniques been told in the class are very practical and can be applied to work very fast. In addition, it is also a great opportunity to build your "data science" fellow network because all your classmates are "Pro" in this domain with a lot of wonderful industry experiences to share. I would definitely recommend NYC Data Science Academy to my friend!

Margaret Hung
Margaret Hung
SVP, Intelligence Solutions & Strategy at
Millward Brown Digital

As the business world becomes increasingly data-driven, the Data Sciences classes at NYC Data Sciences Academy are invaluable to driving career success, not only for actual data science practitioners, but those who collaborate with them day-to-day to execute on insights to be gleaned from data sciences. I just completed the Intermediate level Data Sciences with R class and have immediately benefited from the ability to understand the different type of advanced analytic techniques that are available to help my clients with their business issues, to better communicate and collaborate with our Data Sciences team on a tactical level and then to take their output and accurately translate it into our clients’ business language. The course was comprehensive and Vivian brings a lot of passion and dedication to the class and ensuring her students’ success.

Date and Time

January Session Early-bird Pricing!

Jan 20 - Feb 24, 2018, 10:00am-5:00pm
Day 1: January 20, 2018
Day 2: January 27, 2018
Day 3: February 3, 2018
Day 4: February 10, 2018
Day 5: February 24, 2018
$2990.00$2840.50
Add to Cart

June Session Early-bird Pricing!

Jun 9 - Jul 21, 2018, 10:00am-5:00pm
Day 1: June 9, 2018
Day 2: June 23, 2018
Day 3: June 30, 2018
Day 4: July 14, 2018
Day 5: July 21, 2018
$2990.00$2840.50
Add to Cart

September Session Early-bird Pricing!

Sep 8 - Oct 13, 2018, 10:00am-5:00pm
Day 1: September 8, 2018
Day 2: September 15, 2018
Day 3: September 22, 2018
Day 4: September 29, 2018
Day 5: October 13, 2018
$2990.00$2840.50
Add to Cart