Reading List For A Data Scientist Interview

Joseph Lee
Posted on Feb 28, 2016
I just wanted to suggest some readings that I personally found super helpful for my interviews and share them with the NYCDSA.
1. Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies
   John D. Kelleher, Brian Mac Namee, Aoife D’Arcy; 1st Edition

2. Machine Learning: A Probabilistic Perspective
   Kevin P. Murphy; 1st Edition
The second book, Machine Learning: A Probabilistic Perspective, is a very technical read, but it is a good source of technical questions on in-depth statistical learning.  It may be better suited to those with an advanced degree in mathematics.


However, the first book, Fundamentals of Machine Learning for Predictive Data Analytics, is by far my favorite supplementary read.  It is semi-technical (not as technical as Machine Learning: A Probabilistic Perspective, I should say), fairly easy to read, and covers the higher-level thinking behind data science methods in business applications.  It walks through the CRISP-DM (Cross-Industry Standard Process for Data Mining) approach and gives examples of how to implement it in different situations. For me, it helped consolidate everything that I learned in the bootcamp and helped me develop a big-picture understanding of, and approach to, my data science methodologies.  Furthermore, this book definitely helped me with the phone and non-programming interviews, which made up most of my data scientist interviews.  In fact, most of those interviews focused on my approach and data understanding rather than on pure programming questions, and this book helped me prepare for exactly that. Unfortunately, I don’t believe that there is a PDF version online, but there will likely be one soon, as this book is gaining popularity.


Anyways, I just wanted to share these two great resources with the program!

About Author

Joseph Lee

A recent graduate of Northwestern University with a B.S. in Biomedical Engineering and a minor in Computer Science, Joseph has a strong background in computer engineering and programming concepts. His previous work and academic studies contain a panoply...

Leave a Comment

Avatar January 5, 2018
Thank you for publishing this awesome article. I've been reading for a while but I've never been compelled to leave a comment. I've bookmarked your site and shared this on Facebook. Thanks again for a quality article!
Sung Pil Moon February 28, 2016
Thanks for sharing. I'll try to read, at least the first book you mentioned.