https://nycdsa-blog-files.s3.us-east-2.amazonaws.com/2019/09/3557a319d64e0a74cd1b6b342c5a3f88/Screen-Shot-2019-01-16-at-11.56.01-AM.png
Capstone Jan 16, 2019

Predictive Customer Lifetime Value and Product Recommendation for Retail

Array ( ) On any given day, countless transactions are being made in the retail space. All the transactions generate data, which can be utilized by merchants to improve their sales and help them make important business decisions. As part of our capstone, we consulted two retail clients to explore and identify trends in their customer behavior by building visualizations as well as predictive models.
Getting Started with Data Science
Congratulations! The fact that you’re here means you’re probably trying to figure out what career path is right...
Career Day at NYC Data Science Academy, June 26, 2019
On June 26, NYC Data Science Academy had hosted the 'Data Scientist and Employer Networking Event' for bootcamp...
Scraping and Exploring IMDB
Introduction IMDB is one of the most widely used websites for people deciding whether or not a movie...
Scraping the NYC Rental Market
Introduction     NYC is constantly flooded with advertisements for apartments and condos for a variety of openings...
Web Scraping with a Headless Browser: A Puppeteer Tutorial
    Web development has moved at a tremendous pace in the last decade with a lot of...
Amazon Customer Reviews
Background More and more companies turn to social media to understand their customers, in order to improve their...
Alibaba eCommerce Analysis
Team Alibaba  Lan Mond | Yuqin Xu | Maomao Yi | Xiaofeng Zeng (Fred) Project & Dataset background:  ...
Scraping Sephora and a Flask Querying Application
Photo by Eric Wüstenhagen Introduction For my scraping project, I decided to scrape product reviews from Sephora's website....
Predicting Housing Prices with Machine Learning
In this project our goal was to build accurate models to predict sale prices for houses in Ames,...
How to Build a Data Science Portfolio That Will Get You Hired
When it comes to breaking into data science, your skills are your foundation and you must be able...
A POS Tag Approach to Predict Drug Interactions & User Score Rating
1,2471,247 Comments in moderation This blog explores the utility of a (NLP) Part-Of-Speech Tag counts based methodology to...
United States: Fury Road?
Introduction and Motivation When I was researching crime rates for my visualization project (check it out here!), I...
Can you lift? : Sports Analytics on World Powerlifter's performance
  ShinyApp | Github      For readers who only have a vague idea of "powerlifting"     Powerlifting...
Spotify X Billboard
Introduction Music can be everywhere. When we are waking up, during in-transit, at work, and spending time with...
Machine Learning - House Price Prediction - Ames, Iowa
Team:  Lan Mond,  Yuqin Xu, Fred Zeng Background This project is aimed at developing a model(s) to predict...
Yoga Retreat Worldwide
Background This project is aimed at Scraping Data from randomly selected website and conduct basic analysis on it...
Tennis: Grand Slam Prizes Over the Years
Introduction According to Tennis Industry Association, there are 17.9 million players playing the sport of tennis in the...

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

2019 airbnb alumni Alumni Interview Alumni Spotlight alumni story Alumnus API artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Big Data bootcamp Bootcamp Prep Bundles California Cancer Research capstone Career citibike clustering Coding Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Industry Experts Job JP Morgan Chase Kaggle lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Open Data painter pandas Portfolio Development prediction Programming PwC python python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest recommendation recommendation system regression Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Tableau Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping What to expect word cloud word2vec XGBoost yelp