Alumni Spotlight: Katie Critelli, Data Scientist at Deutsche Bank

Katie Critelli had spent years doing research when she was considering an academic career. When she decided that she wanted to have greater flexibility and apply her skills outside academia, she recognized that the path of the data scientist was the one she wanted to pursue. To obtain the necessary skills and the assistance in […]

Read More

Posts within category: Capstone

Understanding The Drivers Of CTR In Mobile Display Advertising
Introduction: According to the Wireless Association’s Website (, there were approximately 255.4 million American wireless subscribers, representing a...
Michael Chuang
Posted on January 22, 2018
Development of Game AI for Two Sigma Halite II Challenge
Introduction Halite is an open source artificial intelligence programming challenge, created by Two Sigma, where players build bots...
Hans Lau
Shubh Varma
Ilyas Shomayev
, , and
Posted on January 20, 2018
Predicting Customer's Drink of Choice From Real-Time Image Capture
Introduction Our team had the privilege of collaborating with the data science team of one of the largest...
Predicting clicks in mobile advertising: An experiment
Introduction  Advertising is a multi-billion dollar industry that acts as a bridge between companies and their customers. While...
Kathryn Bryant
Paul Ton
M. Aaron Owen
, and
Posted on December 21, 2017
Analyzing and Predicting European Soccer Match Outcomes
Introduction Soccer, in my opinion,  is not only  the most popular but  the  best sport in the world....
Efezino Erome-Utunedi
Posted on October 18, 2017
Real time Yelp reviews analysis and response solutions for restaurant owners
Motivation Before trying a new restaurant, we frequently consult with review platforms, such as Yelp, Zomato, or Google,...
Yu-Han Chen
Posted on September 29, 2017
Identifying "Fake News" With NLP
Introduction What is fake news? We’ve all heard of it, but it is not always easy to identify....
Julia Goldstein
Posted on September 18, 2017
Redefining Cancer Treatment: Predicting Gene Mutations to Advance Personalized Medicine
Introduction and Project Scope One of the most exciting frontiers for machine learning is the field of medical...
Recommending your car brand
Buying a new car is a big and exciting step, especially when it is your first car. Research...
Steven Jongerden
Posted on September 13, 2017
Employee Attrition Analysis
Introduction Attrition is a common issue that every company has to deal with. The goal of the HR analytics project...
Posted on September 11, 2017
Instacart Market Basket Analysis - Reorder Analysis
Introduction InstaCart market basket analysis was a Kaggle competition that was open early 2016 and was conducted by...
Posted on September 8, 2017
Facial Expression Recognition with Tensorflow
Introduction: What's Deep Learning? If you have a basic understanding of Neural Network, then it's easy to explain....
Jian Qiao
Posted on August 24, 2017
Build up a near real time Twitter streaming analytical pipeline from scratch using Spark and AWS
  Introduction: This blog is about the technical implementation of streaming analysis pipeline in our capstone project: Creating...
Posted on August 20, 2017
Credit Card Fraud Detection
Introduction One of the major pain points for the credit card industry has been to accurately find potential...
Smitha Mathew
Posted on August 4, 2017
A Hybrid Recommender with Yelp Challenge Data -- Part I
This is the first part of the Yelper_Helper capstone project blog post. Please find the second part here....
Creating a Real-time Streaming Analytical Platform to manage social media marketing campaign
Motivation and Vision The goal of the project was to provide actionable, scalable and data-driven insights to marketing...
Forecasting Economic Risk in the EU into 2020
Written by Chen Trilnik and Jack Yip. To view the original source code, visit our Github repo here.  ...
Jack Yip
Chen Trilnik
Posted on June 27, 2017
Automating Wikipedia's Manual Processes
"If I download a copy of Wikipedia onto my computer, is my machine any smarter?... Of course not: my...
Rachel Kogan
Posted on June 26, 2017
U.S. Residential Energy Use: Machine Learning on the RECS Dataset
Contributed by Thomas Kassel. He is currently enrolled in the NYC Data Science Academy remote bootcamp program taking...
Thomas Kassel
Posted on June 1, 2017

View Posts by Categories

View Posts by Tags

#Wordcloud 2009 2020 2D Heat Map active ingredient activity tracking afinn agile Airbnb airlines airport alcohol ale amazon amazon web services AMECO American Time Use analysis analyzer API app apple ARIMA Art artificial intelligence aws AWS kinesis Backtesting baseball bayesian optimization beautiful soup beer bestseller Better Life Index Big Data bike Biology Bloomberg Board Games BoardGameGeek bokeh book title books boost breast budweiser cancer capstone Car chipotle citibike classification climate clustering code coffee collaborative filtering College colon Common Core computer vision conditioner convolutional neural networks coors correlation cosine similarity count coupons credit default CustomerService D3.js dashboard data data analysis data cleaning Data Engineering Data mining data science data visualization databricks dating app deals Debt decision deep learning Demo Day dice dickey-fuller digestion doc2vec doctors dplyr drug dygraph ebooks EC2 EchoNest economic risk economy Education employee reviews employment ensemble Entrepreneurship equities Equities Predictive Analytic Equity Valuation EU European Union Eurozone facebook FDA feature engineering Federal finance Financial Crisis Firebase fiverr Flask Flask App Fleet Optimization flights flow Football forecast framework freelance freelancing gbm GDP GDP growth Genetics gensim ggmap ggplot2 ggrepel gigs GitHub Glassdoor glm googleVis H-1B h2o Hadoop haircare hairoil health heat map higgs boson Hive house price houses housing market hyperparameter igraph image image classification imdb immigration indeed indices instagram InteractiveMap Investment Investments iphone ipywidgets Italy java jeopardy! job listing kafka Kaggle kpss Lasso lasso regression lazyeval LDA leaflet Lifestyle likes linear regression Loans Logistic Regression logistics regression lung machine learning macro macroeconomics major league baseball making money manufacturing manuu Map Maps Material matplotlib medical schools mentalillness migration Mobile advertising model selection models mongodb mortality Mortgage movie reviews movies moving window cross validation multiple linear regression Muse museum music MySQL natural language processing NDA network neural network Neural networks New Drug Application new york city new-york-times newegg NewYorkCity NFL nlp nltk Non-profit NY NYC NYC Open Data OECD Open Data outsource outsourcing pain reliever Painting pandas Pandora passive income Pathway of Hope patterns pca pedestrian safety pharmaceutical Pipeline playlist plotly prediction Predictive Analytics predictor price data principal component anlysis process profit warning project workflow prostrate psychotherapists pyLDAvis python python machine learning python scrapy python web scraping python webscraping Python Workshop R R Programming R Shiny R Shiny Web Scrapping Python Apartments r studio R Visualization R Visulization R Workshop R-bloggers random forest rate rbm rdrop2 RDS real-estate recommendation recommendation system recursion reddit regression Renthop Restaurant Retail reviews ridge regression risk rmongodb RShiny russia russian economy S&P S3 Saprk Apache sberbank Science Scikit learn scraping Scrapy scrapy visualization seaborn Search Activity selectorgadget Selenium sentiment sentiment analysis services shampoo Shiny Shiny Application Shiny Dashboard ShinyApp shorts skincare soccer social social media society source Spain Spark Sports Spotify stacking stationarity statistics stock market stocks storms streaming streaming API Student svd SVM table plot Tableau television temperature Tensorflow text text analysis text mining tf-idf TFIDF time series topic model topic modeling trade traffic travel trees tripadvisor Trump Turnaround TV twitter twitter influencers UK unemployment united United Kingdom unsupervised learning US US GDP USA visa vision zero visualization voting classifier weather web scraping webscraping word cloud word frequency word2vec wordcloud2 workflow XGBoost Yelp Youtube Zillow zocdoc