Alumni

Alumni Spotlight: Katie Critelli, Data Scientist at Deutsche Bank

Katie Critelli had spent years doing research when she was considering an academic career. When she decided that she wanted to have greater flexibility and apply her skills outside academia, she recognized that the path of the data scientist was the one she wanted to pursue. To obtain the necessary skills and the assistance in […]

Read More

Posts with tag: XGBoost

Exploring the Financial Statements of US Companies
“You have to understand accounting and you have to understand the nuances of accounting. It’s the language of...
Posted on December 11, 2017
Scraping Soundcloud
Introduction It goes without saying that music is an essential part of the human condition. Despite its variety,...
David Kogan
Posted on December 8, 2017
Scraping Instagram for hashtags
  On Instagram, I have an account where I share pictures and/or videos related to my yoga practice....
Posted on December 3, 2017
Alumni Spotlight: Katie Critelli, Data Scientist at Deutsche Bank
Katie Critelli had spent years doing research when she was considering an academic career. When she decided that...
Posted on November 29, 2017
Kaggle's Competition: Predicting Housing Prices in Ames, Iowa
Introduction Kaggle.com is a website designed for data scientists and data enthusiasts to connect and compete with each...
M. Aaron Owen
Ben Brunson
Nicholas Maloof
Josh Yoon
, , and
Posted on November 20, 2017
Housing Prices in Ames, Iowa: Kaggle's Advanced Regression Competition
Introduction Residential real estate prices are fascinating... and frustrating. The homebuyer, the home-seller, the real estate agent, the...
Kathryn Bryant
Paul Ton
QUENTIN PICARD
Hans Lau
, , and
Posted on November 20, 2017
Kaggle Competition : House Pricing in Ames, Iowa
With the breakthrough of Machine Learning in recent years, it has seen rapid and successful deployment across many...
Chung Meng Lim
Wing Yan Sang
Theo Kwanga
, , and
Posted on November 19, 2017
IT Jobs Demand Analysis - A Snapshot from Monster Jobs Website
Project Description: This project is to show you the current IT Jobs demand in the USA. By scrapping...
Huy Tran
Posted on November 18, 2017
Predicting Iowa House Prices using Supervised Machine Learning Algorithms
Authors: Daniel Park, Dimitri Liakhovitski, Gwen Fernandez, & Henry Crosby NYC Data Science Bootcamp, November 2017 Project Background...
Scraping StockX: Adidas Yeezy Resell Analysis
Introduction The advancement in technology in the last decade has led to a huge increase in both the...
Josh Yoon
Posted on November 16, 2017
Don't Know Much About History: Visualizing the Scale of Major 20th Century Conflicts (Details)
Check out the code here while you read the article.   Executive Summary: I used advanced programming features...
Posted on November 10, 2017
Making sense of scraped Reddit commentary using NLP techniques.
Why would I do this? Any institution’s lifeblood rests upon agents from the outside. Like any organism, external...
Posted on November 8, 2017
How to Recommend Pet Food Product from Unsupervised Learning
Market  Overview Sales of pet food in the US has increased by 40 percent for the first quarter...
Summer Sun
Posted on November 7, 2017
Alumni Spotlight: Claire Keser, Senior Analyst at Casper
Claire Keser is currently a Senior Analyst on the Data Analytics team at Casper, a startup that disrupted...
Posted on November 1, 2017
Joining Data Without A Key
More detail for this project (including analysis and findings) can be found in my team’s capstone project write-up....
Mitch
Posted on October 31, 2017
Predicting Success on Stack Overflow
Can machine learning models outcompete humans in outcomes prediction on the popular tech Q&A site? Introduction As the...
Posted on October 31, 2017
Scrape My Professors: Comparing Institutions of Higher Learning by Department
Introduction Choosing where to pursue or continue your education is a daunting task. According to the Digest of...
M. Aaron Owen
Posted on October 30, 2017
Topic Modeling and User Clustering on Internet Discussion Forums - A Case Study
Overview Internet forums or message boards are online discussion sites. They're used for a variety of purposes, including...
Paul Ton
Posted on October 30, 2017
Webscraping Every Platinum Record: What Happened to the Album?
Introduction Over the course of the past 20 years, the music industry has experienced an incredibly drastic change...
Henry Crosby
Posted on October 30, 2017

View Posts by Categories


View Posts by Tags

#Wordcloud 2009 2020 2D Heat Map active ingredient activity tracking afinn agile Airbnb airlines airport alcohol ale amazon amazon web services AMECO American Time Use analysis analyzer API app apple ARIMA Art aws AWS kinesis Backtesting baseball bayesian optimization beautiful soup beer bestseller Better Life Index Big Data bike Biology Bloomberg Board Games BoardGameGeek bokeh book title books boost breast budweiser cancer capstone Car chipotle citibike classification climate clustering code coffee collaborative filtering College colon Common Core computer vision conditioner convolutional neural networks coors correlation cosine similarity count coupons credit default CustomerService D3.js dashboard data data analysis data cleaning Data Engineering Data mining data science data visualization databricks dating app deals Debt decision deep learning Demo Day dice dickey-fuller digestion doc2vec doctors dplyr drug dygraph ebooks EC2 EchoNest economic risk economy Education employee reviews employment ensemble Entrepreneurship equities Equities Predictive Analytic Equity Valuation EU European Union Eurozone facebook FDA feature engineering Federal finance Financial Crisis Firebase fiverr Flask Flask App Fleet Optimization flights flow Football forecast framework freelance freelancing gbm GDP GDP growth Genetics gensim ggmap ggplot2 ggrepel gigs GitHub Glassdoor glm googleVis H-1B h2o Hadoop haircare hairoil health heat map higgs boson Hive house price houses housing market hyperparameter igraph image image classification imdb immigration indeed indices instagram InteractiveMap Investment Investments iphone ipywidgets Italy java jeopardy! job listing kafka Kaggle kpss Lasso lasso regression lazyeval LDA leaflet Lifestyle likes linear regression Loans Logistic Regression logistics regression lung machine learning macro macroeconomics major league baseball making money manufacturing manuu Map Maps Material matplotlib medical schools migration model selection models mongodb mortality Mortgage movie reviews movies moving window cross validation multiple linear regression Muse museum music MySQL natural language processing NDA network neural network Neural networks New Drug Application new york city new-york-times newegg NewYorkCity NFL nlp nltk Non-profit NY NYC NYC Open Data OECD Open Data outsource outsourcing pain reliever Painting pandas Pandora passive income Pathway of Hope patterns pca pedestrian safety pharmaceutical Pipeline playlist plotly prediction Predictive Analytics predictor price data principal component anlysis process profit warning project workflow prostrate psychotherapists pyLDAvis python python machine learning python scrapy python web scraping python webscraping Python Workshop Q-Learning R R Programming R Shiny R Shiny Web Scrapping Python Apartments r studio R Visualization R Visulization R Workshop R-bloggers random forest rate rbm rdrop2 RDS real-estate recommendation recommendation system recursion reddit regression Reinforcement Learning Renthop Restaurant Retail reviews ridge regression risk rmongodb RShiny russia russian economy S&P S3 Saprk Apache sberbank Science Scikit learn scraping Scrapy scrapy visualization seaborn Search Activity selectorgadget Selenium sentiment sentiment analysis services shampoo Shiny Shiny Application Shiny Dashboard ShinyApp shorts skincare soccer social social media society source Spain Spark Sports Spotify stacking stationarity statistics stock market stocks storms streaming streaming API Student svd SVM table plot Tableau television temperature Tensorflow text text analysis tf-idf TFIDF time series topic model topic modeling trade traffic travel trees tripadvisor Trump Turnaround TV twitter twitter influencers UK unemployment united United Kingdom unsupervised learning US US GDP USA visa vision zero visualization voting classifier weather web scraping webscraping word cloud word frequency word2vec wordcloud2 workflow XGBoost Yelp Youtube Zillow zocdoc