Rapid Deployment of Visualizations

Posted on Nov 6, 2016

Economic indicators are widely used and closely tracked by government and business entities. Accordingly, the quality of information in a given metric, and any loss of granularity, are important to consider. The purpose of this project was to compare employment-related data in the Census business surveys against unemployment data from the Bureau of Labor Statistics.

That’s the way the project started. However, due to unforeseen events, it got bogged down in munging epic amounts of data and navigating the nuances of the plotly package. While the opportunity for EDA was limited, the end product was a useful proof of concept for a quick-and-dirty visualization app that can be used in real business scenarios.

The app consists of a choropleth map of US states; clicking a state brings up detailed economic data about it. More than 10 years of data are available. The app can display a variety of metrics, depending on how the widgets in the header are configured. For example, in the figure below it has been configured to show the year-over-year percent change in total payroll for each state. Clockwise from the top left, the charts are:

  1. A map reflecting either year-over-year change or per capita metric value
  2. An industry breakout of year-over-year metric values
  3. A county-level version of the top-left map for the selected state
  4. Trending state-level unemployment data (line) and county-level breakout (dots)

[Figure: full-screen view of the app]
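As a rough illustration of the two calculations behind the first chart, here is a minimal sketch in Python. It is not the app's actual code (the app itself is a Shiny app); the sample rows, the row layout, and the function names `yoy_percent_change` and `per_capita` are all assumptions made up for this example.

```python
from collections import defaultdict

# Hypothetical sample rows: (state, year, total_payroll). Values are invented.
rows = [
    ("NY", 2013, 100.0),
    ("NY", 2014, 110.0),
    ("NJ", 2013, 200.0),
    ("NJ", 2014, 190.0),
]


def yoy_percent_change(rows):
    """Return {(state, year): percent change versus the prior year}."""
    # Group the values by state so each state has its own year -> value series.
    by_state = defaultdict(dict)
    for state, year, value in rows:
        by_state[state][year] = value

    changes = {}
    for state, series in by_state.items():
        for year, value in series.items():
            prior = series.get(year - 1)
            # Skip years with no prior-year value, or with a zero base.
            if prior:
                changes[(state, year)] = (value - prior) / prior * 100.0
    return changes


def per_capita(value, population):
    """Scale a metric by population (the map's other display mode)."""
    return value / population


changes = yoy_percent_change(rows)
```

The first year of each series has no prior year to compare against, so it is left out of the result rather than reported as zero.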

The user can change the metric shown in the charts, and all the plots update automatically.

[Figure: metric dropdown]

I also used rollover events to display more information. Had I had more time, I would have added a sub-industry breakdown of each bar in the industry chart, making it easy to drill down through three levels of data while keeping the top-level view visible. For illustrative purposes, I also added annotations to select rollover popups. When doing exploratory data analysis, it would be useful to be able to pin a note to a given data point for further reference. I also added the ability to filter based on the presence of annotations.

[Figure: rollover popup]
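The idea of pinning notes to data points and then filtering on them can be sketched in a few lines of Python. This is only an illustration under my own assumptions, not how the app stores its annotations: the `DataPoint` class, its fields, and the `annotated_only` helper are hypothetical names, and the sample values are invented.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DataPoint:
    state: str
    year: int
    value: float
    note: Optional[str] = None  # analyst's annotation, if one was pinned


def annotated_only(points: List[DataPoint]) -> List[DataPoint]:
    """Keep only the data points that carry a non-empty annotation."""
    return [p for p in points if p.note]


# Hypothetical sample data: one annotated point and one plain point.
points = [
    DataPoint("NY", 2014, 10.0, note="Compare against the unemployment trend"),
    DataPoint("NJ", 2014, -5.0),
]

flagged = annotated_only(points)
```

Keeping the note on the data point itself means the same filter can drive every chart that shares the data.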

I have seen many business cases where I would have found this sort of app useful. While choropleth maps and master-detail visualizations come standard with most enterprise business intelligence packages, waiting for your IT department to make a data set available can take a lot of time. This sort of Shiny app would fill the gap where urgency trumps resilience and scalability. For example, say you have rolled out a new product and instructed your salespeople to email their region’s orders daily, because the order data won’t be available to your BI system until the end of the week or month. An analyst could quickly throw a Shiny app together to make explorable data available to decision makers online, rather than relying on static reports sent via email. While the business intelligence space is already quite crowded and mature, there still appears to be a niche for Shiny -- and adjusted for cost, it has an infinite advantage!

Next steps for this project would be to analyze the relationship between unemployment, payrolls and the number of employees. It would be interesting to look for divergences between the economic health implied by one metric versus that implied by another. I would also like to track industry sectors over time to gain more insight into popular narratives such as the decline of manufacturing and the increase in health care spending.

About Author

Jason Sippie

Jason has an eclectic skill set including programming, data warehousing, business intelligence, and risk management, spanning Pharma and Finance domains. With one career in technology consulting and a second in financial services, he is excited about leveraging these...