Data Analysis Project: Employment Growth in the U.S.

Posted on Jan 26, 2020
The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.

Job Growth - "Are people getting hired?", "If so, where?"

The main goal of this exploratory data analysis project was to find out if there had been employment growth in the U.S. for the past several years, and if so, particularly in which state we can see the large growth.
Also, by narrowing down the changes by industry, I made it easier to analyze which industry is booming. Additionally, I performed time-series analysis to show the growth by year and enterprise size to analyze if there is a trend of employment growth by enterprise size. 

Please find my Shiny App application here and github codes here.

Data Summary


Based on the dataset from Census Bureau, "Statistics of US Business (SUSB) Employment Change Data Tables" was retrieved for the changes from 2010 to 2016.
Each dataset records birth and death of employment and expansion, and contraction of establishment (enteprise count) by industry classification based on 2012 North American Industry Classification System (NAICS) codes.

Heat Map - Industry


Overall, the total number showed the growht of 1.6%.

Some states showed tremendous growth such as Florida.

This heat map by state shows the following industries have the highest average growth:

Construction 4.6%
Arts, Entertainment, and Recreation 3.5%
Finance and Insurance 3%
Educational Services 3%

Lowest industry:
Mining, Quarrying, and Oil and Gas Extraction -10.7%

Some industries had the concentration of growth in some states (e.g. 23.2% growth in D.C. for Arts, Entertainment, and Recreation), but other industries displayed more general growth at many of states (e.g. Professional, Scientific, and Technical Services)

Time Series


From 2010 to 2016, there were both ups and downs in the employment and enterprise expansion/birth as shown in the following chart. It is notable that especially in 2010 and 2011, there were decreases in the number of enterprises.
However, from 2012, the changes had been positive changes.
When you select enterprise size, you can see that the smaller the enterprise size is, there is an increasing growth in the employment. It has to be noted that smallest enterprises did not even show negative changes when the enterprises of other sizes suffered the decrease.
The largest enterprise size, companies that employ more than 500 employees, showed slowest growth.

Conclusion
Overall, the numbers from the dataset display positive changes in most of industries over the recent years.
If one is looking for a job, it would be advisable to look for employment at smaller companies. Also, it should be noted that if you are shifting your career towards construction, arts, entertainment and recreation, finance and insurance, or educational services, the chances of employment are relatively high.

The next step of this research project could be to confirm the predicted positive changes based on the next economic census results to be released in next few years or other datasets.

 

Statistics of US Business (SUSB)

The Business Register is the Census Bureau’s source of information on employer establishments included in the Statistics of U.S. Businesses (SUSB) program. The Business Register is a multi-relational database that contains a record for each known establishment that is located in the United States or Puerto Rico and has employees.

GitHub

https://github.com/kisakiwata/ShinyApp2020

About Author

Kisaki Watanabe

Data Scientist with strong consulting experiences in data analytics/visualization and risk management, serving for industries ranging from social networking service, game, pharmaceutical, media, and advertising. Advanced skills in fraud investigation and trend projection/analysis with tools such as Tableau,...
View all posts by Kisaki Watanabe >

Leave a Comment

No comments found.

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI