Data Study on United States Population Projections

Posted on May 15, 2016
The skills the author demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.
Contributed by Denis Nguyen. He is currently in the NYC Data Science Academy 12 week full-time Data Science BootcampΒ program taking place between April 11th to July 1st, 2016. This post is based on his second class project - R Shiny (due on the 4th week of the program).

The U.S. Census, mandated by the Constitution, takes place every 10 years in order to count every resident in the United States. This data is used to determine the number of seats each state has in the U.S. House of Representatives and to distribute public funds to local communities for educational programs, health care, law enforcement and highways. Participating in the decennial census is a civic duty and refusing to do so could lead toΒ a fine of up to $5,000.

The most recent censusΒ took place in 2010, with the next one planned for 2020. The 2010 CensusΒ was the largest one ever performed, with over 70 percent of households returning responses. While this information is useful for the government, it can also be useful for marketing companies and other organizations with target audiences. Equipped withΒ population demographics, marketing companies know where to focus their efforts to yield the maximum return.

Objective

With this interest in mind, I wanted to know how the population was changing and the different ratesΒ at which race/ethnic groups were changing. In order to visualize these changes, I designed an app that allows users to see population projections made by the U.S. Census Bureau for the next 45 years.

Have fun exploring the appΒ and see what insights you can glean.

 

The Data

The data was obtained from the United States Census BureauΒ andΒ projections were produced using information from the 2010 Census and a cohort-component method. These projections are updated periodically in order to reflect the changing factors that affect population growth and data displayedΒ by the app was based on the projections made in 2014. The data contained projections up to 2060 and studied differences in births, deaths, migration, gender, and race/ethnicity.

The data had to be reorganized because some files had a column labeled "Hispanic origin" while other files had those choicesΒ under the race/ethnicity column. In order to keep the information easier to analyze, population count for Hispanic/Non-Hispanic origin was calculated and placed under the same column as the other races and ethnicities.

 

The App's DataΒ 

The app has five tab that allow users to visualize and examineΒ the projected populations. The app was initially planned to display all the information in one graph, giving users the ability to change multiple parameters at a time but the number of options was overwhelming so three tabs were created. This allows users to focus on differentΒ aspects of the population demographics without being lost in the options.

Insight into Population

The first section allows the comparison of the total population, births, deaths, and net migration inside a population. The user is able to pick a race/ethnicityΒ andΒ gender, and juxtapose rates ofΒ births, deaths, migrations, and total population to see the differences.

This set upΒ makes it easier for users to find interestingΒ crossoversΒ of birth rate and death rate and compare how migration rate may contrast. A slider alsoΒ enables users to focus on specific years, zooming into the graph so that they can see specific places of crossing over. Hovering over points on the graphs will give users exact numbers and clicking on a pointΒ will temporarily outlineΒ it until another point is clicked.

-Data Study on United States Population Projections

Insight intoΒ Gender

The second section is similar to the first but differs in focus. It allows users to compare how population may differ between males and females of a specific race/ethnicity. Users can not only compare the total population, but can also look at how birth rates, death rates, and migration rates may differ between genders.

-Data Study on United States Population Projections

Insight into Race

The third section concentratesΒ onΒ the differences among population projections of different races/ethnicities. By removing the "All" option, users can highlight theΒ groupsΒ of interest in order to viewΒ the differences in growth rate and whenΒ some groups may peak or decline in numbers.

Data Study on United States Population Projections

Map of the United States

This section exhibitsΒ the percent change and total change by state. The darker colors represent states that haveΒ a higher increase in population while lighter colors representΒ little population growth. The slider can be shifted to see the population change between two different years and users can hover over each state to get the exact percent/absolute change. The title of the map changes accordingly to reflectΒ the chosen parameters so that users understand what the map is displaying.

-Change

Different Projections

This section displaysΒ the U.S. Census Bureau total population projections made in three different years. Users can toggle on and off projection lines, as well as use the slider to zoom into the years of interest to get a better picture of differences. These projections are made by consideringΒ factors affecting population growth and areΒ periodically updated to reflect changes.

-Projections

Conclusion

This app can be used toΒ understand how the population of the United States is changing and the direction in which it will shift in the following years. Although we expect the population to increase, it is interesting to see where most of the growth is coming from. Some interesting finds include:

  • Immigrants currently make up approximately 30% of the population increase (births + net migration) and is projected to continue well into 2060, contributing 25% of the increase
  • In 2051, theΒ birth rate for multiracial individuals is expected toΒ be double the rate in 2014
  • Immigrants are theΒ largest contribution toΒ the increasing Asian population
  • It is projected that inΒ 2036, the number of deaths will exceed the number of births in the Caucasian population, but population will continue to increase due to immigration
  • The population of white females presently exceeds the population of white males; however, this is expected to change in 2050

Future

With insights attained from this app, marketing companies canΒ identify potential andΒ unsaturated markets andΒ cater to those groups.Β Even if you don't work for a marketing company, these trends could help you decide where to buy your next house. If you're looking for a populated area, consider moving to California where there is a projected high population increase.

If a home in a quiet and peaceful area is more of your liking, consider less-populated states such as West Virginia that are expected to have a decreasing population.Β There is information to be extracted from this app and when combined with additionalΒ information collected byΒ the U.S. Census Bureau, it canΒ be a powerful tool.

Enjoy exploring the app and see what interesting trends you uncover.Β Maybe you'll find your next conversation starter.

The code for this project can be found on GitHub.

About Author

Denis Nguyen

With a background in biomedical engineering and health sciences, Denis has a passion for finding patterns and optimizing processes. He developed his interest for data analysis while doing research on the effects of childhood obesity on bone development...
View all posts by Denis Nguyen >

Leave a Comment

No comments found.

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI