Data Study on High Population Densities and Increased Crime

Posted on Jul 20, 2016

Do Higher Population Densities Increase Crime?

Crime, particularly violent crime, is always prominent in the public consciousness. At the same time, the UN reported in 2014 that population densities and the prevalence of urban areas continue to increase, with more than half the world's population living in urban areas for the first time in history.

The relationship between crime rates and population density is unclear from an intuitive standpoint. It seems likely that crime rates increase as population densities increase. You don't shoot your neighbor in the country, right? But when you are traveling alone at night, a higher population density makes it more likely that other people are nearby, which lowers your chances of being mugged. Denser areas also tend to have a stronger tax base, allowing for more police, who at the same time have less area to patrol. So which is it: do crime rates, which measure the number of incidents per 100,000 people, go up or down with increasing population density?

It turns out that the answer to this question is rather complex. Over the years, population density has increased throughout the state, while the crime rate has consistently gone down. Nevertheless, there continues to be a correlation between density and crime. How could this be? That is the question we will answer in this blog post by looking at publicly available data from New York State.

Data on the population density in NYC counties vs New York's counties

Since we are interested in how population density is related to crime rates, we first look at what the population density is in New York State and how it has changed over time. It is important to consider population density, not simply absolute population, when working with geospatial metrics. The population density in New York City is nearly an order of magnitude higher than in the rest of New York's counties, which makes graphical comparisons more difficult.

The NYC counties and the New York counties outside NYC will be investigated separately because of the strong disparity in population densities and the differing availability of data for NYC counties. The data is published by the State of New York and maintained by OpenData NY.

For each county and each year from 1990 to 2015, the dataset includes the population, the absolute number of crimes, and the crime rate (incidents per 100,000 people) for four crime metrics: index, property, violent, and firearm. These metrics are collected by the FBI through the National Uniform Crime Reporting Program (UCR).

As the data source describes it: "The UCR reporting system collects information on seven crimes classified as Index offenses which are most commonly used to gauge overall crime volume. The Index Rate includes the violent crimes of murder/non-negligent manslaughter, forcible rape, robbery, and aggravated assault; and the property crimes of burglary, larceny, and motor vehicle theft."
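To make the rate and index definitions concrete, here is a minimal Python sketch (the post itself was generated with R, so this is an illustration and not the author's code). The offense keys, counts, and population below are invented for the example and are not values from the dataset.

```python
# Sketch of the definitions above: index = violent + property offenses,
# rate = incidents per 100,000 residents.
# All names and numbers here are hypothetical, not taken from the dataset.
VIOLENT_OFFENSES = ("murder", "rape", "robbery", "aggravated_assault")
PROPERTY_OFFENSES = ("burglary", "larceny", "motor_vehicle_theft")


def rate_per_100k(count, population):
    """Return incidents per 100,000 residents."""
    if population <= 0:
        raise ValueError("population must be positive")
    return count * 100_000 / population


def summarize(counts, population):
    """Compute violent, property, and index rates for one county-year."""
    violent = sum(counts[offense] for offense in VIOLENT_OFFENSES)
    prop = sum(counts[offense] for offense in PROPERTY_OFFENSES)
    return {
        "violent_rate": rate_per_100k(violent, population),
        "property_rate": rate_per_100k(prop, population),
        "index_rate": rate_per_100k(violent + prop, population),
    }


# Invented counts for a single made-up county in a single year.
example_counts = {
    "murder": 2, "rape": 10, "robbery": 40, "aggravated_assault": 98,
    "burglary": 300, "larceny": 1400, "motor_vehicle_theft": 150,
}
example_population = 200_000
rates = summarize(example_counts, example_population)
```

Because the index count is the sum of the violent and property counts over the same population, the index rate is simply the sum of the two component rates.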

Information for the NYC counties of the Bronx, Kings, Manhattan, Queens and Richmond was provided only for 1990-2001, which supports the decision to investigate the NYC counties separately from the rest of New York's counties.

Data on the mean population density in New York counties vs NYC

Keep in mind that the two graphs above use different axis scales. Population density clearly increases over the timespan covered by the available data, both for New York counties outside NYC and for NYC counties. The upward trend in population density in New York State outside NYC, seen on the left, has continued steadily over the last 25 years with the exception of two spikes: one just before the millennium, followed by a decrease the next year, and one in 2010, followed by an increase. In NYC, the population density was flat until a sharp upward spike beginning in 1999, the same year a similar trend was seen in New York State outside NYC.
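As a rough illustration of how a mean population density per year could be computed for the two groups of counties, here is a minimal Python sketch. It assumes each county's land area is available from some other source (the post only says the dataset contains population), takes an unweighted mean over counties, and uses placeholder county names and invented numbers.

```python
from collections import defaultdict

# County labels as listed in the post; the dataset itself may label them differently.
NYC_COUNTIES = {"Bronx", "Kings", "Manhattan", "Queens", "Richmond"}


def mean_density_by_group(records):
    """Average county density (people per sq. mi.) per (group, year).

    records: iterable of (county, year, population, area_sq_mi) tuples.
    The area column is an assumption; it is not described in the post.
    """
    sums = defaultdict(lambda: [0.0, 0])  # (group, year) -> [density total, county count]
    for county, year, population, area_sq_mi in records:
        group = "NYC" if county in NYC_COUNTIES else "outside NYC"
        entry = sums[(group, year)]
        entry[0] += population / area_sq_mi
        entry[1] += 1
    return {key: total / n for key, (total, n) in sums.items()}


# Invented populations and areas, chosen only to make the arithmetic easy to follow.
# "County X" and "County Y" are placeholders for counties outside NYC.
records = [
    ("Kings", 1990, 2_100_000, 70.0),
    ("Queens", 1990, 1_962_000, 109.0),
    ("County X", 1990, 261_500, 523.0),
    ("County Y", 1990, 5_160, 1720.0),
]
means = mean_density_by_group(records)
```

With these made-up numbers the two group means differ by roughly two orders of magnitude, which is the kind of gap that makes plotting both groups on one axis impractical.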

Data on crime rate in the past 25 years in New York State


The index crime rate for NY counties outside NYC has dropped by more than one third over the last 25 years. Because the property crime rate has often been nearly an order of magnitude higher than the violent crime rate, the change in the index rate is largely shaped by the change in the rate of property crime, shown in green.

The changes in the violent crime rate, shown in blue, and in the firearm crime rate, which is a subset of the violent crime rate and is shown in purple, are difficult to distinguish in this graph, but both rates also decrease over the 25-year time span. If you expected crime rates to increase with increasing population density, the trends in the last two sections begin to cast doubt on your assumption.

Data on crime rate during the 1990s in NYC


It turns out that New York City also benefited from decreasing crime rates over the 11-year period for which data was reported in this dataset. The index rate fell by more than half over the 1990s alone, indicating that factors other than population density, which increased over the same period, have a strong impact on crime rates. As a percentage, the crime rates in NYC decreased more than those in NY counties outside NYC over the years 1990-2001.

Crime rates rose with population density outside NYC where density remained under 500 ppl/sq.mi


In NY counties outside NYC, the index crime rate shows an increasing trend with increasing population density up to 500 ppl/sq.mi. The trend can be hard to see, so a least squares regression line was added solely as a visual aid. Each point on this scatterplot represents the population density of one county in one year. All years between 1990-2015 are shown. The decreasing crime rates during that period account for a large part of the variance at a given population density. For these less densely populated areas, the trend for each individual year is essentially the same.
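For readers who want to see what such a trend line involves, here is a minimal Python sketch of an ordinary least squares fit restricted to points below the density cutoff. It is not the author's R code, and the (density, index rate) pairs are invented purely to show the mechanics.

```python
def least_squares_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


DENSITY_CUTOFF = 500  # ppl/sq.mi, the threshold discussed in the post

# Hypothetical (density, index rate) pairs, one per county-year; not real data.
points = [(50, 2100), (120, 2500), (200, 2400), (350, 3100),
          (480, 3300), (900, 2800), (2500, 2200)]

# Keep only the less densely populated county-years before fitting.
low_density = [(d, r) for d, r in points if d < DENSITY_CUTOFF]
slope, intercept = least_squares_line(
    [d for d, _ in low_density], [r for _, r in low_density]
)
```

A positive slope on the filtered points corresponds to the upward trend described above. Fitting each year separately would be a simple way to check the claim that the per-year trends look alike.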

The index crime rate rises with population density in NYC


Similar to the New York counties outside NYC with a population density below 500 people per square mile, the counties in NYC show an increasing crime rate with increasing population density. Since the overall index crime rate for both sets of counties decreased over time, but the index rate still increases with increasing population density, two possible explanations are: 1. the crime rate decreased uniformly over all counties, or 2. the crime rate decreased more in counties with lower population density, maintaining the upward correlation.
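Both explanations can be checked with a small worked example. The Python sketch below uses invented densities and rates (not the dataset) and interprets "uniformly" as the same percentage drop everywhere, which is an assumption. In both scenarios the overall rate falls while the correlation with density stays positive.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical counties: crime rate initially rises with density.
densities = [50, 150, 250, 350, 450]
rates_before = [2000, 2300, 2600, 2900, 3200]

# Explanation 1: every county's rate drops by the same 30%.
rates_uniform = [r * 0.7 for r in rates_before]

# Explanation 2: less dense counties see larger drops (40% down to 20%).
drop_fractions = [0.40, 0.35, 0.30, 0.25, 0.20]
rates_uneven = [r * (1 - f) for r, f in zip(rates_before, drop_fractions)]

for label, rates in (("uniform", rates_uniform), ("uneven", rates_uneven)):
    mean_after = sum(rates) / len(rates)
    print(label, round(mean_after, 1), round(pearson(densities, rates), 3))
```

The opposite case, in which denser counties see sufficiently larger drops, could flatten or reverse the correlation, which is why only these two explanations are compatible with the observed pattern.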

Crime rates did not increase with population density above 500 ppl/sq.mi outside NYC


The main insight is that, in counties outside NYC, the index crime rate decreased with increasing population density at densities above 500 people per square mile. The counties with the highest population densities in the scatterplot above are Nassau, Westchester, and Rockland, all of which are directly adjacent to NYC counties.

The decrease is unusual, considering the upward trend seen among the NYC counties and among the counties with population densities below 500 people per square mile. This unusual behavior should be investigated further, for example by comparing the relative change in population density and crime rate in these counties with the change in the others.
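One way such a follow-up could start is by computing, for each county, the relative change in density and in index rate between the first and last available year. The Python sketch below shows the idea with two placeholder counties and invented values; the nested-dictionary layout is an assumption made for this example.

```python
def relative_change(first, last):
    """Fractional change from first to last, e.g. -0.5 for a 50% drop."""
    if first == 0:
        raise ValueError("first value must be non-zero")
    return (last - first) / first


def county_changes(series):
    """Relative change in density and index rate per county.

    series: {county: {year: (density, index_rate)}} (hypothetical layout).
    Returns {county: (density_change, rate_change)} between the earliest
    and latest year present for that county.
    """
    changes = {}
    for county, by_year in series.items():
        first_year, last_year = min(by_year), max(by_year)
        density_0, rate_0 = by_year[first_year]
        density_1, rate_1 = by_year[last_year]
        changes[county] = (
            relative_change(density_0, density_1),
            relative_change(rate_0, rate_1),
        )
    return changes


# Invented values for two placeholder counties, not figures from the dataset.
series = {
    "County A (dense)": {1990: (4000, 4000), 2015: (4400, 2000)},
    "County B (sparse)": {1990: (100, 2500), 2015: (105, 2000)},
}
changes = county_changes(series)
```

Comparing these pairs between the high-density counties and the rest would show whether the dense counties simply saw larger crime reductions over the period.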

Data Takeaways

  • Population density in NY state has increased over the last 25 years
  • Crime rates in NY state have decreased over the last 25 years
  • In NYC counties, crime rates increase with population density
  • In counties outside NYC, crime rates appear to increase with population density, but only up to 500 ppl/sq.mi, above which they appear to decrease
  • The first three points can all be true if the crime rates decreased uniformly for all counties or more for less densely populated counties

The Road Ahead

Household income, racial diversity, age, and education are all variables that could intuitively affect crime rates and for which data is available. Expanding the investigation to account for those variables would shed more light on the factors affecting crime rates and help guide policy decisions.


If you would like to see the R code which generated this blog post, click here.

About Author

David Richard Steinmetz

David became a data scientist for two reasons: it was the part of his previous jobs he loved, and he saw the need companies have at interpreting their data to meet business goals. With a PhD in Materials...

Leave a Comment

Jack Polenta January 14, 2019
Any analysis which only looks at Population Density alone will be mottled. Controlling for household wealth (NOT income) is necessary for significant analysis. Once you do so, racial factors become a non-issue.
The Pressure Cooker: Population Density and Crime – Cloud Data Architect October 9, 2016
[…] Contributed by David Richard Steinmetz. He takes the NYC Data Science Academy 12 week full time Data Science Bootcamp program from July 5th to September 22nd, 2016. This post is based on their first class project – the Exploratory Data Analysis Visualization Project, due on the 2nd week of the program. You can find the original article here. […]
