Data Visualization on Risk Assessment of Meteor Impacts

Posted on Apr 30, 2017
The skills the author demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.

What are the chances, based on data,  Earth is impacted by a meteor?

If a meteor does hit Earth, how much damage would it cause?

The meteor application I created attempts to answer these questions as well as provide background information on solar system objects.


There are over 700,000 asteroids and comets that have been observed in our solar system. All key information about these objects has been stored by the Jet Propulsion Laboratory (JPL) in their small-body database.


Two parameters are generally used to classify these objects:

  1. Semi-major axis - the longer of the two radii that define an elliptical orbit
  2. Perihelion distance - the closer of the two distances from the object to the sun along the major axis

Data Visualization on Risk Assessment of Meteor Impacts

Since the small-body database contains the class information about each object, the potential threat of each class can be assessed by charting the mean and median semi-major axis and perihelion distance. More specifically, an object makes a close approach when the fly-by distance from Earth is less than 0.05 AU (4.6 million miles) and is considered hazardous when the diameter is larger than 330 ft. Out of twenty classes, only four are potentially dangerous, as pictured below (colors) crossing the Earth (inner) and Mars (outer) orbits.

Data Visualization on Risk Assessment of Meteor Impacts

More specific information about hazardous objects is contained in another source: the Sentry - Earth impact monitoring database. This database contains information on object velocity and diameter which will be used to assess potential damage.

For the curious, the small-body database serves as a great demonstration of Kepler's third law. This relation between semi-major axis and orbital period served in part as a basis for Newton's work on gravity.


Asteroid and Comet Classification

As mentioned in the introduction, only a small subset of objects are potentially dangerous to Earth. Specifically, 95% of objects are contained within the asteroid belt between Mars and Jupiter while roughly 0.1% of objects have been determined to be potentially hazardous. My application can be used to browse the various classes and explore why they are or are not potentially hazardous.

For example, the Jupiter Trojan objects are described as hovering around Jupiter's Lagrangian points with semi-major axes around 5.2 AU (according to Wikipedia). By selecting the TJN class in the application, the following semi-major axis (left), perihelion distance (center) and diameter (right) plots are produced:

a-TJN-classData Visualization on Risk Assessment of Meteor ImpactsData Visualization on Risk Assessment of Meteor Impacts











For reference, the dashed lines indicate mean of the distribution while the dotted lines indicate the median of the distribution. Using the mean values, we can plot the average expected orbit for each class to understand approximately where these objects travel on a macro level. For the TJN class (black), the following image is produced with Jupiter (purple), Mars (red) and Earth (blue) orbits are plotted for reference. We see that these objects are outside of the asteroid belt and are extremely far away from Earth, hence not dangerous in the least.



In contrast, the following semi-major axis (left), perihelion distance (center) and diameter (right) plots are produced for Aten class objects which are potentially dangerous, but much smaller in diameter:


Charting these orbital parameters in terms of mean orbits produces the image below. No wonder that these are hazardous, their average orbit crosses Earth's orbit!


Potentially Hazardous Objects

With an understanding of which objects are potentially hazardous asteroids (PHA), we can turn to objects which have a non-zero probability of impacting Earth. The distributions of these objects are plotted below:


Of main interest is the diameter distribution since the size of the meteor is proportional to the damage it will cause (assuming that the object densities are similar). The median of this distribution is roughly 60 feet while the mean is roughly 130 feet.

Potential Impacts Page

In astronomical terms, these objects are tiny. How much damage would one of these median-size objects cause? Using the search feature in the datatable tab of the potential impacts page, we can locate an example asteroid: 2009 BE.

Screen Shot 2017-04-30 at 12.22.59 PM

The crater application takes in a meteor, target and meteor material information, impact city (visualization only) and angle of impact to determine key figures such as crater radius, depth and time for formation while displaying key statistics about the meteor itself. For 2009 BE, we see that there is a relatively high impact probability of 43 thousandths of a percent though the resulting crater would only be 300 feet wide and 200 feet deep.

2010 AU118

For a much less likely less scenario, let's see what happens if the largest PHA hits earth: 2010 AU118.

Screen Shot 2017-04-30 at 12.24.07 PM

We see that the crater will be two orders of magnitude larger and thus the destruction would likely be devastating. Not included in this calculation are the other effects that a meteor would cause such as pressure wave generation and large scale dust generation and corresponding climate change. Given more time, this application could be updated to include more disaster-related calculations in addition to the crater generation figures.

Impact Probability vs Diameter

So how likely is it that some massive meteor impacts Earth? The hope is that there is a negative correlation between impact probability and diameter. This plot is shown below (both on log scales) and while there is a general trend, there is a large spread about the least squares regression line with some notable outliers in high probability and large diameter.


While this is not comforting, the following statistics should be:

  1. The probability of one small meteor (2010 RF12, diameter = 0.007 km) hitting Earth is 5%
  2. The probability of at least one meteor hitting Earth is rough 8.5%
  3. The most likely estimated year of impact is between 50 and 100 years from now


In conclusion, there is very little to worry about regarding meteor impact especially considering the small fraction of the world that has significant population density. In the future, an estimation of the probability of a meteor impacting a populated area will be included.


The relevant equations and coefficients for the crater formation problem can be found in the following paper:

  • Holsapple, K.A., 1993. The Scaling of Impact Processes in Planetary Sciences. Annu Rev Earth Pl Sc 21, 333–373

About Author

Sam O'Mullane

Sam O'Mullane is a multidisciplinary scientist who is passionate about making data science accessible to a wide audience. Sam received his MSc in Quantum Fields and Fundamental Forces from Imperial College London (Th. Physics) in 2013 and PhD...
View all posts by Sam O'Mullane >

Related Articles

Leave a Comment

Google October 17, 2019
Google Here are some links to sites that we link to for the reason that we feel they are really worth visiting.
Google September 26, 2019
Google The data talked about in the post are a few of the top out there.
Louvenia July 14, 2017
This is a matter close to my heart cheers. Thanks
Chance July 11, 2017
Great post. I am facing a couple of these problems.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI