Flight Delays in the COVID-19 Era: Insights from Top Carriers

Posted on May 2, 2023

The COVID-19 pandemic brought major disruption to many of our lives. Work and school became remote, supply chain issues arose, and multiple industries saw a dearth of demand. The travel industry was hit particularly hard as international travel became very difficult,  and even domestic flights were rare. We sought to observe the five largest carriers and see which performed best during this period. By identifying the most efficient ones, we can look at their operations to inform future decision-making and strategic planning within the aviation industry. 

The dataset can be obtained from the BTS (Bureau of Transportation Statistics) website. It covers the time period of 09/2007 - 08/2022. The dataset contains nearly 250,000 observations and features of year, month, carrier, airport, arrivals, arrival delays, and some other features splitting up arrival delays into specific contributions (e.g. weather, security, late aircraft in previous leg, etc.). 

There is no specific date parameter in this dataset, so we have to analyze temporally by month-year. Below is a scatter plot to help visualize the distribution of arrival flights against minutes of delay. Each point represents a different pair of carrier and airport values at a specific month-year. There is a somewhat conical shape to the distribution.

When comparing different airlines, the number of arrivals differ. Thus, in order to compare different airlines, we look at an engineered feature instead: the average minutes of delay per flight. Via this metric, let’s first see what happened to the average minutes delayed during the pandemic. At the initial onset, there is a large drop in average minutes delayed. It’s not until partway through 2021 that efficiency falls to the prior levels, likely due to increased demand related to summer travel and vaccine acceptance.

The average flight delay drops at the onset of the pandemic and slowly rises until the middle of 2021. This indicates that customers were, in general, facing fewer delays during this time period. This makes sense as logistics often get a bit easier at a smaller scale.

The five airline carriers with the most arrival flights were American Airlines (AA), Delta Airlines (DL), SkyWest Airlines (OO), United Airlines (UN), and Southwest Airlines (WN).

Looking at their performance over time, we see that Southwestern Airlines was frequently the most efficient of these airlines. However, Delta Airlines was able to maintain a fairly low average minutes delayed, while Southwestern Airlines jumped around more as flight demand increased. This suggests that Delta Airlines had a system in place that was scalable and ready to adjust to an increase in demand, which Southwestern lacked.

This exploratory analysis of airline carriers during the COVID-19 pandemic grants insight into the adaptability and resilience of their operations. The aviation industry has largely recovered by now. However, in the event of another pandemic, we should learn from the example of Southwestern to strip back operations while maintaining high efficiency and from Delta to  scale-up operations once demand increases. If we can properly extract the principles which permitted these, we may be able to build robust systems that can be applied to industries  outside of aviation, as well. Robustness allows our systems to withstand fluctuations and shocks, benefiting both the systems themselves and the people who use them.

About Author

Daniel Erickson

I'm received my Ph.D in mathematics from Oregon State University. I enjoy tackling complex problems - specifically those amenable to insight gleaned through data.
View all posts by Daniel Erickson >

Related Articles

Leave a Comment

No comments found.

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI