Visualizing Natural Gas Withdrawals, Consumption and Prices

Posted on Aug 7, 2016

Visualizing Natural Gas Production and Consumption


Natural gas is used in the United States for electricity generation, residential and commercial heating and cooking, fueling industrial processes, powering vehicles, and energy production operations. The United States is one of the largest consumers of natural gas, in large part because of the country's available reserves, its developed production industry, and the difficulty of storing and transporting such a volatile substance. Most of the gas consumed in the US is produced in the US: only about 11% of consumed gas is imported, primarily from Canada via pipelines, and only 8% is exported to Canada or Mexico (first 5 months of 2016). Because the fuel is difficult to transport, one might expect natural gas to be consumed primarily near where it is produced. This visualization offers some evidence for that.

Production sources vary as well, and while the majority of gas is produced from traditional wells specifically designed to produce natural gas, it is also produced from oil wells, coal beds, and more recently an increasing number of shale wells.  While production from shale wells has suffered due to the economics of these wells, the amount of gas obtained from shale wells has grown impressively in the last decade, and we may be able to view some interesting trends in shale production via this visualization.

The Data

All data used for this visualization was obtained from the US Energy Information Administration (EIA). An immense amount of energy data is available via the EIA website. To access the data quickly and easily in R, I used the EIAdata package, developed by Matthew Brigida. The package allows easy navigation of the numerous categories of data made available by the EIA and provides a simple mechanism for requesting and receiving the data. This Shiny application uses time-series data so the user can view changes over time; the EIAdata package returns time-series data in R's xts format, which makes the data easy to subset. Most of the data series were available in monthly increments but were easier to visualize when grouped into annual totals, which was easily done with the built-in xts functions. Unfortunately, much of the withdrawals data was complete only through the end of 2014, so the visualization includes data only through 2014. Code used to obtain and sort through the data will be available via links below.
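The monthly-to-annual grouping described above can be sketched in base R. This is a minimal illustration under stated assumptions, not the app's actual code: the data frame `monthly` and its values are invented, and no EIA request is made. With an xts object, the same roll-up could be done with `apply.yearly(x, sum)`.

```r
# Hypothetical monthly withdrawals for one state (values invented for illustration).
monthly <- data.frame(
  date   = seq(as.Date("2013-01-01"), as.Date("2014-12-01"), by = "month"),
  volume = c(rep(100, 12), rep(150, 12))
)

# Derive the calendar year from each monthly date.
monthly$year <- as.integer(format(monthly$date, "%Y"))

# Sum the monthly values within each year to get annual totals.
annual <- aggregate(volume ~ year, data = monthly, FUN = sum)
print(annual)
```

Grouping on a derived `year` column keeps the monthly rows intact, so the same data frame can still feed a month-level chart.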

All volumes are reported in Billion Cubic Feet (Bcf). For a detailed explanation of the size of this unit, visit the EIA site.

The App

The application consists of three parts. Withdrawals (production) and consumption are shown on maps of the US states in the first two tabs, which allow both visual and anecdotal comparisons of withdrawal and consumption statistics. Within the Withdrawals Map, a user can view withdrawals from the various production sources by year and can also compare withdrawals between pairs of years. Similarly, consumption can be viewed by year, changes in consumption between pairs of years can be visualized, and consumption can be filtered by end user.

Monthly time-series data can be visualized in the Charts tab.  Within this tab, the user can view, side-by-side, production of gas from a chosen source-type with consumption of gas by a chosen end-user-type.  This data can be viewed at the US-total level, or by state.  Additionally, a representative price is displayed, utilizing “city-gate” prices provided for each state by the EIA.


The application holds a lot of data that can be grouped and viewed in many ways, which allows the user to explore trends at a very granular level or at a much higher level. To illustrate the power of the application, we will focus on two specific scenarios that can be visualized in the app. The user is encouraged to search for other trends and draw conclusions.

Example 1: Shale Gas Production Growth

In the Withdrawals Map tab, select “Shale” from the Withdrawals drop-down. The user will notice that there is no data prior to 2007 in this category. This may partly reflect the minimal amount of gas withdrawn from shale wells before then, but I suspect that shale gas was simply not a reportable category before 2007. By setting the time scale to 2007 and pressing “Play” on the time scale, one can watch the growth of shale gas production, most notably in Pennsylvania, where shale gas production grew from zero to over 4.4 million Bcf in 2014. Total shale production grew from just under 2 million Bcf reported in 2007 (viewable in the information box in the bottom right) to almost 14 million Bcf in 2014.

The change alone can be viewed by changing the “View Statistic” dropdown to “diff” and setting the time scales to 2007 at top and 2014 below.  However, since there was no production reported in 2007, this “change” is equivalent to the total amount produced in 2014.  Alternatively, one can look at the change in total production by changing the Withdrawal Source back to “All Sources” and viewing the map again.  The growth of shale in Pennsylvania becomes obvious here.  The next-closest “increase” was in Texas (rankings can be viewed in the graphic below the map, which will show the 5 biggest increases and 5 biggest decreases), which only had 2 million Bcf of new production in 2014 vs 2007 while Pennsylvania increased total production by 4 million Bcf.
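The “diff” statistic and the ranking of the 5 biggest increases and decreases can be sketched as follows. This is a rough illustration only, assuming one row per state with annual totals; the column names and all values are hypothetical and are not EIA figures or the app's actual code.

```r
# Hypothetical annual totals by state (values invented, not EIA figures).
annual <- data.frame(
  state = c("PA", "TX", "LA", "WY", "OK", "NM"),
  y2007 = c(10, 50, 40, 30, 20, 25),
  y2014 = c(60, 75, 35, 22, 28, 24),
  stringsAsFactors = FALSE
)

# The "diff" statistic: later year minus earlier year.
annual$diff <- annual$y2014 - annual$y2007

# Split into increases and decreases, then rank each group.
increases <- annual[annual$diff > 0, ]
decreases <- annual[annual$diff < 0, ]

# Up to 5 biggest increases (largest first) and 5 biggest decreases (most negative first).
top_increases <- head(increases[order(-increases$diff), ], 5)
top_decreases <- head(decreases[order(decreases$diff), ], 5)

print(top_increases[, c("state", "diff")])
print(top_decreases[, c("state", "diff")])
```

Splitting before ranking keeps a state with a small increase from appearing in the decreases list when fewer than five states actually declined.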

One might wonder whether consumption of natural gas increased in Pennsylvania in step with production. To check, the user can switch to the consumption map, change the “View Statistic” dropdown to “diff” and select 2007 and 2014 as the comparison years. The largest increase in consumption between those years is in Pennsylvania. By flipping through the consumer types, one can see that most of this increase is attributable to electricity production, and one might guess that locally produced gas is displacing coal as the fuel source for electricity production.

Next, the user can view the month-over-month changes by switching to the Charts tab. Select PA in the state selection at the bottom, and one can immediately see the increase in total withdrawals and can possibly detect an increase in consumption. Note the seasonality of demand for gas. In the US Northeast, natural gas is used for heating in winter, hence the increased demand in the cold winter period. As noted earlier, the increase is most obvious in electricity production, which interestingly does not show as much seasonality.
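One simple way to make the seasonality visible in numbers is to average consumption by calendar month. The sketch below assumes a made-up two-year monthly series with a winter peak; the data frame and its values are hypothetical and only illustrate the idea.

```r
# Hypothetical monthly consumption with a winter peak (values invented).
monthly <- data.frame(
  date        = seq(as.Date("2012-01-01"), as.Date("2013-12-01"), by = "month"),
  consumption = rep(c(90, 80, 70, 50, 40, 30, 30, 30, 40, 50, 70, 85), times = 2)
)

# Calendar month (1-12) for each observation.
monthly$month <- as.integer(format(monthly$date, "%m"))

# Average consumption for each calendar month across all years.
seasonal_profile <- tapply(monthly$consumption, monthly$month, mean)

# Calendar month with the highest average consumption.
peak_month <- as.integer(names(which.max(seasonal_profile)))
print(seasonal_profile)
print(peak_month)
```

A series with little seasonality, such as the electricity-production case mentioned above, would produce a much flatter profile.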

Finally, one can view the time series of price over the same period by looking at the bottom of this page. The local price of gas has decreased in step with this growth in production, despite an increase in consumption. However, the decrease corresponds to a decline in gas prices across the US, and further statistical analysis is required before attempting to explain this price trend.

Example 2: Hurricanes in the Gulf

As anyone familiar with the natural gas industry knows, hurricanes in the US Gulf cause problems for production and lead to price spikes for natural gas. As a second example, we will visualize this phenomenon. The year 2005 was known for two hurricanes, Katrina and Rita, which caused widespread damage and loss of life. Natural gas production and distribution were disrupted as well.

To visualize the effects, one can start with the withdrawals map and view the change in withdrawals between 2004 and 2005. Louisiana, which took a direct hit from Katrina, shows a decrease in total production of 67,000 Bcf. Total US withdrawals decreased by 500,000 Bcf from 2004 to 2005.

The consumption map shows a decrease in consumption in Texas of over 400,000 Bcf.  One might be able to conclude that this decrease was related to decreases in production in the area due to hurricanes.

The Charts provide even more insight.  By selecting Total production and setting both time scales to 2005 (which allows us to visualize start-end of 2005), the user will see a sharp decrease in production in September of 2005.  By expanding the timeframe in either direction, one can see that production in September of 2005 was the lowest monthly production since 1993, and that low level of production has not been seen again.  The decrease corresponds with a massive spike in the price for natural gas.
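The same check the chart supports visually, finding the lowest month and the sharpest month-over-month drop, can be sketched in base R. The series below is invented to contain a single dip in September; it is not EIA data, and the names are hypothetical.

```r
# Hypothetical monthly production with one sharp dip (values invented).
production <- data.frame(
  date   = seq(as.Date("2005-01-01"), as.Date("2005-12-01"), by = "month"),
  volume = c(200, 198, 201, 199, 202, 200, 201, 199, 150, 165, 180, 190)
)

# Row with the lowest monthly volume.
lowest <- production[which.min(production$volume), ]

# Month-over-month change; the first month has no predecessor, so it is NA.
production$change <- c(NA, diff(production$volume))

# Row with the most negative change (which.min skips the NA).
largest_drop <- production[which.min(production$change), ]

print(lowest)
print(largest_drop)
```

Looking at both the level and the change helps separate a one-month shock from a series that was already trending downward.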

Filtering by state, one can see that production in Louisiana declined sharply in September of 2005.  A similar decrease in withdrawals can be viewed in the “Federal Offshore” area by choosing that area in the state-dropdown.


This app provides the interested user a number of ways to view the production and consumption of natural gas in the United States.  A user can investigate the impact of specific events or trends and can develop theories for further research and statistical analysis.  Other things to explore include the changing trends in gas production from coal beds, which might inspire the user to further research coal production and how changing trends in coal production could affect natural gas production.  A user might also be interested in exploring the relationships between production, consumption and prices in their home state over specific time frames.  The monthly granularity of the charts could allow an interested user to investigate how annual changes might be affected by spikes or valleys in specific months relating to individual events.  Or, one might be interested in simply looking at the growth of natural gas production in the US since 1991.  The possibilities are seemingly endless.

Server.R code:

Ui.R Code:

Helper.R Code:

About Author

Ben Townson

Ben Townson graduated from the New York City Data Science Academy 12-week Data Science Bootcamp on September 23. At NYCDSA he has mastered machine learning and data analysis techniques, complementing more than ten years spent in the finance...