Explore Global Patent Data Trends in Technology

Posted on Oct 13, 2020
The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.

Introduction 

An invention brings something new into being and has a practical bent. When these inventions are registered as intellectual property – primarily patents, the resulting records provide valuable technological and geographic details. By gathering data on technology patents filed around the world through the United States Patent and Trademark Office (USPTO) in the last two decades, slicing and dicing the data by country and technology domains, and see changes along with the timelines, one can gain insights into the dynamics of a variety of tech domains at different countries.

For this reason, I created this Shiny app to help people easily visualize the dynamics of technology development, which will be particularly useful for business leaders, researchers, and policymakers. The patent data used for the app is published by the Organization of Economic Cooperation and Development (OECD). 

The R Shiny App

The interactive app was built using the Shiny dashboard in R. The app contains three main sections:

  1. World - This allows users to select a year, visualize the geographic distribution of patents filed across the world, view a Leaderboard of countries and technology domain ranked by number of patents filed, as well as additional interesting factors and correlations related to the patent quantity
  2. Country – This allows users to select a particular country of interest and see patent trends and ranking in various technology domains, as well as additional insights on other technology indicators such as GDP, foreign collaboration, and Gross Domestic Expenditure on R&D
  3. Tech Domain – This allows users to select a particular technology domain of interest, and see trends and ranking of various countries in the selected domain.

Data on World 

Explore Global Patent Data Trends in Technology

Under the World tab, the users are first presented with a world map where they can hover over the countries and see how many patents are filed in each country in the year selected. Above the map, users can conveniently see the total number of patents, top country and top tech domain in terms of patent quantity in the year selected.

Explore Global Patent Data Trends in Technology

Leaderboard

The second section under the World tab is a Leaderboard, which ranks the top 10 countries and all tech domains based on patent quantity in the year selected.

Correlations

The third section under the World tab – “Correlations” – displays the correlation of patent volume with several important economic and demographic factors. The first plot shows that the number of patents of a country is positively correlated to its Gross Domestic Expenditure on R&D, with most of the data points congregated around below 250,000 million USD PPPs.

In the second plot, we normalize the Gross Domestic Expenditure on R&D for each country and represented the data by Gross Domestic Expenditure on R&D as a % of GDP, and the positive correlation with patents volume is even clearer, and the data points are more spread out. One interesting to note is that from the graph we can easily spot countries that yields a higher or lower number of patents (i.e. Japan, highlighted) compared to the expected value (the corresponding y-value on the regression line), given the country’s devotion of resources on R&D.

Data on Country

Explore Global Patent Data Trends in Technology

Under the Country tab, the first thing users can see is a graph showing the trend of each tech domain over the last 20 years in the selected country, the United States in this example. The banners on the top indicate the most prominent tech domain in the United States is Computer Technology, and the U.S is producing 45% of the total world patents overall. This gives people a good idea of how different tech domains have been trending in a given country.

The second section under the Country tab is a Leaderboard, which ranks all the technology domains based on patent quantity in the year and country selected.

Additional Insights

The third section under the Country tab – “Additional Insights”, displays the trend of several other technology indicators of a given country, together with the total patent volume throughout the years. For example, in the graph above, the top right plot (GDP Per Capita trend) showed that around the year 2008 the GDP Per Capita took a big hit, which makes total sense because that corresponds to the Great Recession. We can observe a similar dip in the patent volume around 2008 as well, meaning that the Great Recession, definitely had some negative impact on U.S.’s technology advancement as well.

Let’s look at another example – China, we can immediately spot several unique patterns. Firstly, the GDP per capita and a total patent trend increase monotonously in the past two decades without any dips, which means China is on a high growth path in terms of tech innovation.

Another interesting thing to notice is that the percentage of patents with foreign collaboration has been continuously decreasing. This might indicate that China is shifting its technology strategy from relying on more foreign partners to forming a more closed-loop, independent entity when it comes to new patents. Notice how the United States is doing the opposite – its percentage of the foreign collaboration of patents has been on the rise.

Takeaways and Future Work 

This app is designed to help anyone who is interested in knowing overall trend in technology around the world in different countries and tech domains, understand which domains is each country focusing on through the years, and correlate the trend to interesting factors such as GDP per capita, level of collaboration with world, and other technology indicators such as Gross Domestic Expenditures on R&D etc. The World, Country, and Domain sections are separated in a way such that users have the flexibility to zoom in and out to examine the high-level trend as well as a specific country of domain and do further slicing and dicing.

Going forward, I would like to enhance this app in the following ways:

  • Getting more recent and potentially real time data and sync it to the data pipeline for the app. The current version of the app only takes static data from 1999 – 2018. As OECD provides new update, it would be great to be able to sync the data to the app automatically through an API and display updated results.
  • Dig deeper into some countries and further segment their patents filed by industry vs. academic institutions, and possibly also show insights into patents filed by prestigious companies (E.g. Google, Huawei, etc.)

Thank you

Thank you for reading my blog post and I hope you enjoyed it. Here is the link to my Github repository if you want to see the code. Please feel free to contact me via Email: [email protected] if you have any questions or would like to discuss further details.

 

About Author

Qing Ying

Qing (Sophie) Ying graduated from UC Berkeley with a Master's degree in Industrial Engineering and Operations Research. She has been a product manager in a healthcare technology startup for 3 years, where she developed various data analytics products...
View all posts by Qing Ying >

Leave a Comment

No comments found.

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI