Explore Global Patent Trends in Technology

Qing Ying
Posted on Oct 13, 2020

Introduction 

An invention brings something new into being and has a practical bent. When these inventions are registered as intellectual property – primarily patents, the resulting records provide valuable technological and geographic details. By gathering data on technology patents filed around the world through the United States Patent and Trademark Office (USPTO) in the last two decades, slicing and dicing the data by country and technology domains, and see changes along with the timelines, one can gain insights into the dynamics of a variety of tech domains at different countries. For this reason, I created this Shiny app to help people easily visualize the dynamics of technology development, which will be particularly useful for business leaders, researchers, and policymakers. The patent data used for the app is published by the Organization of Economic Cooperation and Development (OECD). 

The R Shiny App

The interactive app was built using the Shiny dashboard in R. The app contains three main sections:

  1. World - This allows users to select a year, visualize the geographic distribution of patents filed across the world, view a Leaderboard of countries and technology domain ranked by number of patents filed, as well as additional interesting factors and correlations related to the patent quantity
  2. Country – This allows users to select a particular country of interest and see patent trends and ranking in various technology domains, as well as additional insights on other technology indicators such as GDP, foreign collaboration, and Gross Domestic Expenditure on R&D
  3. Tech Domain – This allows users to select a particular technology domain of interest, and see trends and ranking of various countries in the selected domain.

World 

Under the World tab, the users are first presented with a world map where they can hover over the countries and see how many patents are filed in each country in the year selected. Above the map, users can conveniently see the total number of patents, top country and top tech domain in terms of patent quantity in the year selected.

The second section under the World tab is a Leaderboard, which ranks the top 10 countries and all tech domains based on patent quantity in the year selected.

The third section under the World tab – “Correlations” – displays the correlation of patent volume with several important economic and demographic factors. The first plot shows that the number of patents of a country is positively correlated to its Gross Domestic Expenditure on R&D, with most of the data points congregated around below 250,000 million USD PPPs. In the second plot, we normalize the Gross Domestic Expenditure on R&D for each country and represented the data by Gross Domestic Expenditure on R&D as a % of GDP, and the positive correlation with patents volume is even clearer, and the data points are more spread out. One interesting to note is that from the graph we can easily spot countries that yields a higher or lower number of patents (i.e. Japan, highlighted) compared to the expected value (the corresponding y-value on the regression line), given the country’s devotion of resources on R&D.

Country

Under the Country tab, the first thing users can see is a graph showing the trend of each tech domain over the last 20 years in the selected country, the United States in this example. The banners on the top indicate the most prominent tech domain in the United States is Computer Technology, and the U.S is producing 45% of the total world patents overall. This gives people a good idea of how different tech domains have been trending in a given country.

The second section under the Country tab is a Leaderboard, which ranks all the technology domains based on patent quantity in the year and country selected.

The third section under the Country tab – “Additional Insights”, displays the trend of several other technology indicators of a given country, together with the total patent volume throughout the years. For example, in the graph above, the top right plot (GDP Per Capita trend) showed that around the year 2008 the GDP Per Capita took a big hit, which makes total sense because that corresponds to the Great Recession. We can observe a similar dip in the patent volume around 2008 as well, meaning that the Great Recession, definitely had some negative impact on U.S.’s technology advancement as well.

Let’s look at another example – China, we can immediately spot several unique patterns. Firstly, the GDP per capita and a total patent trend increase monotonously in the past two decades without any dips, which means China is on a high growth path in terms of tech innovation. Another interesting thing to notice is that the percentage of patents with foreign collaboration has been continuously decreasing. This might indicate that China is shifting its technology strategy from relying on more foreign partners to forming a more closed-loop, independent entity when it comes to new patents. Notice how the United States is doing the opposite – its percentage of the foreign collaboration of patents has been on the rise.

Takeaways and Future Work 

This app is designed to help anyone who is interested in knowing overall trend in technology around the world in different countries and tech domains, understand which domains is each country focusing on through the years, and correlate the trend to interesting factors such as GDP per capita, level of collaboration with world, and other technology indicators such as Gross Domestic Expenditures on R&D etc. The World, Country, and Domain sections are separated in a way such that users have the flexibility to zoom in and out to examine the high-level trend as well as a specific country of domain and do further slicing and dicing.

Going forward, I would like to enhance this app in the following ways:

  • Getting more recent and potentially real time data and sync it to the data pipeline for the app. The current version of the app only takes static data from 1999 – 2018. As OECD provides new update, it would be great to be able to sync the data to the app automatically through an API and display updated results.
  • Dig deeper into some countries and further segment their patents filed by industry vs. academic institutions, and possibly also show insights into patents filed by prestigious companies (E.g. Google, Huawei, etc.)

Thank you

Thank you for reading my blog post and I hope you enjoyed it. Here is the link to my Github repository if you want to see the code. Please feel free to contact me via Email: [email protected] if you have any questions or would like to discuss further details.

 

About Author

Qing Ying

Qing Ying

Qing (Sophie) Ying graduated from UC Berkeley with a Master's degree in Industrial Engineering and Operations Research. She has been a product manager in a healthcare technology startup for 3 years, where she developed various data analytics products...
View all posts by Qing Ying >

Leave a Comment

No comments found.

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

#python #trainwithnycdsa 2019 airbnb Alex Baransky alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus API Application artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep Bundles California Cancer Research capstone Career Career Day citibike clustering Coding Course Demo Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day Discount dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Hiring Partners Industry Experts Instructor Blog Instructor Interview Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research Meet the team meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Online Online Bootcamp Open Data painter pandas Part-time Portfolio Development prediction Prework Programming PwC python python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau team TensorFlow Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping Weekend Course What to expect word cloud word2vec XGBoost yelp