Explore Global Patent Data Trends in Technology
The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.
IntroductionΒ
An invention brings something new into being and has a practical bent. When these inventions are registered as intellectual property β primarily patents, the resulting records provide valuable technological and geographic details. By gathering data on technology patents filed around the world through the United States Patent and Trademark Office (USPTO) in the last two decades, slicing and dicing the data by country and technology domains, and see changes along with the timelines, one can gain insights into the dynamics of a variety of tech domains at different countries.
For this reason, I created this Shiny app to help people easily visualize the dynamics of technology development, which will be particularly useful for business leaders, researchers, and policymakers. The patent data used for the app is published by the Organization of Economic Cooperation and Development (OECD).Β
The R Shiny App
The interactive app was built using the Shiny dashboard in R. The app contains three main sections:
- World - This allows users to select a year, visualize the geographic distribution of patents filed across the world, view a Leaderboard of countries and technology domain ranked by number of patents filed, as well as additional interesting factors and correlations related to the patent quantity
- Country β This allows users to select a particular country of interest and see patent trends and ranking in various technology domains, as well as additional insights on other technology indicators such as GDP, foreign collaboration, and Gross Domestic Expenditure on R&D
- Tech Domain β This allows users to select a particular technology domain of interest, and see trends and ranking of various countries in the selected domain.
Data on WorldΒ
Under the World tab, the users are first presented with a world map where they can hover over the countries and see how many patents are filed in each country in the year selected. Above the map, users can conveniently see the total number of patents, top country and top tech domain in terms of patent quantity in the year selected.
Leaderboard
The second section under the World tab is a Leaderboard, which ranks the top 10 countries and all tech domains based on patent quantity in the year selected.
Correlations
The third section under the World tab β βCorrelationsβ β displays the correlation of patent volume with several important economic and demographic factors. The first plot shows that the number of patents of a country is positively correlated to its Gross Domestic Expenditure on R&D, with most of the data points congregated around below 250,000 million USD PPPs.
In the second plot, we normalize the Gross Domestic Expenditure on R&D for each country and represented the data by Gross Domestic Expenditure on R&D as a % of GDP, and the positive correlation with patents volume is even clearer, and the data points are more spread out. One interesting to note is that from the graph we can easily spot countries that yields a higher or lower number of patents (i.e. Japan, highlighted) compared to the expected value (the corresponding y-value on the regression line), given the countryβs devotion of resources on R&D.
Data on Country
Under the Country tab, the first thing users can see is a graph showing the trend of each tech domain over the last 20 years in the selected country, the United States in this example. The banners on the top indicate the most prominent tech domain in the United States is Computer Technology, and the U.S is producing 45% of the total world patents overall. This gives people a good idea of how different tech domains have been trending in a given country.
The second section under the Country tab is a Leaderboard, which ranks all the technology domains based on patent quantity in the year and country selected.
Additional Insights
The third section under the Country tab β βAdditional Insightsβ, displays the trend of several other technology indicators of a given country, together with the total patent volume throughout the years. For example, in the graph above, the top right plot (GDP Per Capita trend) showed that around the year 2008 the GDP Per Capita took a big hit, which makes total sense because that corresponds to the Great Recession. We can observe a similar dip in the patent volume around 2008 as well, meaning that the Great Recession, definitely had some negative impact on U.S.βs technology advancement as well.
Letβs look at another example β China, we can immediately spot several unique patterns. Firstly, the GDP per capita and a total patent trend increase monotonously in the past two decades without any dips, which means China is on a high growth path in terms of tech innovation.
Another interesting thing to notice is that the percentage of patents with foreign collaboration has been continuously decreasing. This might indicate that China is shifting its technology strategy from relying on more foreign partners to forming a more closed-loop, independent entity when it comes to new patents. Notice how the United States is doing the opposite β its percentage of the foreign collaboration of patents has been on the rise.
Takeaways and Future WorkΒ
This app is designed to help anyone who is interested in knowing overall trend in technology around the world in different countries and tech domains, understand which domains is each country focusing on through the years, and correlate the trend to interesting factors such as GDP per capita, level of collaboration with world, and other technology indicators such as Gross Domestic Expenditures on R&D etc. The World, Country, and Domain sections are separated in a way such that users have the flexibility to zoom in and out to examine the high-level trend as well as a specific country of domain and do further slicing and dicing.
Going forward, I would like to enhance this app in the following ways:
- Getting more recent and potentially real time data and sync it to the data pipeline for the app. The current version of the app only takes static data from 1999 β 2018. As OECD provides new update, it would be great to be able to sync the data to the app automatically through an API and display updated results.
- Dig deeper into some countries and further segment their patents filed by industry vs. academic institutions, and possibly also show insights into patents filed by prestigious companies (E.g. Google, Huawei, etc.)
Thank you
Thank you for reading my blog post and I hope you enjoyed it. Here is the link to my Github repository if you want to see the code. Please feel free to contact me via Email: [email protected] if you have any questions or would like to discuss further details.