Python Survey 2017 Visualization with R and Shiny
Project GitHub | LinkedIn: Niki Moritz Hao-Wei Matthew Oren
The skills we demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.
With the arrival of Big Data era, data has become more and more important to human beings. People who have better understand of data can not only gain advantage for their business, but also can have better understand of their industries. Because of this, data analysis tools such as python, R, SAS, etc. also become more and more popular. Since I have background in both computer science and applied mathematics and statistics. I decided to take a good look of survey that conducted by JetBean for python users in 2017. You can view my project via the link, and the code for the project is on github.
The data set I used is from JetBean.com, the website provides the report about Python community in both 2016 and 2017. They only provide the raw data for 2017. It gives me a chance to doing analyzing with the data on different ways from their report. From the dataset, I hope to provide my audiences a better understand of python in the world now. The dataset includes answers from 10,000 JetBean users for the Python Developers Survey 2017. The survey has 30 different questions to ask users from if they use python as main languages to the type of industries the users are in, etc.
After viewing and cleaning the data set, I decided to use 6 main components for my project. They are what is the usage of python for them? what countries do these python users from? What are the age ranges of these python users? What other languages do they use beside Python? What are the purposes do they use Python for? What kinds of industries do you work in?
As you can see in the graph down below, among all the users that finished the survey, there are 85.3% of python users. 67.5% of them use Python as main language; and 17.8% of them use Python as secondary language.
In the Country section, I listed of number of Python users in the top 12 countries that have most Python users and plug them into the global map. The darker color the countries get filled, the more Python users this country has.
In the Age section, I listed out the range of Python users from under 17 to 60 or older. Although Python users have different age, as you can see most of Python users are in their 20s to 30s. And age range of 21-29 has the most Python users.
That leads us to the question of what do people use Python for? Since this question can contain multiple answer on the survey. The total percentage is greater 100%. The answer is most of them use Python for either Data Analysis or Web Development (50% vs 49%), following by DevOps / System administration / Writing automation scripts, Programming of web parsers / scrapers / crawlers, etc.
As I mentioned in the beginning of this, the world has entered the era of Big Data. You can easily figure out the answer for what kinds of industries do Python users work for? Information Technology / Software Development contains about 25% of Python users among 10,000 of them.