Data Analysis on Mental Health in the Workplace

Posted on Aug 14, 2020
The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.

A dashboard was made using RShiny to visualize mental health survey data. The dashboard can be viewed here, and the code for this project is hosted on Github.


According the US Center for Disease Control (CDC), mental health issues cause some of the greatest burdens on the country as a whole1. In particular, working adults with mental health issues can face negative consequences like poor job performance and low engagement with their work. Therefore, it's important to spread awareness about mental health in order to improve communication and encourage employers to create stronger support networks at the workplace.

The Data

A voluntary survey was administered by Open Sourcing Mental Illness Ltd., a non-profit with the mission of raising awareness around mental wellness in the technology sector. As part of their mission, OSMI has released the data under a free creative commons license. The survey was conducted online and has been running annually since 2014. (The 2014 survey is popular on the data science community kaggle.) The questions change from year to year - ranging from 27 the first year to 83 in the most recent year - and fall into general categories, such as:

  • what type of company do you work at?
  • do you have a mental illness?
  • are you comfortable discussing your mental illness with your coworkesr?
  • does your employer-sponsored insurance include mental health coverage?

Exploratory Data Analysis

A dashboard is great way to explore a large data and look for trends. Questions like, "How does your mental illness impact your work?" are important to understand in different contexts, like between different countries or companies of different sizes. In a dashboard, it is easy to compare these types of questions to better understand the data.

How do different countries compare?

The first page of the dashboard allows for exploring survey questions by country. An initial review of the data shows that the United States is #1 for the percentage of respondents who say they have a mental illness.

Data Analysis on Mental Health in the Workplace

This may be alarming at first, but many organizations - like OSMI - have been working hard to increase mental health awareness. Hypothetically speaking, all ten countries could have the same rates of mental illness, but the United States may be more aware of their illness and more likely to answer, "Yes," on a survey.

Indeed, when we look at how often mental illness impacts work, the United States is near the middle of the ranking.

Data Analysis on Mental Health in the Workplace

In fact, the country where workers feel their work is most impacted by mental illness is India. This is surprising, given that India had the lowest percentage of respondents who admitted to having a mental illness. From this limited sample size, it looks like India could see the greatest benefit of future mental health awareness promotion efforts.

How do different company sizes compare?

The biggest difference between companies is also the least surprising - that large companies are far more likely offer mental health benefits as part of their employer-sponsored healthcare plans than smaller companies. This is expected, because larger benefits packages are usually included in compensation at larger companies.

Data Analysis on Mental Health in the Workplace

What could be potentially more surprising is how often mental health issues actually affect people at different company sizes. When we dig into it, we see that small companies (<25 people) have a sizeable increase of people who say their mental illness often impacts their work.

There are two conclusions that you could draw from this for promoting mental health awareness: (1) if you have a mental illness and you are on the job hunt, you may want to focus on larger companies, because you are more likely to have healthcare coverage, and (2) promotion of mental health awareness should be more focused on smaller companies, because small companies have the most room for improvement.


The first component of any data project is to make sure you really understand your data, and one of the best ways to understand your data is to visualize it. When you visualize data, trends will pop out that you didn't expect that lead to valuable insights.

In this case, the data has shown us two new directions that OSMI could focus their marketing efforts to promote mental health awareness. The first is to focus more attention on a country like India where survey respondents claim to be more impacted by mental health issues at work. The second is to focus more attention on small businesses or early stage startups (<25 people) where respondents also claim to have more difficulty with mental health at work.

Further Reading

If you would like to learn more about Open Sourcing Mental Illness Ltd., or if you want to participate in the 2020 Mental Health in Tech Survey, you can read more on their website:



About Author

Stephen Kita

Stephen is a biomedical engineer who likes to work with data and develop innovative healthcare products. He is an excellent problem-solver with a diverse background in entrepreneurship.
View all posts by Stephen Kita >

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI