HR Employee Attrition Analysis

Posted on Jul 31, 2022
  • The data set: Uncover the factors that lead to employee attrition and explore important questions such as show me a breakdown of distance from home by job role and attrition or compare average monthly income by education and attrition. This is a fictional data set created by IBM data scientists.
  • Attrition of employees canโ€™t be avoided. Some employees leave the company as they reach their retirement age, while many leave due to many factors such as, but not limited to, lower satisfaction rate, lower pay rate, and toxic work environment. Measuring attrition can uncover many answers related to the functioning within the organization. Higher attrition rates signal a need for further investigation.


  • Objective:

ย  ย  ย  ย  Know the main factors that drive employees to leave the job and search for another job, is the main objective of this project. Which can help companies mitigate employee exit because it causes a big losses to the company.

The project passed through four phases:

  • Inspecting: (1470 row , 35 columns)
  • Cleansing(Missing values, Empty data, Incorrect Type, Incorrect values, Outliers and non relevant data)
  • Transforming(Reshaping, Transforming Structures, Indexing for quick access, Merging and Joining).
  • Modeling : (Visualization, Statical models, Correlation, Reporting).


Exploratory Data


-The attrition rate for our dataset sample is 18.6%.

Majority of employees aged 25 to 45

-Most of the employees, who have been a part of the company, tend to fall in the age range from 25 years to 45 years.

Majority of employees who have higher attrition aged 18 to 21

-The proportion of employees who left was comparatively more among the young employees.

Data Analysis

Employees Education and Education Field

- Employees having (Below College, Bachelor ) Education , seem to have a higher tendency to leave the company.

- In general employees having (Technical Degree, Human Resources) Education Field, seem to have a higher tendency to leave the company.

Departments attrition rate

- Sales department have the higher attrition among other departments.


- There is a higher proportion of attrite employees who stay far from the office (More than 10 KM) than the proportion of employees who did not leave the company and stay far away from the office.


-Low income employees are more to attrite from the company. While employee with high income are more likely to stay.

-With less total working years employee are more to attrite from the company.

-Low Job Satisfaction, Low Relationย  ship Satisfaction, Bad Work Life Balance, Low Environment Satisfaction effect the employee which have the higher attrition among them.

- Employees with job Level 1 ,Job Role(Sales Representative), low job Involvement have higher attrition.

ย  ย Conclusion:

** Factors which cause employees attrition **

  • Low monthly income.
  • Long distance between employees home and the company location.
  • Sales department, probably because of the pressure on the employees from the managers.
  • Low Job Satisfaction, Low Relation ship Satisfaction, Bad Work Life Balance, Low Environment Satisfaction in the company.
  • Sales Representative.
  • Low job Involvement.


  • Increase the monthly salary.
  • Find accommodation for employees that is close to work.
  • Reducing the pressure on the sales department and give them a bonus to encourage them.
  • Changing the work environment and making it appropriate for achievement and improve the relationship between employees.
  • Increase the job Involvement for the employees.

The skills the author demonstrated here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.

About Author

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI