US Permanent Work Visa - Data Visualization

Posted on Feb 18, 2019


There are approximately 140,000 immigrant visas available each year for workers who seek to immigrate based on their job skills (information from US Citizenship and Immigration Service website).

With so many positions fulfilled by foreign worker, this project aims to be a data visualization tool to help companies and workers to find out what kind of professionals US employers are looking for outside of the country.


The dataset contains information from 2015 to 2018 which was downloaded from website. Each year's record has 125 columns and approximately 100,000 observations.

The first step was clean the dataset where the columns with most relevant information were selected, all wage offers were converted by annual scale and the state name of the job was standardized, because some names were in their short names and other in the long ones).

All scripts may be find here.

Shiny App - Charts

It is possible to find all tabs on the left side of the app. The first data-containing one (Charts) opens two sub-tabs (Visa and Job Information). Click here to open the Shiny App.

Figure 1: Visa sub-tab

The Visa sub-tab shows a bar-chart where it is possible to visualize the number of application per year. The user may filter the visa status (if the application was certified or denied, for example) and the employee's education level. This graph intends to give a first insight about the data where the user may analyze if there is some significant change of the filter variants over the time.

Figure 2: Job Information sub-tab - Top 10 Companies

In the Job Information sub-tab there are four tabs with different data visualization and three interactive tools where all four graphs are affected and the user is able to analyze a specific information.

At the Company tab, there is a bar chart with the top 10 companies that is sponsor of the foreign worker. The user is able to change the number of applicants to the wage offer.

Figure 3: Job Information sub-tab - Top 10 Job Title

At the Job Title tab, there is a bar chart with the top 10 job titles. Based on figures 2 and 3, it may be conclude that the most positions are related with technology.

Figure 4: Job Information sub-tab - Education level by number of applicants
Figure 5: Job Information sub-tab - Education level by average wage offer

At the Education Level and Visa Type tabs, it is possible to visualize the number of applicants for both variables and when the user changes to see by the average wage offer, a box plot is shown. In general, there are more applicants with bachelor's and master's degree and their wage offer are more disperse, as shown on figure 5.

Shiny App - Map

At Map tab, it is possible to visualize the global map in the first tab (From: Country of Citizenship) where represents the country of citizenship of the visa applicant and the US map in the second tab (To: USA State) that is the location of the job. The color of each region of both maps represents the intensity of the variable.

The user may change the number of applicants to the average wage offer, the year and the educational level to analyze how these variables affect the maps.

Figure 6: Map tab - variables
Figure 7: Map tab

Data Scientist

Let's use the app for a specific case where the user wants to search the job title that contains "data scientist" on it.

Figure 8: Companies by number of applicants - Data Scientist search
Figure 9: Companies by average wage offer - Data Scientist search
Figure 11: Education level by number of applicants - Data Scientist search
Figure 8: Education level by average offer - Data Scientist search

It is possible to verify that Microsoft and Facebook are sponsor for data scientists more frequently but Netflix has offered the highest wage salary among the foreign worker with this job title.

The top 10 job title for data scientist seems to be leaders position and the wage range is between 150k and 200k yearly.

Most of the data scientist has master's degree


This app is a tool to visualize some job information from positions fulfilled by foreign workers.

In general, it may say that the most applicants are from India, have bachelor's or master's degree, have job title and work for companies related with technology and the location of these jobs are concentrated in California.

About Author

Stella Oliveira

Data scientist with a background in financial services and demonstrated experience managing data and deploying predictive models. Highly motivated to combine the ability to thrive in a fast-paced work environment with the fascination for generating insights from complex...
View all posts by Stella Oliveira >

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 airbnb Alex Baransky alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus API Application artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep Bundles California Cancer Research capstone Career Career Day citibike clustering Coding Course Demo Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day Discount dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Hiring Partners Industry Experts Instructor Blog Instructor Interview Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research Meet the team meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time Portfolio Development prediction Prework Programming PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau team TensorFlow Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping Weekend Course What to expect word cloud word2vec XGBoost yelp