Analyzing Gender Data in the Activewear Market

Posted on Mar 30, 2021
The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.

Github Repository


Data shows most clothes and shoes aren't made unisex. In the activewear market, most companies produce a line of men's products and a separate line of women's products. These separate lines of clothes tend to be very similar looking. A potential activewear company, or a current, struggling activewear company may have questions like:

  • Should gender be a factor when it comes to product pricing?
  • Should gender be a factor when it comes to variety of products?

For guidance, a smart thing to do would be to look at highly successful activewear brands and analyze their strategies. Two of the top activewear brands, or arguably THE top two activewear brand, are Nike and Adidas.

Web Scraping Data on the Products is a majorly successful American online shoe and clothing retailer. It is a major retailer for Adidas and Nike, and sells thousands of their products. Therefore, I scraped to gather data on products by Nike and Adidas.

I decided to analyze three activewear categories: sneakers, shorts, and short sleeve shirts. I specifically chose these three categories because I wanted gender to be the only independent variable in my study. In these three categories, the types of styles available seem to be similar for both genders, lending to the opportunity for a fair analysis of the influence of the specific factor of gender on activewear prices and variety.

When I scraped the price of each product, I made sure to scrape only the list price of each item and never the sale price. This way the analysis of prices would be fair.

I made sure not to include any fashion sneakers in the analysis, as Adidas does have a sub-division of sneakers that are more fashion focused than sports or exercise focused, whereas Nike does not. I didn't think my analysis comparing Nike and Adidas would be fair if I included products in this sub-division, as only one brand makes this type of shoe. Anyway, this sub-division was only a small percentage of Adidas sneakers being sold.

Data Observation and Analysis of Pricing for Each Brand

I decided to observe the price range and mean price for each gender in these three categories for the two brands. When looking at a specific brand, important questions are:

  • Does gender have an effect on mean price for all of the categories, or for any of the categories?
  • On average, is one gender's products more expensive than the other's for all of the categories?
  • Does gender have an effect on price range for all or any of the categories?
  • Is the price range for one gender larger than the others for all of the categories?

Below are box plots to visualize the data:

Analyzing Gender Data in the Activewear Market

Ratio of Pricing

Below are bar plots to visualize the ratio of mean price, Men to Women: (The black line indicates where a 1:1 ratio would fall, indicative of no difference in average price between genders)

Analyzing Gender Data in the Activewear Market

Taking a look at Nike, for sneakers, there is not a significant difference in price range between the gender subcategories. For short sleeve shirts and shorts, when excluding the outliers, the men subcategory has a significantly larger price range.

There is not a significant difference in average product price between the men and women subcategories for the sneakers category. However, the average product price for the shorts category is significantly higher for the men subcategory, and for the short sleeve shirts category, slightly significantly higher for the women subcategory.

Taking a look at Adidas, for sneakers, there is not a significant difference in price range between gender subcategories. For short sleeve shirts and shorts, the men subcategory has a significantly larger price range.

The average product price for short sleeve shirts is significantly higher for the women subcategory. For sneakers and shorts, the average product price is significantly higher for the men subcategory.

Data Observation and Analysis of Variety of Options

It appears that both brands consistently provide a greater variety of options for men.

Which Analyses are Consistent Between Brands

After observing and analyzing both brands, important questions are:

  • Are the analyses the same for both brands?
  • What seems to be consistent between brands and what does not?

The analyses are not exactly the same for both brands. But there are multiple points that are consistent between both brands:

  1. Both have a greater variety of options for men in every category.
  2. There isn't a significant different in price range for the sneakers category between the gender subcategories.
  3. There is a significant difference in price range for shorts and short sleeve shirts between the gender subcategories, that being that the men subcategory has a larger price range.
  4. For the shorts category, the mean price for the men subcategory is significantly higher than for the women subcategory.
  5. For the short sleeve shirts category, the mean price for the women subcategory is significantly higher than for the men subcategory (although for Nike the difference is quite small). 

Conclusion: What Another Company Can Take from This

It would only make sense for a potential or struggling activewear company to consider significant for itself points which are consistent between both brands. Points which are not consistent between brands shouldn't mean much to a potential or struggling company, because both Nike and Adidas are successful, and these points are not true for both of them.

A major point of analysis which is consistent both within a brand and between brands is that men always have more options than women do. A company can choose to look at this point in one of two ways: A company can use this point as a model to follow for their own brand, or it can go in the other direction and make itself attractive by advertising its uniqueness.

Learnings From Nike and Adidas

If a company chooses to follow the approach of Nike and Adidas, they can be confident that it is a safe approach, as Nike and Adidas are majorly successful. However, sometimes being different is what makes a company successful and attractive. So, if a company says we are different than the major brands because we produce an equal variety of options for both men and women, people, especially women, can be intrigued. People may also be more inclined to support a brand that advertises themselves as treating genders equally.

The other points that are consistent between brands can also provide insight for a potential or struggling company:

  • Sneakers should have a similar price range for both genders.
  • Men should be provided with a larger range of pricing options when it comes to shorts and short sleeve shirts.
  • On average men's shorts should be more expensive.
  • On average women's short sleeve shirts should be more expensive.

There are two ways to approach the last three points. These two ways were mentioned earlier. A company can choose to either follow these points of analysis when producing and pricing their products, or to go in the direction of providing and pricing equally for both genders and advertise itself as a brand that does this. Either approach is not more correct than the other. It's up to a company to decide which approach they believe would attract more customers and increase revenue.

Potential Future Work

I think it would be interesting to analyze categories of clothes other than activewear in which the men's products being sold are similar to the women's products being sold. It would be interesting to see if trends were consistent among brands within the categories.

About Author

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI