Market Data Analysis of a C2C Fashion Store

Posted on Feb 8, 2021


Consumer-to-consumer (C2C) stores offer an interesting model for market interaction where an electronically facilitated interaction occurs between consumers through the help of a third party. These stores are becoming increasingly popular throughout various industries. Websites such as Craiglist and eBay are good examples of functioning using this business model. In a society that is becoming increasingly dependent on e-commerce, especially with the advent of social media and influencers and the more recent effect of the global pandemic of COVID19, I found it interesting to explore the workings of a C2C e-commerce store. The store I ended up picking based on the available data was a French e-commerce Fashion Store.

The Dataset: Kaggle dataset of 98K users

The dataset contains web-scraped information about a fraction of the registered users (~98,000 users) of a French C2C website (established in 2009 in Europe) which has over 9 million registered users. It contains user information by gender, country, language, apps, the number of products on their wishlist, the number of products they list to sell and buy as well as their social media presence. Additionally, there is data about the top sellers among the users subscribed to the app.Β 

Data Analysis

For my data exploration project, I chose to use the web scraped data to analyze the seller and user information based on a variety of characteristics. I also analyzed the shopping habits as well as the top sellers information to present a holistic view of the analytics of a retail store, help monitor the health of the store and identify the key markets, users, and sellers that could be targeted to improve engagement.

Through the graphs on the side, it is clear that the countries with the biggest market for the target users are France, USA, Great Britain, Italy and Germany. However, the majority of the users use English as their preferred language, followed closely by French, Italian, German and Spanish. The users are predominantly female, men make up only 1/4th of the users.

It is also surprising to note that the majority of the users of the e-commerce fashion store do not use the app to buy or sell products as can be seen from the above graph. Surprisingly, Sweden, Denmark, and Belgium have the highest number of app users, despite the fact that they don't have the highest turnaround. This indicates that there is potential for the store to expand the market with people who use the store but are maybe not incentivized enough to use the app.

Additionally, we can recognize that there is some relationship
between countries with the most users and if they used an app or not. However, it is harder to find a relationship between iOS and
Android using countries. It seems that iOS in general is more
popular than Android among users, but that does vary by region. A lot more Spanish users used Android apps in comparison to iOS apps. However, American users favor the iOS version.

It is not surprising at all to note that the majority of the products listed are by females as can be seen from the plot below. Great Britain and Sweden are the strongest markets for the products listed by women, followed closely by Denmark. However, when we look at the male ratio of both products listed and products sold, it seems that Australia and Italy are strong markets for men too.

The two plots above show some interesting insights. First of them is the fact that the products wished by men in France are much higher than the products wished by women. This is important to note because France is the biggest market for the fashion store as it is the country the company is most directly targeting. Additionally, other countries with near equal ratios of products wished by men and women are Great Britain and USA, with men being higher than women in Great Britain and only falling slightly short of women in the USA.

Furthermore, the statistics follow the same trend closely in the number of products bought. France has a higher conversion rate for men than women. The US falls slightly short for men like it appeared in the trend for the number of products wished.

Based on the user market, one may have expected to see a large number of French, American and UK sellers. In fact, though the top seller market looks different from the users market. The top sellers belong to one of the following 10 countries: Latvia, Romania, Sweden, Bulgaria, Germany, USA, Spain, UK, France, Italy. However, it is interesting to note that the maximum sellers come from Italy. Additionally, countries like Latvia, Romania and Bulgaria are also quite unexpected. It is possible though that these countries provide cheaper alternatives to clothing in comparison to some of the other more prominent EU companies.


A key limitation of this work is that the dataset is only a subset of the users of the fashion store. So most of the identified trends might not hold up when assessing the entirety of the 9 million subscribers. Also, no real analysis can be performed on the basis of the number of iOS and Android users as the trends of the people who own these phones do not follow the trends we observe in the dataset.Β 

Conclusion and Further Work

A preliminary analysis of the subscribed users of an e-commerce C2C French Fashion store was performed using the Kaggle dataset. Several interesting insights were obtained, including the breakdown of users through their country, gender, and preferred language. Additionally, it was observed that there is a big opportunity for subscribed users to be converted into using the website app. Moreover, some key markets for users who list and sell maximum products were identified. In addition, the top sellers across the platform were also identified.Β 

However, there is a lot of future work that can be done using the data. Several elements that were not explored include the identification of the social media activity, followers and following accounts. This could indicate which users could be incentivized to promote the use of the app. Additionally, the preferred languages could be grouped by the countries the users belong to in an effort to personalize their experience of the website as well as the app.Β  Β 



Shiny App

The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.

About Author

Yukti Kathuria

Yukti holds a B.S. and M.S. in Aerospace Engineering from the University of Illinois at Urbana-Champaign, and is extremely passionate about problem-solving. She finds data visualization to be one of the most interesting and insightful tools to understand...
View all posts by Yukti Kathuria >

Related Articles

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI