Leveraging What We Know About Meetups Part 1

Posted on Mar 14, 2018

Photo Credit: https://kryptomoney.com/blockchain-meetup-gurgaon-october-6-2017/



I admit, when I first heard about Meetup.com about a year ago, I thought the whole idea was kind of strange. It sounded like a dating site. But with a growing interest in Data Science, a previous co-worker brought me to a meetup held by New York Open Statistical Programming. Hesitant at first, I ended up seeing the genius of it all. In case you haven't heard already, Meetup.com is a convenient way to meet others who share the same interests and hobbies as you do.  In my case, I saw how easy it was to learn from so many great minds at these gatherings. I knew from that day that these meetups will be vital for my continued growth as a Data Scientist.

One day I saw Meetup.com data  posted on Kaggle by Sumit Kumar. Curiosity got a hold of me and I really wanted to know what kind of meetups other people were forming. Was I missing out on something really cool? Then it dawned on me. For whatever reasons people are forming meetups, the topic must be really important to them if they are actively building a community around it. In a way, this site is basically an encyclopedia of what the people demand. And if we know what the people demand, we know how to supply them as well. This blog is a rough sketch of what applications could be built by leveraging what we know about people's meetups.



Instead of constantly retyping out code, messing with filters and such, building an app using R Shiny would let me explore the Meetup data with ease. The link to my app is included here.

The dashboard consists of three different tabs you can click at the top:

  1. Analysis Overview (a quick look at the data)
  2. Find a Group (locating popular groups within your city)
  3. Find a Venue (locating popular venues within your city)



As you can imagine, a data set size containing info of ALL the meetups since site launch would be quite large. Sumit Kumar who posted the data on Kaggle, limited the data set to only include:

  • Data from three cities: New York, Los Angeles and Chicago
  • Meetup events under the 3 categories:
    • Arts & Culture
    • Book Clubs
    • Career & Business

I was a bit disappointed about the limited categories that I could analyze. But I figured if I could build a template for now, it can be utilized later for when I pull my own data in the future.


Setting the time frame from January 1st 2010 to October 31st 2017, I wanted to see the overall trend of these meetups.

The line graph on the left shows the growth in the number of members joining Meetup groups. Members tend to join more groups in the 3rd quarter of the year, followed by a dip in the fourth quarter. The trend then jumps back up at the start of the new year. This might make for a great Time Series analysis project in the future!

The bar chart on the right shows us that Meetup.com is mainly being utilized to bring Tech communities together. This is followed by Career & Business and then Socializing communities.


With the bubble chart above, I wanted to see if there were any characteristics that made certain events more popular. Was it the category the event fell under? member size? how long the event was? The bubble chart can visualize this all. Interestingly, even though Career & Business events clearly dominate in terms of number of events, it seems Arts & Culture meetups tend to have the most attendees (indicated by the size of the bubble). As a result, Arts & Culture events have a much higher RSVP count (indicated by the height of the bubble). Meetups that last around 3 hours long also seem to be just the right amount for a good head count. Any more and attendees seem to drop. I mean, we've all got plans right?



Those who are new to the scene may want a quick way to find a group to join. In this case, your criteria may just be:

  1. What group falls under category XXX?
  2. What group is located in city YYY?
  3. What group is the most popular?

If this is true, than the second tab of my app can help! Simply filter out what city you're in and what category is of interest and the chart will list the groups with the most members. For example, I live in NYC and I love games, perhaps I'll check out Playcrafting!




On the other hand, what if you want to HOST a group? Where should you host it? For an ease of mind, one might want to host their event at a venue that has held similar events. It would seal the deal if that venue had a lot of positive ratings as well. In such a scenario, the third tab of this app would be useful.

If I wanted to host a Book Club event in NYC, simply filter out the city and category, and the top venues will pop up. The Ukranian East Village Restaurant on 2nd Ave seems to be a good choice. AND lots of ratings...huh.



There are so many things I wish I could do more of with this project. That Time Series Analysis on when people are likely to look and join groups. Another would be to add a feature on the Venue Locator where a user can input an address. The app would then provide the distance required to reach a particular venue. But most of all, I'd love to scrape my own data off Meetup.com and get a closer look into the Tech community...perhaps I will!









About Author

Kenny Moy

Kenny has years of experience providing data driven solutions in industries such as marketing, healthcare, real estate, and public service. In addition to machine learning, he loves the AHA! moments, storytelling, and the creativity aspects of data science.
View all posts by Kenny Moy >

Related Articles

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI