Hadoop Workshop III: One Stop Shop -- One System Fit All Sizes of Data

Posted on May 13, 2014

NYC Data Science Academy is offering two relative courses:

RSVP Big Data with Hadoop, Beginner Level

RSVP Big Data with Hadoop, Intermediate Level


Meetup Announcement:

How Could One Hadoop System Fit all Sizes of Data?

Vivian introduced this awesome Hadoop-based system born in Shanghai high tech park.

Speaker: Vivian Zhang, CTO and co-founder of SupStat Inc, organizer of NYC Open Data Meetup, Founder of NYC Data Science Academy. She teaches R and Hadoop.


TDH(Transwarp Data Hub) is a business solution for storage and analysis of data from 100GB to over 2PB. Besides flexibility, it is faster than most popular systems. Moreover, built-in PL/SQL interpreter and SparkR system enable users to do complex analysis directly on HDFS.

Transwarp's founder was Intel's Engineering Manager for Big Data Product Team for 9 years and 11 months. Transwarp recently finished fundraising A round.


  1. Weakness of traditional systems for big data analysis

  2. Unique features of TDH(Transwarp Data Hub)

  3. Industrial applications

  4. Built-in PL/SQL interpreter, Tableau and SparkR.

  5. High-level cooperation and service

Past Hadoop Workshop with Slides:

Hadoop Workshop I: Configure Your First Hadoop Cluster on Amazon EC2

Hadoop Workshop II: Run Map Reduce Jobs on Your Amazon Cloud

About Author

Related Articles

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

2019 airbnb alumni Alumni Interview Alumni Spotlight alumni story Alumnus API artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Big Data bootcamp Bootcamp Prep Bundles California Cancer Research capstone Career citibike clustering Coding Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Industry Experts Job JP Morgan Chase Kaggle lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Open Data painter pandas Portfolio Development prediction Programming PwC python python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest recommendation recommendation system regression Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Tableau Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping What to expect word cloud word2vec XGBoost yelp