The Great Escape: Health, Wealth, and the Origins of Inequality

Posted on Dec 7, 2015

Contributed by . He took NYC Data Science Academy 12 week full time Data Science Bootcamp program between Sept 23 to Dec 18, 2015. The post was based on his second class project(due at 4th week of the program).

Overall Goal: Build a companion interactive presentation for a book, The Great Escape: Health, Wealth, and the Origins of Inequality, by Nobel Prize-winning economist, Angus Deaton.

Specific Goal: Build an interactive presentation with embedded Shiny apps

Tools Used: RStudio, Google Charts and GoogleVis

Angus Deaton, winner of the 2015 Nobel Prize in economics, is a Professor of Economics and International Affairs in the Woodrow Wilson School of Public and International Affairs and the Economics Department at Princeton University.

In The Great Escape, Angus Deaton–one of the foremost experts on economic development and on poverty–tells the remarkable story of how, beginning 250 years ago, some parts of the world experienced sustained progress, opening up gaps and setting the stage for today's disproportionately unequal world.

The interactive presentation with the Shiny apps embedded in it, is live at http://216.230.228.88:3838/bootcamp003_student/Project2-Shiny/DavidComfort-Shiny-Project/Great-Escape-v3.Rmd.

A screen capture of the presentation is below:

[youtube http://www.youtube.com/watch?v=qElqGyCSB9k?rel=0&w=560&h=315]

Outline of creating an interactive presentation with embedded Shiny apps

  • Create a new R Markdown file in R Studio.
  • Select Presentation and HTML (ioslides) when creating the presentation (see Figure 2.)
Figure 2: Create an interactive presentation in R Studio

Figure 2: Create an interactive presentation in R Studio

  • The header of the file now contains the following:
  • Now, the ioslides_presentation output format is specified in the front-matter of the document.
  • We have added our own CSS to the ioslides presentation using the css option. The subdirectory where the css file, style.css is located is specified as well by www.
  • When knitr processes an R Markdown input file it creates a markdown (md) file which is subsequently tranformed into HTML by pandoc. If you want to keep a copy of the markdown file after rendering you can do so using the keep_md option.
  • You can add a logo to the presentation using the logo option.
  • You can customize the speed of slide transitions using transition option. This can be “default”, “slower”, “faster”, or a numeric value with a number of seconds (e.g. 0.5) .
  • You can display the presentation using a wider form factor using the widescreen option.
  • The front matter also contains runtime: shiny, so you can embed shiny apps into the document. Once you’ve added runtime: shiny to the document you can run it using either the Run Document command in RStudio or using the rmarkdown::run function. By default, documents are re-rendered on save, so once you’ve got a browser open with the document loaded, just save the R Markdown file to see your changes.
  • You can create a slide show broken up into sections by using the # and ## heading tags (you can also create a new slide without a header using a horizontal rule (----). For example here is the beginning of our slide show:
  • Embed a Shiny app within one of the slides by defining the application inline using the shinyApp function:
  • Note that in all of R code chunks above the echo = FALSE attribute is used. This is to prevent the R code within the chunk from rendering in the document alongside the Shiny components.

Specific Steps for embedding our Shiny App

I embedded several GoogleVis Motion Charts in my interactive presentation. I went over how I created a GoogleVis motion chart using R and R Markdown in a previous blog post, Gapminder Data Visualization using GoogleVis and R.

The specific steps involved were:

  • Load googleVis and shiny libraries
  • Specific the input choices, myChoices, for the checkboxes
  • Load the rda file (previously saved) into an R data frame, gapdata
  • Set the Sub.Region column of the data frame as a factor
  • Call the shinyApp.
  • Set up the ui.
  • I have used a fluidPage and put the interactive element, the checkboxes, in a sidebar.
  • Note that the name of the element is "Sub.Region", the elements are the elements specified myChoices above.
  • I have also chosen to have have all elements checked using selected = myChoices.
  • The main panel simply has a placeholder for the output of the motionchart, called motionchart for simplicity.
  • The next section calls the shiny server.
  • On the server side, Shiny applications use the input object to receive user input from the client web browser. The values in input are set by UI objects on the client web page.
  • There are also non-input values (in the sense that the user doesn’t enter these values through UI components) that are stored in an object called session$clientData.
  • To access session$clientData values, you need to pass a function to shinyServer() that takes session as an argument (session is a special object that is used for finer control over a user’s app session). Once it’s in there, you can access session$clientData just as you would input.
  • The target_gapdata "listens" to the client as far as which checkboxes are selected and creates a subset of the data frame based upon which Sub.Regions a user has selected. It is important that this statement resides outside the output statement below.
  • Then, the output$motionchart statement calls renderGvis.
  • The statement, target_gapdata <- target_gapdata(), calls the aforementioned statement to select only the rows for the Sub.Regions which have been selected by the user.
  • A Google Motion Chart is returned.
  • The parameters for GoogleVis include:
    • data: a data.frame
    • idvar: the id variable , “Country” in our case.
    • timevar: the time variable for the plot, “Years” in our case.
    • xvar: column name of a numerical vector in data to be plotted on the x-axis.
    • yvar: column name of a numerical vector in data to be plotted on the y-axis.
    • colorvar: column name of data that identifies bubbles in the same series. We will use “Region” in our case.
    • sizevar – values in this column are mapped to actual pixel values using the sizeAxis option. We will use this for “Population”.
  • The options=list(state= statement is used to specify the initial conditions for the chart. This state can be copied from an existing chart by going to advanced settings and copying the state string. (See Figure 3.)
Setting State for GoogleVis

Figure 3: Setting State for GoogleVis

Embedding Images, articles and Videos in an interactive presentation

  • Images can easily be embedded in the presentation by using the following format:

<img src="www/equation.gif" alt="Happy Planet Index"></img>

  • It is also possible to embed videos by simply embedding an iframe with the following format:

[youtube http://www.youtube.com/watch?v=XwLNqDbPNBw?rel=0&amp;controls=0&amp;showinfo=0&w=853&h=480]

An article can also be embedded using an iframe:

Lessons Learned

  • I had tried to create an interactive presentation using a package, Slidify, but I could not get it to work with GoogleVis. So rather forcing a tool, Slidify, to do what you want it to (and ends up not working), find another tool, Rstudio ioslides (that works) to do the project.
  • A lot of well-known scientific papers and books are not easily reproducible and a lot of data is not publicly available, or at least readily available.
  • A lot of data on important issues is simply missing, especially going back in time.
  • It would be great to have companion websites / applications for papers and books which rely upon data analysis and visualization.

The interactive presentation with the Shiny apps embedded in it, is live at http://216.230.228.88:3838/bootcamp003_student/Project2-Shiny/DavidComfort-Shiny-Project/Great-Escape-v3.Rmd.

More information about creating interactive presentations using R, R Markdown, Shiny and Knitr can be found at:

About Author

David Comfort

David Comfort, D.Phil. is a data scientist, scientist, activist and writer. His doctoral research at Oxford University was in protein nuclear magnetic resonance (NMR) and computational biology. His post-doctoral research at UCLA involved genomics, bioinformatics as well as...
View all posts by David Comfort >

Related Articles

Leave a Comment

No comments found.

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI