Big Data in a Small Package - Building a Raspberry Pi Cluster for Hadoop and Spark

Scott Edenbaum
Posted on May 6, 2017

Q) What can we do with 5 Raspberry Pi 3 computers?

A) Setup a Jupyter Notebook Environment Tied to a Hadoop/Spark Cluster.

Follow along for detailed instructions!

April 27, 2017

Raspberry Pi 'Bramble' Cluster (The Embedded Linux Wiki says a Bramble is defined as "a Beowulf cluster of Raspberry Pi devices")

This is the first of a three part post regarding my latest project - a Raspberry Pi 3 cluster I created to run a fully distributed version of Hadoop & Spark.

clusterpi

(For those eagle-eyed readers who noticed that there are only 4 Pi's in this picture, I have the 5th Pi in an individual case - outside of the frame)

What is a Raspberry Pi 3 and why should I care?

The Raspberry Pi Foundation is a UK based nonprofit that focuses on promoting basic computer science in schools and in developing countries. Since 2014 the Raspberry Pi Foundation created a series of small single-board computers, and released their latest flagship model, the Raspberry Pi Model 3 in February 2016 for $35.

pi3

Continue to Page 2 for Specifications, Project Details & Motivations!


About Author

Scott Edenbaum

Scott Edenbaum

Scott Edenbaum is a recent graduate from the NYC Data Science Academy. He was hired by the Academy to assist in buildout of the learning management system and seeks to pursue a career as a Data Scientist. Scott's...
Read more

Leave Responses

Your email address will not be published. Required fields are marked *

quality social links November 17, 2017
uxtchfdrn vliez kodkxov goiq secejtvivnlmwdj
get social shares October 2, 2017
kiroqcdng gwqde xlyelnd hnjt kzolcoexsibdbha
Jair June 13, 2017
Hi Scott, Thanks for posting. As a heads up, "sudo pip3 install ipython3" didn't work for me. However, "sudo pip3 install ipython" seems to work fine. Jair