Distributed File Systems and MapReduce

Session 101
King Chung Huang
Portfolio Manager

Google

Google's First Production Server

Google’s First Production Server.jpg on Wikimedia Commons

Scale of Information

Google's Oregon Data Center

Powerful Software

Research at Google

Google File System

MapReduce

MapReduce Diagram

Example: Word Count

Word Count Diagram
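
The word count example can be sketched in a few lines of plain Python. This is a minimal single-process illustration of the map, shuffle, and reduce steps, not Hadoop's or Google's actual API; the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical and chosen only for this sketch.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one input document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by key.

    A real framework does this across machines; here it is a local dict.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(word, counts):
    """Reduce step: sum the counts collected for a single word."""
    return word, sum(counts)


def word_count(documents):
    """Run map over every document, shuffle, then reduce each group."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    return dict(reduce_phase(word, counts) for word, counts in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["the quick fox", "the lazy dog the end"]))
```

In a distributed setting each call to `map_phase` and `reduce_phase` could run on a different machine, since neither depends on shared state; only the shuffle step needs to move data between them.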

Example: Video Compression

Video Compression Diagram

Big Data

Hadoop

Hadoop Ecosystem

Exercise (using EMR)

Exercise (Hadoop on EC2)

Exercise (Hadoop on Mac)

May 30, 2013