Page 1: Hadoop presentation

Hadoop

Edited by Cassell Hsu

2013.04.19

Page 2: Hadoop presentation

Hadoop

[Diagram: one Master and three Slaves]

Page 3: Hadoop presentation

Hadoop

• Master

• NameNode

• JobTracker

• SecondaryNameNode

Page 4: Hadoop presentation

NameNode

• Where is NameNode?

• Master

• HDFS (Hadoop Distributed File System)

• What is NameNode?

• Location information of the data

• Attributes of the data

??

Page 5: Hadoop presentation

NameNode

• Location information?

• All data is stored on the DataNodes
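
The split between metadata and data can be sketched as follows. This is a minimal illustration, not Hadoop's actual implementation: `ToyNameNode`, `FileMetadata`, the path, and the DataNode names are all hypothetical. It only shows the idea from the slides, that the NameNode keeps location information and attributes while the data itself sits on the DataNodes.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class FileMetadata:
    """Hypothetical per-file record kept by a NameNode-like index."""
    owner: str        # an attribute of the data
    size_bytes: int   # an attribute of the data
    # Location information: block id -> DataNodes holding a copy of that block.
    block_locations: Dict[str, List[str]] = field(default_factory=dict)


class ToyNameNode:
    """Stores metadata only; block contents would live on the DataNodes."""

    def __init__(self) -> None:
        self._files: Dict[str, FileMetadata] = {}

    def register(self, path: str, meta: FileMetadata) -> None:
        self._files[path] = meta

    def locate(self, path: str) -> Dict[str, List[str]]:
        """Return, for each block of `path`, the DataNodes that store it."""
        return self._files[path].block_locations


namenode = ToyNameNode()
namenode.register(
    "/example/a.txt",  # hypothetical path
    FileMetadata(
        owner="user",
        size_bytes=128 * 1024 * 1024,
        block_locations={
            "A": ["datanode1", "datanode2"],
            "B": ["datanode2", "datanode3"],
        },
    ),
)
locations = namenode.locate("/example/a.txt")
```

A client would first ask the NameNode where the blocks are, then read the blocks from the DataNodes it was pointed to.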

Page 6: Hadoop presentation

DataNode

• What is DataNode?

• Stores the data

• Where is DataNode?

• HDFS

• Slaves (and Master)

Page 7: Hadoop presentation

[Diagram: a User, a NameNode, and three DataNodes. A 128 MB file is split into two 64 MB blocks, A and B, and copies of the blocks are placed on the DataNodes.]

Check hdfs-site.xml
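
The block arithmetic in the diagram can be sketched as below. This is a hedged illustration only: the 64 MB block size comes from the slide, while the replication factor of 2, the DataNode names, and the round-robin placement are assumptions made for the example and are not HDFS's real placement policy. In Hadoop 1.x releases the corresponding hdfs-site.xml properties are `dfs.block.size` and `dfs.replication`.

```python
from typing import Dict, List

BLOCK_SIZE = 64 * 1024 * 1024  # 64 MB, the block size shown on the slide


def split_into_blocks(file_size: int, block_size: int = BLOCK_SIZE) -> List[int]:
    """Return the size in bytes of each block a file would be cut into."""
    if file_size < 0 or block_size <= 0:
        raise ValueError("file_size must be >= 0 and block_size must be > 0")
    full_blocks, remainder = divmod(file_size, block_size)
    sizes = [block_size] * full_blocks
    if remainder:
        sizes.append(remainder)  # the last block may be smaller than block_size
    return sizes


def place_blocks(block_ids: List[str], datanodes: List[str],
                 replication: int) -> Dict[str, List[str]]:
    """Assumed round-robin placement, for illustration only."""
    if not 1 <= replication <= len(datanodes):
        raise ValueError("replication must be between 1 and the number of DataNodes")
    placement = {}
    for i, block_id in enumerate(block_ids):
        # Each replica goes to the next DataNode in the ring.
        placement[block_id] = [
            datanodes[(i + r) % len(datanodes)] for r in range(replication)
        ]
    return placement


sizes = split_into_blocks(128 * 1024 * 1024)  # a 128 MB file gives two 64 MB blocks
placement = place_blocks(
    ["A", "B"], ["datanode1", "datanode2", "datanode3"], replication=2
)
```

With these assumed inputs, block A lands on the first two DataNodes and block B on the last two, so every block survives the loss of any single DataNode.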

Page 8: Hadoop presentation

Hadoop

[Diagram: Master with NameNode and DataNode; Slaves with three DataNodes]

Page 9: Hadoop presentation

Hadoop

• Master

• NameNode

• JobTracker

• SecondaryNameNode

Page 10: Hadoop presentation

JobTracker

• What is JobTracker?

• Schedules jobs

• Where is JobTracker?

• Master

Who does the work?

Page 11: Hadoop presentation

JobTracker & TaskTracker

        JobTracker       TaskTracker
Where   Master           Slaves
What    Schedules jobs   Executes tasks
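
The division of labour in the table can be sketched as follows. This is a toy model under stated assumptions: `ToyJobTracker` and `ToyTaskTracker` are hypothetical names, and the round-robin hand-out is a simplification. The real JobTracker assigns tasks in response to TaskTracker heartbeats and takes data locality into account.

```python
from collections import deque


class ToyTaskTracker:
    """Runs on a slave and executes the tasks it is given."""

    def __init__(self, name):
        self.name = name
        self.completed = []

    def execute(self, task):
        result = task()  # a task is any zero-argument callable in this sketch
        self.completed.append(result)
        return result


class ToyJobTracker:
    """Runs on the master and only decides which tracker gets which task."""

    def __init__(self, trackers):
        self._trackers = list(trackers)
        self._queue = deque()

    def submit(self, task):
        self._queue.append(task)

    def run_all(self):
        """Hand queued tasks to the trackers in round-robin order."""
        results = []
        turn = 0
        while self._queue:
            task = self._queue.popleft()
            tracker = self._trackers[turn % len(self._trackers)]
            results.append((tracker.name, tracker.execute(task)))
            turn += 1
        return results


trackers = [ToyTaskTracker("slave1"), ToyTaskTracker("slave2")]
job_tracker = ToyJobTracker(trackers)
for n in (1, 2, 3):
    job_tracker.submit(lambda n=n: n * n)  # bind n now so each task keeps its own value
results = job_tracker.run_all()
```

The scheduler never runs a task itself; it only records which slave ran it and what came back.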

Page 12: Hadoop presentation

Hadoop

[Diagram: Master with NameNode, DataNode, and JobTracker; Slaves with DataNode and TaskTracker]

Page 13: Hadoop presentation

Hadoop

• Master

• NameNode

• JobTracker

• SecondaryNameNode

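Page 14: Hadoop presentation

SecondaryNameNode

• What is SecondaryNameNode?

• Aids recovery when the NameNode fails (it holds periodic checkpoints of the NameNode's metadata and is not a live standby)

• Where is SecondaryNameNode?

• Master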

Page 15: Hadoop presentation

Hadoop

[Diagram: Master with NameNode, DataNode, JobTracker, and SecondaryNameNode; Slaves with DataNode and TaskTracker]

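Page 16: Hadoop presentation

MapReduce

[Diagram: the User submits input A to the Master (NameNode). A is split into A1 and A2, which are handled as Tasks on Slave1 and Slave2 and produce Result1 and Result2. Reduce combines them into the Final Result, which is stored in HDFS.]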
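
The flow in the diagram can be sketched in a few lines. This is a single-process illustration, not Hadoop's API: word counting is an assumed example job, the function names are hypothetical, and the shuffle step that groups keys between map and reduce in real MapReduce is folded into the reduce function here.

```python
from collections import Counter
from functools import reduce


def split_input(lines, parts):
    """Cut input A into up to `parts` slices of roughly equal length (A1, A2, ...)."""
    size = max(1, -(-len(lines) // parts))  # ceiling division, at least 1
    return [lines[i:i + size] for i in range(0, len(lines), size)]


def map_task(chunk):
    """Task run on one slave: count the words in its slice."""
    counts = Counter()
    for line in chunk:
        counts.update(line.split())
    return counts


def reduce_task(partials):
    """Reduce: merge the per-slave results into the final result."""
    return reduce(lambda a, b: a + b, partials, Counter())


input_a = ["hadoop hdfs", "hadoop mapreduce", "hdfs namenode", "hadoop"]
a1, a2 = split_input(input_a, parts=2)
result1 = map_task(a1)  # would run on Slave1
result2 = map_task(a2)  # would run on Slave2
final_result = reduce_task([result1, result2])
```

Each map task sees only its own slice, so the two could run on different machines; only the small partial results have to travel to the reduce step.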

Page 17: Hadoop presentation

MapReduce

• File splitting

• Split by block on Hadoop

• Specified by the program