1
Peer-to-Peer Communication Protocols and Applications: Seed-Teacher Training Course (點對點通訊協定及應用種子教師研習課程)
Part IV: Infrastructure of P2P & P2P in Mobile Environment
Shiao-Li Tsao (曹孝櫟), Dept. of Computer Science, National Chiao Tung University
2
Outline
Structure of this course
Introduction to P2P
Infrastructure of P2P
P2P in Mobile Environment
3
Course Outline (課程綱要)
Introduction to P2P
– Introduction (what, why)
– Survey of P2P networks (commercial, freeware, research)
– Issues of P2P (infrastructure, search, routing, download)
Infrastructure of P2P
– Centralized (Napster)
– Unstructured (Gnutella)
– Structured (Chord, CAN, Pastry)
– Hybrid (unstructured + structured, KaZaa, BT)
– Hierarchical
Performance issues of P2P (improving P2P performance)
– Neighbor selection
– Infrastructure maintenance overhead
– Routing (proximity)
– Searching (keyword, semantic content search)
– Download
– Mobile issues
– Replication (cache)
– Hot-spot and free-rider issues
Applications of P2P
– File sharing
– Storage
– Video streaming (live, VOD, P2PTV)
– VoIP over P2P (Skype, P2PSIP)
– Wireless (structured or MANET)
– Semantic content search
Performance analysis of P2P
– Simulation tool: PeerSim
– Analytical models
Implementation of P2P
– JXTA
4
Part I: Introduction to P2P
5
Why P2P?
"Sometimes" we prefer P2P
– Even while an infrastructure exists
– Because it is natural and convenient
We share resources, information, …
We help each other …
Although it is
– Less reliable
– Less secure
– Less efficient
6
Why P2P?
What do we mean by "an infrastructure"?
– Thanks to IC/IT technologies
– Thanks to broadband access technologies and the Internet infrastructure
– Thanks to P2P mechanisms
7
Why P2P?
P2P today: P2P has come to dominate Internet traffic
Source: CacheLogic.
In 2006, P2P accounted for more than 60% of Internet traffic
8
What’re P2P Technologies?
What we can share
– Share resource directories: file directories, phone books, …
– Share information: messages, presence, NAT, …
– Share content: files, MP3s, stored and live video, …
– Share computation power: Grid, …
– Share physical devices: virtual hard disks, …
9
What're P2P Technologies? (Cont.)
How we can share information and resources
– Search
– Retrieval
No one technology fits all applications; it really depends on
– Characteristics of resources (size, real-time, stored/live, …)
– User behaviors (community, access pattern, …)
10
What're P2P Technologies? (Cont.)
Operations in P2P systems consist of three phases
– Peer discovery (bootstrap): well-known nodes, cached peers, broadcasting, …
– Resource discovery (search): locate a resource given its identifier
– Communication or data transfer: direct communication, NAT/firewall traversal, ALM
11
Part II: Infrastructure of P2P
12
Part II-1: Centralized P2P
13
Centralized Index Model (1/3)
Utilizes a central directory for object location, ID assignment, etc.
For file-sharing P2P, locations are queried from the central servers, and files are then downloaded directly from peers
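The three steps of this model (upload index, query, direct download) can be sketched in a few lines of Python. The class and method names here are illustrative assumptions, not part of any real protocol:

```python
class CentralIndex:
    """Toy central directory for a Napster-style file-sharing system."""

    def __init__(self):
        self.index = {}  # filename -> set of peer addresses holding it

    def publish(self, peer, files):
        # Step 1: a joining peer uploads its file list to the directory.
        for f in files:
            self.index.setdefault(f, set()).add(peer)

    def query(self, filename):
        # Step 2: location inquiries go to the central server; the actual
        # download (step 3) then happens directly between the peers.
        return sorted(self.index.get(filename, set()))

server = CentralIndex()
server.publish("peerA:6699", ["song.mp3", "talk.pdf"])
server.publish("peerB:6699", ["song.mp3"])
print(server.query("song.mp3"))   # ['peerA:6699', 'peerB:6699']
```

Note how the server stores only metadata; the content itself never passes through it, which is exactly why search is cheap but the directory is a single point of failure.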
14
Centralized Index Model (2/3)
Figure: centralized repository R; (1) peer A uploads its index, (2) peer B queries R, (3) B downloads directly from A
15
Centralized Index Model (3/3)
Benefits
– Simplicity
– Efficient search
– Limited bandwidth usage
Drawbacks
– Unreliable (single point of failure)
– Performance bottleneck
– Scalability limits (must scale the central directory)
– Vulnerable to DoS attacks
– Copyright infringement
16
Case Study - Napster
Why
– It was difficult to find and download music over networks
– Users wanted to share music with friends
How
– A program that allowed computer users to share and swap files, specifically music, through a centralized file server
Napster: the first popular P2P file-sharing application
17
Napster: System Overview
A large cluster of dedicated central servers maintains an index of shared files
The centralized servers also monitor the state of each peer and keep track of metadata
– The metadata is returned with the results of a query
Each peer maintains a connection to one of the central servers
18
Napster Operation (1/4)
File list and IP address are uploaded
napster.com
19
Napster Operation (2/4)
napster.com
result
query
User requests search at server
20
Napster Operation (3/4)
napster.com
User pings hosts that apparently have the data, looking for the best transfer rate
21
Napster Operation (4/4)
napster.com
User chooses to initiate a file exchange directly
22
Napster: Summary
Napster is not a pure P2P system, but it was the first to raise important issues for the P2P community
Hybrid decentralized unstructured system
– File transfer is decentralized, but locating content is centralized
– A combination of client/server and P2P approaches
The Napster protocol is proprietary
– Stanford University senior David Weekly posted the protocol in 2000
– Napster requested that he remove it, but Weekly created the OpenNap project instead
Napster introduced two major problems
– Unreliability: the central indexing server represents a single point of failure
– Legal responsibility for sharing music files
23
Part II-2: Unstructured P2P
24
Outline
Introduction
Flooded requests model
– Case study: Gnutella
Supernode model
– Case study: Kazaa
25
Introduction
Blindly flood a query through the network, among peers or among supernodes
Flooded requests model (case study: Gnutella)
Supernode model, i.e., hierarchical (case study: FastTrack and Kazaa)
26
Flooded Requests Model (1/2)
Each request is flooded (broadcast) to directly connected peers, which then flood their neighbors
– Until the request is answered or a certain scope (TTL limit) is reached
Benefits
– Highly decentralized
– Reliability and fault tolerance
Drawbacks
– Excessive query traffic
– Not scalable
– May fail to find content that is actually in the system
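A TTL-bounded flood can be sketched as a breadth-first traversal. The graph shape, names, and message counting below are illustrative assumptions, not the Gnutella wire format; the sketch does show why a too-small TTL produces the "content exists but is not found" drawback:

```python
from collections import deque

def flood_search(graph, start, holders, ttl):
    """Flood a request from `start`; stop at the TTL horizon.

    Returns (node that answered or None, number of messages sent).
    """
    visited = {start}
    frontier = deque([(start, ttl)])
    messages = 0
    while frontier:
        node, t = frontier.popleft()
        if node in holders:
            return node, messages          # request answered
        if t == 0:
            continue                       # TTL expired on this branch
        for nb in graph[node]:
            if nb not in visited:          # suppress duplicate requests
                visited.add(nb)
                messages += 1
                frontier.append((nb, t - 1))
    return None, messages                  # out of TTL reach: a false miss

chain = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C"]}
print(flood_search(chain, "A", {"D"}, 1))  # (None, 1): exists, TTL too small
print(flood_search(chain, "A", {"D"}, 3))  # ('D', 3)
```

In a denser topology the message count grows roughly with the number of nodes inside the TTL horizon, which is the excessive-traffic drawback listed above.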
27
Flooded Requests Model (2/2)
Request!
…found
28
Case Study - Gnutella
Napster introduced two major problems
– Unreliability: the central indexing server represents a single point of failure
– Legal responsibility for sharing music files
Gnutella
– Fully distributed peer-to-peer protocol
– Reliability and fault-tolerance properties
– Flooding raises questions of cost and scalability
29
Gnutella: System Overview
Open, decentralized P2P search protocol
Builds, at the application level, a virtual network with its own routing mechanisms
Peers self-organize into an application-level mesh
Each peer initiates a controlled flooding through the network by sending a query packet to all of its neighbors
– TTL is decremented on each hop
30
Gnutella: The Protocol
Peer discovery
– IRC (Internet Relay Chat), web pages, Ping-Pong messages
– Send GNUTELLA CONNECT to one known node address, then wait for GNUTELLA OK
Discovery of peers and searching for files are implemented by passing five descriptors (message types) between nodes
– Ping, Pong, Query, QueryHit, Push
Files are downloaded directly via an HTTP GET request
31
Gnutella: Protocol Message Types

Type: Ping
Description: Announces availability and probes for other servents
Contained information: none

Type: Pong
Description: Response to a Ping
Contained information: IP address and port of the responding servent; number and total KB of files shared

Type: Query
Description: Search request
Contained information: minimum network bandwidth of the responding servent; search criteria

Type: QueryHit
Description: Returned by servents that have the requested file
Contained information: IP address, port, and network bandwidth of the responding servent; number of results and the result set

Type: Push
Description: File download request for firewalled servents
Contained information: servent identifier; index of the requested file; IP address and port to send the file to
32
Gnutella: Connect operation
CONNECT
OK
33
Gnutella: Discovery operation
Ping
Ping
Ping
Ping
Pong
Pong
Pong
Pong
34
Example: Ping/Pong routing
Source: http://rfc-gnutella.sourceforge.net/developer/stable/index.html
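The reverse-path routing shown in the figure, where each servent remembers which neighbor a Ping arrived from and routes the matching Pong back along that hop, can be sketched as follows. Class and method names are illustrative, not the actual descriptor format:

```python
class Servent:
    def __init__(self, name):
        self.name = name
        self.neighbors = []
        self.seen = {}    # ping id -> neighbor it arrived from ('local' at origin)
        self.pongs = []   # responders whose Pongs terminated here

    def start_ping(self, msg_id, ttl):
        self.seen[msg_id] = "local"
        for nb in self.neighbors:
            nb.on_ping(msg_id, self, ttl)

    def on_ping(self, msg_id, came_from, ttl):
        if msg_id in self.seen:
            return                       # duplicate Ping: drop it
        self.seen[msg_id] = came_from
        self.on_pong(msg_id, self.name)  # answer along the reverse path
        if ttl > 1:                      # decrement TTL and keep flooding
            for nb in self.neighbors:
                if nb is not came_from:
                    nb.on_ping(msg_id, self, ttl - 1)

    def on_pong(self, msg_id, responder):
        back = self.seen[msg_id]
        if back == "local":
            self.pongs.append(responder)   # reached the originator
        else:
            back.on_pong(msg_id, responder)

a, b, c = Servent("a"), Servent("b"), Servent("c")
a.neighbors, b.neighbors, c.neighbors = [b], [a, c], [b]
a.start_ping("m1", ttl=2)
print(a.pongs)   # ['b', 'c']
```

Query/QueryHit routing works the same way: the QueryHit retraces the Query's path hop by hop, so intermediate servents never need a route to the originator.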
35
Gnutella: Search & Transfer operations
36
Example: Query/QueryHit/Push routing
Source: http://rfc-gnutella.sourceforge.net/developer/stable/index.html
37
Gnutella: Summary
Fully distributed peer-to-peer protocol
Reliability and fault-tolerance properties
Flooding raises questions of cost and scalability
The current Gnutella protocol cannot scale beyond a network size of a few thousand nodes without becoming fragmented
M. Portmann, P. Sookavatana, S. Ardon, and A. Seneviratne, “The cost of peer discovery and searching in the Gnutella peer-to-peer file sharing protocol,” in Proc. of ICON’01, Vol. 1, pp. 263-268, 2001.
38
Supernode Model (1/3)
Supernode acts both as a local central index for files shared by local peers and as an equal in a network of supernodes
Each peer is either designated as a supernode or assigned to a supernode
Supernodes are equal in search; all peers are equal in download
Examples: FastTrack and Kazaa
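The two-tier behavior, local index first, then a search among supernodes only, can be sketched as follows (an illustrative sketch with made-up names, not the FastTrack protocol):

```python
class Supernode:
    def __init__(self):
        self.local_index = {}   # filename -> peers in this supernode's group
        self.peer_supernodes = []  # equals in the supernode-level network

    def register(self, peer, files):
        # Ordinary peers report their shared files to their supernode only.
        for f in files:
            self.local_index.setdefault(f, []).append(peer)

    def search(self, filename, asked=None):
        # Check the local group, then flood among supernodes only; the
        # `asked` set suppresses duplicate visits, as in any flood.
        asked = asked if asked is not None else set()
        asked.add(id(self))
        hits = list(self.local_index.get(filename, []))
        for sn in self.peer_supernodes:
            if id(sn) not in asked:
                hits += sn.search(filename, asked)
        return hits

sn1, sn2 = Supernode(), Supernode()
sn1.peer_supernodes, sn2.peer_supernodes = [sn2], [sn1]
sn2.register("peerX", ["a.mp3"])
print(sn1.search("a.mp3"))   # ['peerX']
```

Only the supernodes exchange query traffic, which is why the model scales better than flat flooding while keeping downloads peer-to-peer.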
39
Supernode Model (2/3)
supernode
peer node
40
Supernode Model (3/3)
Benefits
– No single point of failure
Drawbacks
– A supernode may become overloaded or be attacked
– Copyright infringement
41
Part II-3: Structured P2P
42
Outline
Document routing model
Case study: Chord
43
Document Routing Model (1/4)
Each peer is assigned a random or hashed ID and knows a given number of peers
An ID is assigned to every shared document based on a hash function
A request goes to the peer whose ID is most similar to the document ID
44
Document Routing Model (2/4)
Figure: a lookup for file ID h(data) = 0008 is routed through peers with IDs 1000, 0200, and 0100 toward peer 0050, whose ID is most similar to the document ID (objects 0005…0008 are held at ID 0050xxxx)
45
Document Routing Model (3/4)
Benefits
– Scalability: more efficient searching, with logarithmic bounds to locate a document
– Fault tolerance
Drawbacks
– Routing-table maintenance
– Network partitioning may cause an islanding problem
46
Document Routing Model (4/4)
Hash table
– A data structure that efficiently maps keys onto values
Distributed hash table (DHT)
– A distributed, Internet-scale hash table
– Lookup, insertion, and deletion of (key, value) pairs
– Supports only exact-match search, rather than keyword search
DHT-based P2P systems
– CAN: S. Ratnasamy et al., UC Berkeley, 2001
– Chord: I. Stoica et al., MIT and Berkeley, 2001
– Tapestry: Ben Y. Zhao et al., UC Berkeley, 2001
47
Case Study - Chord
Chord is a distributed lookup protocol that efficiently finds the location of the node that stores a desired data item
Just one operation: given a key, it maps the key onto a node
In an N-node network, each node maintains information about only O(log N) other nodes, and a lookup requires only O(log N) messages
48
Chord: System Overview
m-bit key/node identifiers using the SHA-1 hash function (m must be large enough)
These identifiers are ordered on an identifier circle modulo 2^m
Chord ring: a one-dimensional circular key space
Key k is assigned to its successor: the first node whose identifier is equal to or follows k in the identifier space
Each node maintains
– A routing table with (at most) m entries, called the finger table
– The previous node on the identifier circle, called the predecessor
49
Example: Chord Ring with m=6
An identifier circle consisting of 10 nodes storing 5 keys
successor(K10) is N14
successor(K38) is N38
successor(K54) is N56
predecessor(N14) is N8
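The successor rule can be checked with a few lines of Python over this example ring (node IDs are taken from the figure; `bisect_left` finds the first ID at or after the key, and the index wraps past the highest ID):

```python
import bisect

RING = [1, 8, 14, 21, 32, 38, 42, 48, 51, 56]   # node IDs from the example
M = 6                                            # m = 6, so IDs are mod 64

def successor(k):
    """First node whose identifier is equal to or follows k mod 2**M."""
    i = bisect.bisect_left(RING, k % 2**M)
    return RING[i % len(RING)]   # wrap around past the highest ID

print(successor(10), successor(38), successor(54))   # 14 38 56
```

A key numerically above the last node (e.g. 58) wraps around to node 1, which is the circular part of "equal to or follows k".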
50
The Finger Table
The ith entry at node n contains the identity of the first node s that succeeds n by at least 2i-1 on the identifier circle, where 1 ≤ i ≤ m
ith finger s = successor(n+2i-1) modulo 2m
A table entry includes both Chord identifier and IP address of the relevant node
The first finger of node n is its immediate successor which also called the successor
51
Example: The Finger Table
N42 is the first node that succeeds (8 + 2^(6-1)) mod 2^6 = 40
N14 is the first node that succeeds (8 + 2^(1-1)) mod 2^6 = 9
Finger-table entries point to the first node greater than or equal to a distance 2^(i-1) away from the node, for 1 ≤ i ≤ m, modulo 2^m
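Using the same 10-node example ring, the finger rule s = successor((n + 2^(i-1)) mod 2^m) can be evaluated directly. A sketch: `RING` and `M` restate the figure's values, and the printed table reproduces N8's fingers from the example:

```python
import bisect

RING = [1, 8, 14, 21, 32, 38, 42, 48, 51, 56]   # example ring, m = 6
M = 6

def successor(k):
    i = bisect.bisect_left(RING, k % 2**M)
    return RING[i % len(RING)]

def finger_table(n):
    # ith entry: successor((n + 2**(i-1)) mod 2**M), for 1 <= i <= M
    return [successor((n + 2**(i - 1)) % 2**M) for i in range(1, M + 1)]

print(finger_table(8))   # [14, 14, 14, 21, 32, 42], as in the figure
```

Note the first few fingers collapse onto the same node (N14) because the ring is sparse near N8; the later fingers are what give lookups their long hops.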
52
Simple Key Location
Each node only knows how to contact its current successor node on the identifier circle
Lookup uses a number of messages linear in the number of nodes
Example: path taken by a query from node 8 for key 54
53
Chord Operation
Construct the Chord ring
– Decide the value of m
– Map identifiers (0 to 2^m − 1) to nodes or keys
– Identifiers are ordered on an identifier circle modulo 2^m
The first node joins the Chord ring
Node joins
Key lookup: determine the successor of the key
Chord maintenance
54
Scalable Key Location
Key lookup using the finger table
Node n calls find_successor(id) to find the successor node of an identifier id
– If id falls between n and its successor, the successor is the answer
– Otherwise, n searches its finger table for the node n' whose ID most immediately precedes id
The closer n' is to id, the more it will know about the identifier circle in the region of id
Theorem: the number of nodes that must be contacted to find a successor in an N-node network is O(log N)
55
Example: Lookup for Key 54
Finger table of N8: N8+1 → N14; N8+2 → N14; N8+4 → N14; N8+8 → N21; N8+16 → N32; N8+32 → N42
Finger table of N42: N42+1 → N48; N42+2 → N48; N42+4 → N48; N42+8 → N51; N42+16 → N1; N42+32 → N14
Finger table of N51: N51+1 → N56; N51+2 → N56; N51+4 → N56; N51+8 → N1; N51+16 → N8; N51+32 → N21
Distance to cover: 54 − 8 = 46 = 101110₂ = 2^5 + 2^3 + 2^2 + 2^1; the two long hops use the +2^5 and +2^3 fingers
N8 calls closest_preceding_node(K54) and forwards to N42; N42 calls closest_preceding_node(K54) and forwards to N51; N51 finds K54 in (51, 56] and returns its successor N56
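The hops in this example can be reproduced with a small iterative lookup. `RING` and the finger rule restate the figure; the function names follow the slide's pseudocode, but the loop itself is a sketch, not the full Chord node protocol:

```python
import bisect

RING = [1, 8, 14, 21, 32, 38, 42, 48, 51, 56]   # example ring, m = 6
M = 6

def successor(k):
    i = bisect.bisect_left(RING, k % 2**M)
    return RING[i % len(RING)]

def node_successor(n):
    return RING[(RING.index(n) + 1) % len(RING)]

def between_open(x, a, b):
    # x in (a, b) on the circle modulo 2**M
    return (a < x < b) if a < b else (x > a or x < b)

def fingers(n):
    return [successor((n + 2**(i - 1)) % 2**M) for i in range(1, M + 1)]

def closest_preceding_node(n, key):
    for f in reversed(fingers(n)):   # scan fingers farthest-first
        if between_open(f, n, key):
            return f
    return n

def find_successor(n, key):
    path = [n]
    while True:
        succ = node_successor(n)
        if between_open(key, n, succ) or key == succ:  # key in (n, succ]
            path.append(succ)
            return path
        nxt = closest_preceding_node(n, key)
        n = succ if nxt == n else nxt   # cannot get closer: hand to succ
        path.append(n)

print(find_successor(8, 54))   # [8, 42, 51, 56], matching the figure
```

Each hop at least halves the remaining distance on the circle, which is where the O(log N) bound in the theorem comes from.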
56
Node Joins
When node n first starts, it calls
– n.join(n'), which asks n' to find the immediate successor of n, or
– n.create() to create a new Chord network
57
Example: Chord Node Joins
Ring identifiers 0–15 (m = 4); when N14 joins, it takes over the keys less than 14 for which it is now the successor. Finger tables after N14 joins (the entries updated to N14 are those whose interval starts in (10, 14]):

N3:  N3+1 [4, 5) → N5;  N3+2 [5, 7) → N5;  N3+4 [7, 11) → N10;  N3+8 [11, 3) → N14
N5:  N5+1 [6, 7) → N10;  N5+2 [7, 9) → N10;  N5+4 [9, 13) → N10;  N5+8 [13, 5) → N14
N10: N10+1 [11, 12) → N14;  N10+2 [12, 14) → N14;  N10+4 [14, 2) → N14;  N10+8 [2, 10) → N3
N14: N14+1 [15, 0) → N3;  N14+2 [0, 2) → N3;  N14+4 [2, 6) → N3;  N14+8 [6, 14) → N3
58
Node Stabilization
Must ensure that each node's successor pointer is up to date
Each node periodically runs a stabilization protocol in the background, which updates Chord's finger tables and successor pointers
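The join-then-stabilize flow can be sketched with a minimal ring that keeps only successor and predecessor pointers, no finger tables. The method names follow the Chord paper's pseudocode; everything else is an illustrative assumption:

```python
def between(x, a, b):
    # x in (a, b] on the identifier circle
    return (a < x <= b) if a < b else (x > a or x <= b)

class Node:
    def __init__(self, nid):
        self.id = nid
        self.successor = self      # a new ring points at itself
        self.predecessor = None

    def find_successor(self, k):
        n = self
        while not between(k, n.id, n.successor.id):
            n = n.successor
        return n.successor

    def join(self, existing):
        self.successor = existing.find_successor(self.id)

    def stabilize(self):
        # Adopt any node that slipped in between us and our successor,
        # then tell the successor about ourselves.
        x = self.successor.predecessor
        if x is not None and between(x.id, self.id, self.successor.id):
            self.successor = x
        self.successor.notify(self)

    def notify(self, n):
        if self.predecessor is None or between(n.id, self.predecessor.id, self.id):
            self.predecessor = n

n8, n21, n42 = Node(8), Node(21), Node(42)
n21.join(n8)
n42.join(n8)
for _ in range(5):                 # periodic stabilization rounds
    for n in (n8, n21, n42):
        n.stabilize()
print([n8.successor.id, n21.successor.id, n42.successor.id])   # [21, 42, 8]
```

Right after the joins the pointers are inconsistent (both newcomers point at n8); a few stabilize/notify rounds are what repair the ring, which is exactly the role of the background protocol described above.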
59
Chord: Summary
Chord provides
– Efficiency: O(log n) messages per lookup
– Scalability: O(log n) state per node
– Robustness: survives massive failures
Chord consists of
– Consistent hashing
– Small routing tables: O(log n)
– A fast join/leave protocol
60
Part II-4: Hierarchical P2P
61
Outline
Introduction Hierarchical DHT-based P2P
62
Introduction
Traditional P2P systems organize peers into a flat overlay network
We will describe some hierarchical P2P systems
63
Hierarchical DHT-based P2P
64
Hierarchical DHT-based P2P
In a hierarchical DHT, peers are organized into groups, and each group has its own autonomous intra-group overlay network and lookup service
Advantages compared to flat overlay networks
– Exploiting heterogeneous peers: more stable peers in the top-level overlay
– Transparency: keys can move, and a group can change its intra-group lookup algorithm, without affecting the rest of the system
– Faster lookup time: the number of groups << the total number of peers
– Fewer messages in the wide area: most overlay-reconstruction messages happen inside groups
65
Hierarchical DHT Framework (1/3)
66
Hierarchical DHT Framework (2/3)
Hierarchical lookup service
– The querying peer sends a query message to one of the superpeers in its group
– The top-level overlay first determines the group responsible for the key
– The responsible group then uses its intra-group overlay to determine the specific peer that is responsible for the key
Intra-group lookup
– At the intra-group level, groups can use different overlays
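The two lookup steps can be sketched with plain integer IDs. The group IDs and member IDs below are made-up values, and both levels use the same successor rule here only for brevity; in a real hierarchical DHT each group could run a different overlay:

```python
def successor_of(ids, k):
    # First ID equal to or following k, wrapping around the ring.
    ids = sorted(ids)
    return next((i for i in ids if i >= k), ids[0])

def hierarchical_lookup(groups, key_id):
    # Step 1: the top-level overlay maps the key to the responsible group.
    gid = successor_of(list(groups), key_id)
    # Step 2: that group's intra-group overlay maps the key to a peer.
    return gid, successor_of(groups[gid], key_id)

groups = {10: [3, 12, 40], 30: [7, 33], 50: [20, 44]}   # group id -> members
print(hierarchical_lookup(groups, 25))   # (30, 33)
print(hierarchical_lookup(groups, 55))   # (10, 3): wraps around the ring
```

The top-level step runs over the (small) set of groups, which is why the lookup is faster than a flat overlay over all peers.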
67
Part III: Performance Issues of P2P
68
Part III-9: Mobile Issues
69
Introduction
P2P systems in mobile environments encounter several problems
– Heterogeneous node capability
– Limitations: wireless bandwidth and battery power
– Churn
  – Ordinary churn: due to peer joins, departures, and failures
  – Mobility churn: churn caused by peer mobility
70
Performance Comparison of DHT Under Churn
71
Performance Comparison of DHT Under Churn
Observation
– Protocols for DHTs incorporate features to achieve low latency in the face of churn, i.e., continuous changes in membership
– Most previous work evaluates protocols on static networks
This paper
– Formulates a unified framework for evaluating performance and cost
– Analyzes the effects of DHT parameters under churn
72
Framework
A cost-versus-performance framework
– Cost: the average number of bytes of messages sent
– Performance: the average lookup latency
Each protocol has many parameters that affect cost and performance
– There is no single best combination; the best points lie on the convex hull
– Which parameter settings cause performance to lie on the convex hull?
73
Chord (1/2)
Chord identifiers are structured on an identifier circle
A key k is assigned to k's successor
Chord can route either iteratively or recursively
A Chord node will
– Periodically ping all its fingers to check their liveness
– Stabilize its successor list separately, because it is critical for correctness but much cheaper than finger stabilization
74
Chord (2/2)
The successor-stabilization interval affects the success rate
– 72 s for above 99%
The finger-stabilization interval only affects performance (faster rates result in lower lookup latency)
There is no single best base value
75
Summary
The base and the stabilization interval have the most effect on DHT performance under churn, and they affect different protocols in different ways
With well-tuned parameters, all four DHTs have similar overall performance
76
Part IV: Applications of P2P
77
Part IV-6: Wireless
78
P2P over MANET
79
P2P over MANET
What is an ad hoc network?
– No infrastructure: no base stations, no fixed network infrastructure
What is a mobile ad hoc network (MANET)?
– Creates and maintains networks without central entities
– Two mobile nodes communicate with each other through intermediate nodes
– Multi-hop wireless communication
– Needs the support of dynamic routing protocols (network layer)
P2P protocols are usually not aware of the underlying MANET
– Additional and unnecessary network traffic
80
Routing in Peer-to-Peer Networks
Central indexing server: Napster
Flat routing (distributed flooding): Gnutella protocol v0.4
Hierarchical routing (SuperNode): KaZaA, FastTrack
Structured P2P employing a DHT: Chord, Pastry, Tapestry, CAN
81
Routing in Ad Hoc Networks (1/2)
Proactive routing protocols: table-driven (DSDV, CGSR)
Reactive routing protocols: on-demand (DSR, AODV)
Hybrid: Zone Routing Protocol
82
Routing in Ad Hoc Networks (2/2)
Proactive vs. Reactive Routing
Proactive routing protocols
– Continuously evaluate routes and try to maintain consistent, up-to-date routing information
– When a route is needed, one may be ready immediately
– Topology updates are broadcast immediately to all other nodes in the network
Reactive routing protocols
– Try to find a route to the destination only when it is necessary (on demand)
– Flood a route request through the network
83
Similarities between MANET and P2P Networks
No central entities
Flat network topology
– Except SuperNodes or cluster heads (CGSR)
Frequently changing topology
– Frequent log-ons and log-offs
– Terminal mobility in wireless networks
Network log-on
– The IP address or frequency range of the portal must be known
Flooding or broadcasting raises a scalability problem
84
Differences between MANET and P2P Networks
P2P and MANET operate on different network layers
Network structure
– The P2P overlay network is separated from the physical network
– In a MANET, the physical and logical network structures correspond
Connection between two nodes
– P2P: wired, direct links at the overlay layer
– MANET: wireless, indirect links over intermediate nodes
Available resources
– Fixed P2P terminals have nearly unlimited resources
– Nodes in a MANET are mobile and constrained by limited power and bandwidth
85
P2P Searching over MANET
We introduce five route-discovery approaches that integrate broadcast-based and DHT-based protocols
– File request messages at the application layer
– Network routing messages at the network layer
86
Approach 1: Broadcast over Broadcast
– Broadcast at the P2P overlay
– Broadcast at the network layer
– Easy implementation
– Complexity: O(n^2)
Figure: peers A and B, with overlay links distinct from physical links; the flooded route is not the shortest path
Drawbacks: scalability problem (double broadcasts), low energy efficiency, not the shortest path
87
Approach 2: Broadcast
– No P2P overlay
– Broadcast at the network layer
– Easy implementation
– Shortest path obtained
– Complexity: O(n)
Figure: peers A and B over physical links; the actual path is the shortest path
Drawbacks: still flooding, which places a heavy burden on bandwidth and power supply; cannot work for large networks
88
Approach 3: DHT over Broadcast
– DHT at the P2P overlay
– Broadcast at the network layer
– No broadcast at the P2P overlay
– Complexity: O(n log n)
Figure: peers A and B, with overlay links distinct from physical links; the route found is not the shortest path
Drawbacks: implementation complexity (routing tables); O(n) to find the route between every pair, O(log n) peer lookups in the P2P overlay
89
Approach 4: DHT over DHT
– DHT at the P2P overlay
– DHT at the network layer
– No broadcast at the P2P overlay
– Complexity: O((log n)^2)
Figure: the shortest path at the overlay vs. the actual physical path between peers A and B
– Implementation complexity in both networks
– Scalability improvement
– Better complexity and energy efficiency than the previous approaches
90
Approach 5: DHT
– No P2P overlay
– DHT at the network layer
– No broadcast at the P2P overlay
– Complexity: O(log n)
Figure: the shortest path at the network layer vs. the actual path between peers A and B
– Reduces the implementation complexity of DHT over DHT
91
Comparison of Approaches
Approach:          Broadcast over broadcast | Broadcast | DHT over broadcast | DHT over DHT | DHT
Routing:           O(n^2)                   | O(n)      | O(n log n)         | O((log n)^2) | O(log n)
Scalability:       Bad                      | Bad       | Bad                | Good         | Excellent
Maintenance:       Low                      | Low       | Medium             | High         | Medium
Energy efficiency: Low                      | Low       | Low                | Medium       | Medium
Shortest path:     No                       | Yes       | No                 | No           | No
Cross-layer:       No                       | Yes       | No                 | No           | Yes

1 < O(log n) < O((log n)^2) < O(n^(1/2)) < O(n) < O(n log n) < O(n^2)
92
Summary
Cross-layer designs (the Broadcast and DHT approaches) perform better
The Broadcast approach can be easily implemented for small MANETs
The DHT approach is scalable to large networks
– But its routing table and neighborhood table must be carefully maintained
93
Structured P2P Lookup Service in Mobile Networks
94
Structured P2P Lookup Service in Mobile Networks
The Hybrid Chord Protocol (HCP) addresses the frequent joins and leaves of nodes, and allows defining special interest groups
95
Architecture of HCP (1/3)
Static node
– Highly available, large-capacity, quasi-permanent
– Nodes become static nodes based on their uptime and on their hardware and networking capabilities
– All object references (info profiles) are stored at static nodes
Temporary node
– Does not store object references
– When a temporary node joins, all object references for the keys it would be responsible for remain with its closest static successor
96
Architecture of HCP (2/3)
Context spaces
– Every shared object is described by an info profile
– Every keyword indicates a relevant context space for this object
– Each static node stores a list for every keyword it is responsible for
– In this list, all info profiles that contain that keyword are collected
– These lists establish the context spaces
The sharing host sends the info profile to the three static nodes
– The first static successors of the hash values of these keywords
97
Architecture of HCP (3/3)
98
HCP Operations (1/2)
Node join
– Determine the position on the HCP ring by hashing, e.g., the IP address
– Set the predecessor, successor list, and finger-table entries according to the conventional Chord algorithm
– Set the static-successor list (its s closest static successors)
– New static nodes send a message to their closest static successor to get all info profiles they are responsible for
Node leave
– Deregister from the network
– Inform all static nodes that store info profiles owned by the leaving node
– Leaving static nodes transfer all stored info profiles to their closest static successor
99
HCP Operations (2/2)
Key insert
– Shared objects are described by info profiles
– Hash all keywords of each info profile
– Search for the nodes responsible for the info profile
– Contact the ascertained nodes to ask for their first static successor, and send the info profile to these static nodes
Key lookup
– Build the intersection of all context spaces that are relevant for the query
– If temporary nodes receive a request, they forward it to their closest static successor
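The "closest static successor" rule that both node join and key lookup rely on can be sketched as follows. The ring below is a made-up example with hand-picked IDs; real HCP identifiers come from hashing:

```python
def first_static_successor(ring, k):
    """ring: list of (node_id, is_static). Info profiles for key k are
    held by the first static node at or after k on the identifier circle."""
    nodes = sorted(ring)
    n = len(nodes)
    # Index of the plain successor of k (wrapping around the circle).
    start = next((i for i, (nid, _) in enumerate(nodes) if nid >= k), 0)
    for step in range(n):                 # walk forward past temporary nodes
        nid, is_static = nodes[(start + step) % n]
        if is_static:
            return nid
    raise RuntimeError("ring has no static node")

ring = [(5, True), (12, False), (20, True), (33, False)]
print(first_static_successor(ring, 10))   # 20: temporary node 12 is skipped
print(first_static_successor(ring, 21))   # 5: wraps past temporary node 33
```

Because temporary nodes are transparent to this rule, their joins and leaves never move object references, which is the source of HCP's signaling-traffic reduction.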
100
Summary of HCP
HCP reduces the signaling traffic of shifting object references
– But the maintenance traffic stays the same
The traffic-load reduction is proportional to 1/λ = T/T_Static
– Where each node is assumed to leave the network after an average session length T, x·n of the nodes are static (0 < x < 1), and T_Static = λ·T