40
지능정보시스템연구실

지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

지능정보시스템연구실

황 지 민

Page 2: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

I N D E X

10.1 Introduction

10.2 A Taxonomy of Distributed IR System

10.3 Data Partitioning

10.4 Parallel IR

10.5 Cluster-based IR

Page 3: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.1 Introduction

Page 4: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.1 4

Page 5: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.2 A Taxonomy of

Distributed IR Systems

Page 6: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.2 6

표준 검색 병렬검색 SIMD 병렬검색 MIMD

n/a 클러스터 기반

검색 지역 통합 검색

n/a 분산 검색(P2P) 통합(Federated) 검색

(P2P)

Page 7: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.2 7

Page 8: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.2 8

Page 9: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.2 9

Page 10: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 Data Partitioning

Page 11: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 11

색인 항목

문 헌

Page 12: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 12

Page 13: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

10.3 13

Page 14: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

10.3 14

Page 15: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

10.3 15

Page 16: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 16

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 17: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 17

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

𝑤𝑐,𝑖𝑗 = 𝑓𝑐,𝑖𝑗 × 𝐼𝐷𝐹𝑐,𝑖 컬렉션내 용어 𝑘𝑖 의 가중치

컬렉션의 모든 문헌들 내에 있는 용어 𝑘𝑖 의 전체 출현빈도

역 컬렉션 빈도

𝐼𝐷𝐹𝑐,𝑖 = 𝑙𝑜𝑔𝑁𝑐

𝑛𝑐,𝑖 𝑛𝑐,𝑖: 용어 𝑘𝑖 가 출현하는 컬렉션의 수

𝑁𝑐: 전체 컬렉션의 수

Page 18: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 18

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 19: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 19

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 20: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 20

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 21: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 21

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 22: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 22

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 23: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 23

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 24: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 24

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 25: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 25

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 26: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.3 26

1. Collection

partitioning

2. Collection

selection

3. inverted index

partitioning

4. Partitioning

other indexes

Page 27: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.4 Parallel IR

Page 28: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. introduction

2. MIMD

architectures

3. SIMD

architectures

10.4 28

Page 29: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. introduction

2. MIMD

architectures

3. SIMD

architectures

10.4 29

∅ = 1 일때, 이상적인 효율

𝑆:속도향상, 𝑁:프로세서의수

Page 30: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. introduction

2. MIMD

architectures

3. SIMD

architectures

10.4 30

Page 31: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. introduction

2. MIMD

architectures

3. SIMD

architectures

10.4 31

Page 32: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. introduction

2. MIMD

architectures

3. SIMD

architectures

10.4 32

Page 33: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. introduction

2. MIMD

architectures

3. SIMD

architectures

10.4 33

Page 34: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. introduction

2. MIMD

architectures

3. SIMD

architectures

10.4 34

Page 35: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

1. introduction

2. MIMD

architectures

3. SIMD

architectures

10.4 35

Page 36: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.5 Cluster-based IR

Page 37: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.5 37

Page 38: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.5 38

𝐿𝐵 = max𝑖=1

𝑛|𝑙𝑜𝑎𝑑𝑖 − 𝑙|

𝑙

Page 39: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

10.5 39

Page 40: 지능정보시스템연구실 황지민datamining.uos.ac.kr/wp-content/uploads/2016/06/10... · 2016-06-14 · 10.3 17 1. Collection partitioning 2. Collection selection 3. inverted

END