29
Big Data Analysis Process 권정민 ([email protected]) Open Cloud & Big Data 2012

R & big data analysis 20120531

Embed Size (px)

Citation preview

Big Data Analysis Process

권정민

([email protected])

Open Cloud & Big Data 2012

Data Science

There is nothing new

under the sun

Data Engineering

(출처: http://embedded.eecs.berkeley.edu/Alumni/wray/data-eng.html)

Last updated Mon Jan 27 09:26:36 PST 1997

Why ?

muggle wizard

muggle wizard……

……

Limits- Environment

- Functions

- Money

Free- Environment

- Functions

- Money

Rhadoop / Rhipe

Oracle R Enterprise

Rhive

Rhadoop / Rhipe

Oracle R Enterprise

RHive

Big Data Analysis

with R

클라우드로그분석시스템

클라우드로그분석시스템

CDR 분석

Dataraw data summary size

(raw 1 yr +sum2 yrs) 1 Month 1 Month

Wireless

Unrated CDR(VOICE, Data, SMS, MMS)

3.7 2.5 104

Rated CDR 1.5 0.2 22

Wi-Fi 0.4 0.3 12

Wibro 1.5 1.0 42

Wireline Rated CDR 1.5 1.5 55

IPDR IP-TV 1.5 0.1 19

Total 10 5.6 254

Unit : TB[ KT CDR(Call Detail Record) ]

[분석항목]

• 고객 Segmentation

• 위치기반통화품질분석• SNA 분석• Anomaly Detection