71
ETL Pentaho Data Integration [email protected]

Introduction to ETL - Pentaho

Embed Size (px)

DESCRIPTION

สมาคมศูนบ์วิชาการไทย-ออสเตรเลีย การอบรม รุ่นที่ 1

Citation preview

Page 1: Introduction to ETL - Pentaho

ETL

Pentaho Data Integration

[email protected]

Page 2: Introduction to ETL - Pentaho

ETL

stands for extract, transform and load

a process in database usage and especially

in data warehousing that involves:

Extracting data from outside sources

Transforming it to fit operational needs (which can include quality levels)

Loading it into the end target (database or data warehouse)

Page 3: Introduction to ETL - Pentaho

Example

Page 4: Introduction to ETL - Pentaho

Input

Page 5: Introduction to ETL - Pentaho

Transform

Page 6: Introduction to ETL - Pentaho

Load

Page 7: Introduction to ETL - Pentaho

Hands-on Experience

Page 8: Introduction to ETL - Pentaho

DEMO

Page 9: Introduction to ETL - Pentaho

Data Integrationrepository

Page 10: Introduction to ETL - Pentaho

ETL

course2550

Page 11: Introduction to ETL - Pentaho

input

course2550

Page 12: Introduction to ETL - Pentaho

course2550

Create

Page 13: Introduction to ETL - Pentaho

import

Page 14: Introduction to ETL - Pentaho

Browse

Page 15: Introduction to ETL - Pentaho

02course2550_forETL.sql

Page 16: Introduction to ETL - Pentaho
Page 17: Introduction to ETL - Pentaho
Page 18: Introduction to ETL - Pentaho

Data Integration

Page 19: Introduction to ETL - Pentaho

Programs > Pentaho Enterprise Edition

> Design Tools > Data Integration

Page 20: Introduction to ETL - Pentaho

Repository Connection

Page 21: Introduction to ETL - Pentaho
Page 22: Introduction to ETL - Pentaho
Page 23: Introduction to ETL - Pentaho
Page 24: Introduction to ETL - Pentaho
Page 25: Introduction to ETL - Pentaho
Page 26: Introduction to ETL - Pentaho
Page 27: Introduction to ETL - Pentaho
Page 28: Introduction to ETL - Pentaho
Page 29: Introduction to ETL - Pentaho

Repository

Login

Page 30: Introduction to ETL - Pentaho

File > New > Transformation

Page 31: Introduction to ETL - Pentaho

?

?

Page 32: Introduction to ETL - Pentaho

File > New > Database Connection

Page 33: Introduction to ETL - Pentaho

ETL

Page 34: Introduction to ETL - Pentaho

Input > Table input

Page 35: Introduction to ETL - Pentaho

Double Click Table input

Table input > at_course_enrollment

Get SQL select statement…

Page 36: Introduction to ETL - Pentaho

at_course_enrollment

Page 37: Introduction to ETL - Pentaho

ORDER BY course_id

Page 38: Introduction to ETL - Pentaho

Statistics > Group by

Shift

Page 39: Introduction to ETL - Pentaho

Double Click Group by

Get Fields

Get lookup fields

Page 40: Introduction to ETL - Pentaho

Order By ?

Page 41: Introduction to ETL - Pentaho

Group by ?

Aggregates /

Page 42: Introduction to ETL - Pentaho

Transform > Select values

Page 43: Introduction to ETL - Pentaho

Double Click Select values

Get fields to select

Rename to

member_id num_enroll

Page 44: Introduction to ETL - Pentaho

Selected Value ?

Field

Page 45: Introduction to ETL - Pentaho

Input > Table input

Page 46: Introduction to ETL - Pentaho

Double Click Table input

Table input > at_courses

Get SQL select statement…

Page 47: Introduction to ETL - Pentaho

at_courses

Page 48: Introduction to ETL - Pentaho

ORDER BY course_id

Page 49: Introduction to ETL - Pentaho

Joins > Merge Join

Page 50: Introduction to ETL - Pentaho

Double Click Merge Join

Get key fields Keys

Page 51: Introduction to ETL - Pentaho

Order By ?

Page 52: Introduction to ETL - Pentaho

Transform > Select values

Page 53: Introduction to ETL - Pentaho

Double Click Select values 2

Remove

Fieldname course_id_1

Page 54: Introduction to ETL - Pentaho

Selected Value 2 ?

Field

Page 55: Introduction to ETL - Pentaho

Output > Table output

Page 56: Introduction to ETL - Pentaho

?

o_course

course2550

Page 57: Introduction to ETL - Pentaho

Double Click Table output

Browse Target Table

o_course

Page 58: Introduction to ETL - Pentaho

at_courses ( )

DDL > Use Current Connection

Page 59: Introduction to ETL - Pentaho

SQL

Execute

Page 60: Introduction to ETL - Pentaho

Refresh

o_course

Page 61: Introduction to ETL - Pentaho

Truncate table

OK

Page 62: Introduction to ETL - Pentaho
Page 63: Introduction to ETL - Pentaho

Action > Run

Launch

Page 64: Introduction to ETL - Pentaho

Save

Page 65: Introduction to ETL - Pentaho
Page 66: Introduction to ETL - Pentaho

MySQL

o_course

Page 67: Introduction to ETL - Pentaho

feature

Page 68: Introduction to ETL - Pentaho

Excel Output

Page 69: Introduction to ETL - Pentaho
Page 70: Introduction to ETL - Pentaho
Page 71: Introduction to ETL - Pentaho

Google Docs