Kaggle bosch presentation material for Kaggle Tokyo Meetup #2

Preview:

Citation preview

Bosch Production Line Performance 2017/2/4hskksk

1

• Result•

2

RCObosch production line performance

3

RCO

• R, Python, C++

4

xgboost(R) fwrite_libsvmxgboost ( )R

http://www.slideshare.net/hskksk/libsvm

5

: : 2016/8/17 - 2016/11/12

: Matthews correlation coefficient

6

Lx_Sy_Dz Lx_Sy_F{z-1} 7

8

0: 1,176,868 (99.4%)1: 6,879 ( 0.6%)

extremely imbalanced data

9

Result

10

• g_votte

• tkm

• hskksk( )

11

LB

12

Public Leaderboard

13

Private Leaderboard

14

Top Ten !

15

16

• LB(CV )

• ( ) ↑

• xgboost dart

17

Feature engineering

18

• 25

• 3154

19

1. ID

• Forum magic feature

2.

3.

•20

xgboost importance

• = 1

• = 3

21

• ID

22

• ID

23

Station 38

• Station 38!!

• IDStation 38 NA

24

ID

25

• bitmap( 17017 )

• bitmap

26

27

28

29

30

31

32

• hskksk Line2 tkm Line0

33

• 3 fold 1

• MCC LB Feedback

• tkm g_votte

• LB Feedback34

Public Private• tkm submit Public

Score Private

35

• submit

• mcc

36

kaggle•

• Accuracy confusion matrix

• think more, try less

37

Enjoy Kaggle!

38