36
信信信信信信信 信信E123502191 信信信 N954020003 信信信 M965040023 信信信

信用卡風險分析 組員: E123502191 張智欽 N954020003 于亨宗 M965040023 鍾友華

  • View
    248

  • Download
    0

Embed Size (px)

Citation preview

  • Slide 1
  • E123502191 N954020003 M965040023
  • Slide 2
  • Introduction 79
  • Slide 3
  • Introduction
  • Slide 4
  • Introduction
  • Slide 5
  • Slide 6
  • Determination of data set
  • Slide 7
  • Data mining procedure Berry Linoff 10 10
  • Slide 8
  • Step 1. Translate the business problem into a data mining problem
  • Slide 9
  • Step 2. Select appropriate data 4117
  • Slide 10
  • Step 3. Get to know the data ID m/f single/married/divsepwid weekly/monthly y/n good risk bad risk bad profit
  • Slide 11
  • Step4. create a model set 4117 6 4 2455 1662 Overfitting
  • Slide 12
  • Step5. fix problems with the data 4117
  • Slide 13
  • Step5. fix problems with the data ( )
  • Slide 14
  • Step6. transform data to bring information to the surface
  • Slide 15
  • Step6. transform data to bring information to the surface
  • Slide 16
  • Step7. Build models (Decision Tree)
  • Slide 17
  • Step7. Build models (Neural Network)
  • Slide 18
  • Step7. Build models (Logistic)
  • Slide 19
  • Step8. Assess models 1662
  • Slide 20
  • Correct1,87776.46% Wrong57823.54% Total2,455 Correct1,20472.44% Wrong45827.56% Total1,662
  • Slide 21
  • - training bad lossbad profitgood risk bad loss Count31117474 Row %55.6431.1313.24 Total %12.677.093.01 bad profit Count84131774 Row %5.6989.295.02 Total %3.4253.653.01 good risk Count43129249 Row %10.2130.6459.14 Total %1.755.2510.14
  • Slide 22
  • - testing bad lossbad profitgood risk bad loss Count18113135 Row %52.1637.7510.09 Total %10.897.882.11 bad profit Count7582631 Row %8.0588.633.33 Total %4.5149.71.87 good risk Count40146197 Row %10.4438.1251.44 Total %2.418.7811.85
  • Slide 23
  • 1662
  • Slide 24
  • Correct1,78772.79% Wrong66827.21% Total2,455 Correct1,31479.06% Wrong34820.94% Total1,662
  • Slide 25
  • - training bad lossbad profitgood risk bad loss Count19127692 Row %34.1749.3716.46 Total %7.7811.243.75 bad profit Count251334116 Row %1.6990.447.86 Total %1.0254.344.73 good risk Count6153262 Row %1.4336.3462.23 Total %0.246.2310.67
  • Slide 26
  • - testing bad lossbad profitgood risk bad loss Count14717426 Row %42.3650.147.49 Total %8.8410.471.56 bad profit Count1490315 Row %1.596.891.61 Total %0.8454.330.9 good risk Count10109264 Row %2.6128.4668.93 Total %0.66.5615.88
  • Slide 27
  • 1662
  • Slide 28
  • Correct1,74070.88% Wrong71529.12% Total2,455 Correct1,22973.95% Wrong43326.05% Total1,662
  • Slide 29
  • - training bad lossbad profitgood risk bad loss Count23325472 Row %41.6845.4412.88 Total %9.4910.352.93 bad profit Count91130579 Row %6.1788.475.36 Total %3.7153.163.22 good risk Count37182202 Row %8.7943.2347.98 Total %1.517.418.23
  • Slide 30
  • - testing bad lossbad profitgood risk bad loss Count17715020 Row %51.0143.235.76 Total %10.659.031.2 bad profit Count578678 Row %6.1293.030.86 Total %3.4352.170.48 good risk Count32166185 Row %8.3643.3448.3 Total %1.939.9911.13
  • Slide 31
  • Step9. Deploy models 1204 72.44
  • Slide 32
  • 1314 79.06 0.333 0.209 0.207 0.187 0.158 0.156 0.05 0.048 0.046
  • Slide 33
  • 1662 1229 73.95 Equation For good risk 0.01426 * + 0.00001368 * 0.324 * 0.4787 * 0.3575 * 0.1567 * [ =f] + 5.251 * [ =divsepwid] 0.0184 * [ =married] + 0.8561 * [ =monthly] 0.3472 * [ =n] + 1.572 * [ =0] + 1.314 * [ =1] 0.01788 * [ =2] 0.1962 Equation For bad profit 0.06256 * 0.00002478 * 0.1728 * 0.416 * 0.4008 * + 0.1433 * [ =f] + 2.158 * [ =divsepwid] 0.348 * [ =married] 0.01668 * [ =monthly] + 0.0883 * [ =n] 1.382 * [ =0] + 0.4091 * [ =1] + 0.2016 * [ =2] + 5.602
  • Slide 34
  • Step10 Assess results bad risk > good risk 3.01 >2.11 3.15 1.56 2.93 1.2
  • Slide 35
  • Conclusion 1. - 2. bad loss good risk 3.
  • Slide 36
  • Limit 1. 2.