26
Introduction Welcome Machine Learning

Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Introduction

Welcome

Machine Learning

Page 2: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Page 3: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

SPAM

Page 4: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

기계학습(Machine Learning)- AI 의한분야로성장

- 컴퓨터의새로운능력

예: - 데이터베이스탐색(mining)

자동화/웹의성장으로부터의거대한자료. 즉, 웹클릭자료, 의료기록,생물학,공학

- 손으로프로그래밍할수없는응용즉, 자동헬리콥터, 필체인식, 대부분의자연어처리, 컴퓨터비전.

-자가-조절프로그래밍즉, Amazon, Netflix 제품추천

- 사람학습의이해 (뇌, 실제 AI).

Page 5: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Introduction

What is machine learning

Machine Learning

Page 6: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

• Arthur Samuel (1959). 기계학습 : 컴퓨터에게

명시적인프로그램없이학습능력을 제공하는것에대한학문분야

Machine Learning definition

Page 7: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

• Arthur Samuel (1959). 기계학습 : 컴퓨터에게

명시적인프로그램없이학습능력을 제공하는것에대한학문분야.

• Tom Mitchell (1998) 잘-구성된 학습문제:

성능(performance) P로측정되는임무(task) T에 대한성능이경험(experience) E에따라향상된다면, 컴퓨터프로그램은어떤임무 T 와성능측정값 P 에대한경험E로부터 학습하도록한다.

Machine Learning definition

Page 8: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

이메일을스팸 또는스팸아님 으로분류

당신이 이메일을 스팸 또는 스팸아님으로 라벨하는 것을 관찰.

스팸/스펨아님으로 정확히분류된이메일의 수(또는 비율)

위의어느 것도아님 – 이것은기계학습 문제가아님.

당신의이메일프로그램이당신이어느이메일을스팸으로또는스팸이아닌것으로표시하는것을관찰하고, 그것에기초하여스팸을필터하는더 나은 방법을 학습한다고 가정하자. 이러한설정에서임무 T 는 무엇인가?

“성능 P(performance)로 측정되는 임무 T(task)에 대한 성능이경험 E(experience)에 따라서 향상된다면, 컴퓨터 프로그램은 어떤임무 T와 성능 측정 값 P에 대한 경험 E로부터 학습을 한다고말한다.”

Page 9: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

기계학습 알고리즘:

- 감독학습(Supervised learning)

- 무감독학습(Unsupervised learning)

Others: 강화(Reinforcement) 학습, 추천시스템.

Also talk about: 학습알고리즘을 적용하는데 있어서실제적인 충고.

Page 10: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Introduction

Supervised Learning

Machine Learning

Page 11: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

0

100

200

300

400

0 500 1000 1500 2000 2500

집 가격 예측

Price ($) in 1000’s

Size in feet2 ( ≈ 0.09 meter2 )

회귀(Regression): 연속된값을

갖는출력(가격)을예측

감독학습(Supervised Learning)

“정확한답들이” 주어짐

Page 12: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

유방암 (악성(Malignant), 양성(Benign))

분류

이산적인값출력 (0 or 1)악성?

1(Y)

0(N)

종양크기(Tumor Size)

종양 크기(Tumor Size)

Page 13: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

종양 크기

나이

- 덩어리 두께

- 세포 크기의 균일함

- 세포 모양의 균일함 …

Page 14: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

둘다 분류문제로다룬다.

문제 1은 분류문제로, 문제2는 회귀문제로 다룬다.

문제 1은 회귀문제로, 문제2는분류문제로다룬다.

둘다 회귀문제로다룬다.

당신은 회사를 운영하고 있으며, 두 가지 문제를 해결할 학습 알고리즘을개발하고자 합니다.

Problem 1: 당신은 동일한품목의 큰 재고가 있습니다. 당신은 이 제품들이 다음3개월 동안 얼마나 팔릴 지를 예측하고 싶습니다. Problem 2: 당신은개별고객의 계정을살펴보고, 각계정에 대하여해킹됐는지/손상됐는지를 결정하는 소프트웨어를 원합니다.

이들을 회귀문제로 다루어야하나, 또는 분류문제로 다루어야하는가?

Page 15: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Introduction

Unsupervised Learning

Machine Learning

Page 16: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

x1

x2

감독학습(Supervised Learning)

Page 17: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

무감독학습(Unsupervised Learning)

x1

x2

Page 18: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Page 19: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Page 20: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng[Source: Daphne Koller]

Gen

es

Individuals

Page 21: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng[Source: Daphne Koller]

Gen

es

Individuals

Page 22: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Organize computing clusters Social network analysis

Image credit: NASA/JPL-Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Astronomical data analysisMarket segmentation

Page 23: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Cocktail party problem

Microphone #1

Microphone #2

Speaker #1

Speaker #2

Page 24: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng[Audio clips courtesy of Te-Won Lee.]

Microphone #1:

Microphone #2:

Microphone #1:

Microphone #2:

Output #1:

Output #2:

Output #1:

Output #2:

Page 25: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

Andrew Ng

Cocktail party problem algorithm

[W,s,v] = svd((repmat(sum(x.*x,1),size(x,1),1).*x)*x');

[Source: Sam Roweis, Yair Weiss & Eero Simoncelli]

Page 26: Introduction to Programming - Jun Jijun.hansung.ac.kr/ML/docs-slides-Lecture1-kr.pdf · 2016-09-01 · Introduction Supervised Learning Machine Learning. Andrew Ng 0 100 200 300 400

다음의예에서, 무감독학습알고리즘을사용해야 하는것은? (해당하는것들을모두고르시오.)

고객 자료의 데이타베이스가주어진 경우, 자동적인 시장 부문 발견과, 시장 부문별로 고객들은 그룹짖기.

스팸/스팸아님 으로 라벨된 이메일들이 주어진 경우, 스팸필터 학습.

웹에서 찾아진 뉴스기사들이 주어진 경우, 같은 소식에 대한기사들끼리 그룹 짖기.

당뇨병이거나 또는 아닌 것으로 진단된 환자들의 자료들이 주어진경우, 새로운 환자를 당뇨병 유무로 분류하는 방법 학습.