5) 6) 7) 8) 9) INTRODUCTION TO - Courses · 2018-09-20 · BAYESIAN NETWORKS • A Bayesian network...

v=1v=–1 v=–1 v=–1

v=–1 v=1

v=–1

optimaalin

en peli

5) 6) 7) 8) 9)

I N T R O D U C T I O N T O A R T I F I C I A L I N T E L L I G E N C E

D A TA 1 5 0 0 1

E P I S O D E 6 : B A Y E S I A N N E T W O R K S

1. N E T W O R K S T R U C T U R E S

2. C A R E X A M P L E

3. I N F E R E N C E ( E X A C T A N D A P P R O X I M AT E )

T O D A Y ’ S M E N U

B AY E S I A N N E T W O R K S

• A Bayesian network is a representation of a probabilistic model

• The nodes of the network (X, Y, Z, Å) are random variables (r.v.) such as the result of a die, or a medical condition, ...

• The edges correspond to direct dependency: no edge ⇔ conditional independence (exact definition will be studied in DATA12002 Probabilistic Graphical Models)

• Each r.v. is given a conditional distribution of the form P(V = v | PaV = pav), where PaV are the parents of node V

• No directed cycles allowed

• Joint probabilities are obtained as P(x,y,z,å) = P(x) P(y) P(z | x,y) P(å | x)

PARENTSZ

• No directed cycles allowed

• Joint probabilities are obtained as P(x,y,z,å) = P(x) P(y) P(z | x,y) P(å | x)

• Compare this with the chain rule P(x,y,z,å) = P(x) P(y | x) P(z | x,y) P(å | x,y,z)

c o n d i t i o n a l i n d e p e n d e n c e !

• The power of BNs: – easier to define conditional distributions, e.g.,

P(å | x) rather than P(å | x,y,z) – efficient inference procedures for computing posterior

probabilities

E X A M P L E : C A R P R O B L E M S ?

BATTERY

IGNITION GAS

STARTS

• If the battery is dead, no radio and no ignition

• If there's no ignition, the car won't start

• If there's no gas, the car won't start

• If the car won't start, it won't move

• Car won't move: where is the problem? P(state | obs)

• Music on the radio? Gas meter? <– obs

BATTERY

IGNITION GAS

STARTS

[R.I.P. Chester Bennington (1976–2017)]

BATTERY

IGNITION GAS

STARTS

q u i t e s u r e ?

BATTERY

IGNITION GAS

STARTS

9 5 % p r o b .

BATTERY

IGNITION GAS

STARTS

9 5 % p r o b .9 0 % p r o b . 9 9 % p r o b .

9 9 % p r o b .

9 0 % p r o b .

9 5 % p r o b .

• P(“battery alive”) = 0.9

• P(“radio ok” | “battery alive”) = 0.9P(“radio ok” | ¬”battery alive”) = 0

• p(“ignition” | “battery alive”) = 0.95P(“ignition” | ¬”battery alive”) = 0

• p(“gas”) = 0.95

• p(“starts” | “ignition” AND “gas”) = 0.99p(“starts” | ¬”ignition” OR ¬”gas”) = 0

• p(“moves” | “starts”) = 0.99p(“moves” | ¬”starts”) = 0

• P(“battery alive” | ¬“starts” AND “radio ok” AND "gas") = ?

• Exact approach: P(B,¬S,R,G) P(B | ¬S,R,G) = ----------- P(¬S,R,G) P(B,¬S,R,G) = P(B,R,I,G,¬S,M) + P(B,R,I,G,¬S,¬M) + P(B,R,¬I,G,¬S,M) + P(B,R,¬I,G,¬S,¬M)

• Again, the probability of an event, (B,¬S,R,G), is a sum of atomic (elementary) event probabilities

• The atomic event probabilities are conveniently obtained from the Bayesian network, e.g.,P(B,¬S,R,G) = P(B,R,I,G,¬S,M) + P(B,R,I,G,¬S,¬M) + P(B,R,¬I,G,¬S,M) + P(B,R,¬I,G,¬S,¬M) P(B,R,I,G,¬S,M) = P(B) P(R|B) P(I|B) P(G) P(¬S|I,G) P(M|¬S) = 0.9 · 0.9 · 0.95 · 0.95 · 0.01 · 0.0

• Note that the product has terms of the form P(V | PaV)

• This gives a numerical value for P(B,¬S,R,G)

• A similar sum yields P(¬S,R,G)

• This direct approach always gives the exact solution

• However, the sums can quickly become very large (no. of terms is exponential in the size of the network)

• More clever inference algorithms exploit the structure of the network

• For example, in tree-shaped networks (any two nodes are connected by at most one path), belief propagation runs in linear time wrt. number of nodes

• These algorithms are not discussed on this course

• Instead of exact inference algorithms, we take a "hackers approach" to probability

• The probability of any event can be approximated by the Monte Carlo method / sampling: repeat the trial many times and calculate the relative frequency of the event

• E.g., toss a coin 106 times: P(heads) ≈ #heads / #tosses

• To approximate conditional probability P(A | B):

1. generate N tuples (A, B)

2. discard all but those where B occurs

3. among the remaining tuples, calculate the portion where A occurs

A P P R O X I M AT E I N F E R E N C E

• In the car problem, to approximate P(B| ¬S, R, G):

1. generate N cases (tuples) from the car BN

2. choose tuples where car doesn't start, radio ok, gas

3. calculate the portion of these where battery is alive

• As N → ∞, the approximation converges to the exact value

generate_tuples(N, model): for i = 1 to N: v = empty array for V in model.variables: pa = v[V.Pa] # parents of V v.append(sample(V.CPT(pa))) output v

c o n d i t i o n a l p r o b a b i l i t y t a b l e

• generate N cases (tuples) from the car BN

• choose tuples where car doesn't start, radio ok, gas

• calculate the portion of these where battery is alive

v = [] V = 'B' V.Pa = pa = [] V.CPT(pa) = [0.1, 0.9]

v = [1] V = 'R' V.Pa = 'B' pa = [1] V.CPT(pa) = [0.1, 0.9]

C P T o f ' R a d i o ' : ( 1 . 0 , 0 . 0 ) i f B a t t e r y = 0 ( 0 . 1 , 0 . 9 ) i f B a t t e r y = 1

v = [1,1] V = 'I' V.Pa = 'B' pa = [1] V.CPT(pa) = [0.05, 0.95]

v = [1,1,1] V = 'G' V.Pa = pa = [] V.CPT(pa) = [0.05, 0.95]

v = [1,1,1,1] V = 'S' V.Pa = 'I,G' pa = [1,1] V.CPT(pa) = [0.01, 0.99]

C P T o f ' S t a r t s ' : ( 1 . 0 0 , 0 . 0 0 ) i f I g n i t i o n = 0 , G a s = 0 ( 1 . 0 0 , 0 . 0 0 ) i f I g n i t i o n = 0 , G a s = 1 ( 1 . 0 0 , 0 . 0 0 ) i f I g n i t i o n = 1 , G a s = 0 ( 0 . 0 1 , 0 . 9 9 ) i f I g n i t i o n = 1 , G a s = 1

v = [1,1,1,1,1] V = 'M' V.Pa = 'S' pa = [1] V.CPT(pa) = [0.01, 0.99]

v = [1,1,1,1,1,1]

v = [1,1,1,0,0,0]

v = [1,1,0,1,0,0]

• Spam filter! (and million other naive Bayes classifiers)

• Dynamic Bayesian networks for ecological modelling

• Medical diagnostics (causal factors –> disease status –> symptoms)

• Player matching: Microsoft TrueSkillTM

(well, factors graphs really, but closely related graphical models)

B AY E S I A N N E T W O R K A P P L I C AT I O N S

Source: R. Herbrich, T. Minka, T. Graepel, "TrueSkillTM: A Bayesian Skill Rating System", NIPS-2006

• Spam filter! (and million other naive Bayes classifiers)

• Dynamic Bayesian networks for ecological modelling

• Medical diagnostics (causal factors –> disease status –> symptoms)

• Player matching: Microsoft TrueSkillTM

(well factors graphs really, but closely related graphical models)

• Error correcting codes ("Turbo codes", e.g., Mars mission)

• Football score prediction

• ...

B AY E S I A N N E T W O R K A P P L I C AT I O N S

1. N E T W O R K S T R U C T U R E S

2. C A R E X A M P L E

3. I N F E R E N C E ( E X A C T A N D A P P R O X I M AT E )

S U M M A R YZ

[1,1,1,1,1,1][1,1,1,0,0,0] [1,1,0,1,0,0] ⋮

P(B,¬S,R,G) P(B | ¬S,R,G) = ----------- P(¬S,R,G)

N E X T W E E K : M A C H I N E L E A R N I N G

5) 6) 7) 8) 9) INTRODUCTION TO - Courses · 2018-09-20 · BAYESIAN NETWORKS • A Bayesian network...

Documents

Web Intrusion Detection with Bayesian Network by Kanatoko AVTokyo 2013.5 English Slide

Klasi kasi Emosi Pada Twitter Menggunakan Bayesian Network · Klasi kasi Emosi Pada Twitter Menggunakan Bayesian Network Mohamad Syahrul Mubarok, Muhammad Surya Asriadie and Adiwijaya

Naive Bayesian and Bayesian Network

Bayesian Inference - Wellcome Trust Centre for …€¢ Some probability densities/distributions • Probabilistic (generative) models • Bayesian inference • A simple example –

Bayesian Network · 2020. 2. 19. · Definition: Bayesian Networks • The Bayesian network consists of the following. – A set of n variables X = {X 1, X 2, …, X n} and a set

Machine Learning Probabilistic Machine Learning · Machine Learning Probabilistic Machine Learning learning as inference, Bayesian Kernel Ridge regression = Gaussian Processes, Bayesian

A Generic Bayesian Network for Identiﬁcation and ...eturwg.c4i.gmu.edu/sites/default/files/sites/...Figure 1: Bayesian ID Network without tree structure Graph (DAG) with each node

Satzung zur Änderung der Studien- und Prüfungs- ordnung ... · Praktikum Maschinelles Lernen 9 mündlich ja 1.0 Probabilistic and Bayesian Modelling in ML and AI 6 mündlich ja

OptimizationofCognitiveWireless ...paduaresearch.cab.unipd.it/3720/1/PhD_thesis.pdfList of Acronyms BIC Bayesian Information Criterion BN Bayesian Network CA Cognitive Agent CEF Cognitive

IMPLEMENTASI METODE BAYESIAN NETWORK UNTUK SISTEM

Bayesian Network

Learning Bayesian Network Structure from Massive Datasets: The ``Sparse Candidate'' Algorithm

Waikato Environment for Knowledge Analysiskuze-lab/RS2016/Weka_Lecture_Slides_161026.pdf · ・ベイジアンネットワーク Bayesian network ・ロジスティック識別 Logistic

bayesian bayesian network

Using Relevance Feedback in Bayesian Probabilistic Mixture ... · [8] Donna Harman, “Relevance Feedback Revisited”, In Proceedings of the 15th Annual International ACM SIGIR In

Bayesian Network Meta-Analysis in BUGS - REES FRANCErees-france.com/wp-content/uploads/2015/12/3_MTC_WinBUGS_SFES… · 1 Bayesian Network Meta-Analysis in BUGS 21 January 2016 Thi

Probabilistic Neural Network (PNN)

A P2P flow Identification Model Based On Bayesian Network

Bayesian Network By DengKe Dong. Key Points Today Intro to Graphical Model Conditional Independence Intro to Bayesian Network Reasoning BN: D-Separation

Lecture 7: Network Inference · 2019. 11. 26. · Bayesian Inference vs. MLE (Cont.) • In our example, MLE and Bayesian prediction differ. • However, If prior is well-behaved