120
Bayesian networks Ontologies BNs and Ontologies Int´ egration de connaissances ontologiques pour l’apprentissage des r´ eseaux bay´ esiens Montassar Ben Messaoud 12 , Mouna Ben Ishak 12 , Philippe Leray 2 , Nahla Ben Amor 1 1 LARODEC, ISG, Universit´ e de Tunis, Tunisie 2 Connaissances et D´ ecision, LINA UMR 6241, Nantes, France Colloque Apprentissage Artificiel & Fouille de Donn´ ees 28 juin 2012, Univ. Paris 13 Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Amor Apprentissage RB et Ontologies 1/22

Ontological knowledge integration for Bayesian network structure learning

Embed Size (px)

DESCRIPTION

Intégration de connaissances ontologiques pour l'apprentissage des réseaux bayésiens 5èmes Journées thématiques "Apprentissage Artificiel & Fouille de Données", 28 juin 2012, Université Paris 13, France. (title in French, but slides english)

Citation preview

Page 1: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Integration de connaissances ontologiquespour l’apprentissage des reseaux bayesiens

Montassar Ben Messaoud12, Mouna Ben Ishak12,Philippe Leray2, Nahla Ben Amor1

1LARODEC, ISG, Universite de Tunis, Tunisie2Connaissances et Decision, LINA UMR 6241, Nantes, France

Colloque Apprentissage Artificiel & Fouille de Donnees28 juin 2012, Univ. Paris 13

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 1/22

Page 2: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Motivations

Knowledge-based systems aim to make expertise available fordecision making and information sharing

To resolve some complex problems, the combination ofdifferent knowledge-based systems can be very powerful

Our idea

Combine Two KBSs : Bayesian Networks and Ontologies, to helpboth

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 2/22

Page 3: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Motivations

Knowledge-based systems aim to make expertise available fordecision making and information sharing

To resolve some complex problems, the combination ofdifferent knowledge-based systems can be very powerful

Our idea

Combine Two KBSs : Bayesian Networks and Ontologies, to helpboth

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 2/22

Page 4: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Motivations

Knowledge-based systems aim to make expertise available fordecision making and information sharing

To resolve some complex problems, the combination ofdifferent knowledge-based systems can be very powerful

Our idea

Combine Two KBSs : Bayesian Networks and Ontologies, to helpboth

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 2/22

Page 5: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Outline ...

1 Bayesian networksBN definitionBN learning

2 OntologiesOntology definitionOntology learning

3 BNs and OntologiesExisting workOur proposal

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 3/22

Page 6: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Bayesian network definition

Definition [Pearl, 1985]

A Bayesian network (BN) is defined by

one qualitative description of (conditional) dependences /independences between variables

directed acyclic graph (DAG)one quantitative description of these dependences

conditional probability distributions (CPDs)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 4/22

Page 7: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Example

one topological order : B,E ,A,R,T (not unique)

R Radio

E Earthquake

A Alarm

B Burglary

T TV

P(Alarm|Burglary,Earthquake)

Burglary,Earthquake = Y,Y Y,N N,Y N,N

Alarm=Y 0.75 0.10 0.99 0.10Alarm=N 0.25 0.90 0.01 0.90

P(TV|Radio)

Radio = Y N

TV=Y 0.99 0.50TV=N 0.01 0.50

P(Radio|Earthquake)

Earthquake = Y N

Radio=Y 0.99 0.01Radio=N 0.01 0.99

P(Burglary)=[0.001 0.999] P(Earthquake)=[0.0001 0.9999]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/22

Page 8: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Consequence

Chain rule

P(S) = P(S1)× P(S2|S1)× P(S3|S1, S2)× · · · × P(Sn|S1 . . . Sn−1)

Consequence with a BN

P(Si |S1 . . . Si−1) = P(Si |parents(Si )) so

P(S) = Πni=1P(Si |parents(Si ))

The (global) joint probability distribution is decomposed in aproduct of (local) conditional distributions

BN = compact representation of the joint distribution P(S)given some information about dependence relationshipsbetween variables

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 6/22

Page 9: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Consequence

Chain rule

P(S) = P(S1)× P(S2|S1)× P(S3|S1, S2)× · · · × P(Sn|S1 . . . Sn−1)

Consequence with a BN

P(Si |S1 . . . Si−1) = P(Si |parents(Si )) so

P(S) = Πni=1P(Si |parents(Si ))

The (global) joint probability distribution is decomposed in aproduct of (local) conditional distributions

BN = compact representation of the joint distribution P(S)given some information about dependence relationshipsbetween variables

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 6/22

Page 10: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Markov equivalence

Definition

B1 and B2 are Markov equivalent iff both describe exactly thesame conditional (in)dependence statements

Graphical properties

B1 and B2 have the same skeleton, V-structures and inferrededges

All the equivalent graphs (= equivalence class) can besummarized by one partially directed DAG named CPDAG orEssential Graph

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 7/22

Page 11: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Markov equivalence

Definition

B1 and B2 are Markov equivalent iff both describe exactly thesame conditional (in)dependence statements

Graphical properties

B1 and B2 have the same skeleton, V-structures and inferrededges

All the equivalent graphs (= equivalence class) can besummarized by one partially directed DAG named CPDAG orEssential Graph

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 7/22

Page 12: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Markov equivalence

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/22

Page 13: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

BN structure learning

How to build / learn a Bayesian Network ?

1 DAG is known, how to determine the CPDs ?

from experts : knowledge elicitationfrom complete data / incomplete data

2 DAG is unknown, how to determine it (or the CPDAG) ?

from complete data / incomplete data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 9/22

Page 14: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Structure learning is a complex task

Size of the ”solution” space

The number of possible DAGs with n variables issuper-exponential w.r.t n (Robinson 77)

NS(n) =

{1 , n = 0 or 1∑n

i=1(−1)i+1(ni

)2i(n−1)NS(n − i), n > 1

NS(5) = 29281 NS(10) = 4.2× 1018

An exhaustive search is impossible !

One thousand millenniums = 3.2× 1015 seconds

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 10/22

Page 15: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Structure learning algorithms

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize a scoringfunction

Hybrid methods more

Some issues

Identifiability : learning algorithms can’t distinguish between anystructure in the same equivalence class

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 11/22

Page 16: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Structure learning algorithms

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize a scoringfunction

Hybrid methods more

Some issues

Learning algorithms can deal with a priori knowledge in order toreduce search space : white list, black list ,node ordering,repetition of local structures ...

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 11/22

Page 17: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Outline ...

1 Bayesian networksBN definitionBN learning

2 OntologiesOntology definitionOntology learning

3 BNs and OntologiesExisting workOur proposal

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 12/22

Page 18: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Ontology

Basics

Definition [Gruber, 1993]

shared understanding within a community of peopledeclarative specification of entities and their relationships witheach otherseparate the domain knowledge from the operationalknowledge

Construction/Evolution: expertise and/or machine learning

Reasoning: description logic reasoners

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 13/22

Page 19: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide

TsunamiFire

Catastrophes

Volcano

Earthquake

Flood

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 14/22

Page 20: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide

TsunamiFire

Catastrophes

Volcano

Earthquake

FloodNameLocalizationDateNb_killed

NameMgnitudeLocalizationDateNb_killed

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 14/22

Page 21: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements) Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 14/22

Page 22: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Causes

Causes Causes

CausesCausesCauses

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 14/22

Page 23: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Aleppo Shaanxi Haiti

is-instanceis-instance is-instance

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 14/22

Page 24: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Aleppo Shaanxi Haiti

is-instanceis-instance is-instance

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 14/22

Page 25: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Ontology learning : subtasks

Ontology population

Get new instances of concept(s) already present in the ontology

Ontology enrichment

Update (add or modify) concepts, properties and relations in agiven ontology

Evolution vs. revolution

Evolution (ontology continuity): Add new knowledge

Revolution (ontology discontinuity): Modify existingknowledge

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/22

Page 26: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Ontology learning : subtasks

Ontology population

Get new instances of concept(s) already present in the ontology

Ontology enrichment

Update (add or modify) concepts, properties and relations in agiven ontology

Evolution vs. revolution

Evolution (ontology continuity): Add new knowledge

Revolution (ontology discontinuity): Modify existingknowledge

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/22

Page 27: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Ontology learning : subtasks

Ontology population

Get new instances of concept(s) already present in the ontology

Ontology enrichment

Update (add or modify) concepts, properties and relations in agiven ontology

Evolution vs. revolution

Evolution (ontology continuity): Add new knowledge

Revolution (ontology discontinuity): Modify existingknowledge

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/22

Page 28: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Ontology learning : methods

[Buitelaar & Cimiano, 2008] ”Bridging the gap between text andknowledge”

Natural Language Processing

[Buitelaar et al., 2003, Velardi et al., 2005]

Concept extraction

Taxonomy learning (is-a, part-of)

Population (information extraction)

Machine Learning

Clustering for taxonomy learning [Bisson et al., 2000]

Association rules for relation discovery [Madche & Staab,2000]

ILP for relation discovery [Rudolph et al., 2007]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 16/22

Page 29: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Ontology learning : methods

[Buitelaar & Cimiano, 2008] ”Bridging the gap between text andknowledge”

Natural Language Processing

[Buitelaar et al., 2003, Velardi et al., 2005]

Concept extraction

Taxonomy learning (is-a, part-of)

Population (information extraction)

Machine Learning

Clustering for taxonomy learning [Bisson et al., 2000]

Association rules for relation discovery [Madche & Staab,2000]

ILP for relation discovery [Rudolph et al., 2007]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 16/22

Page 30: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Ontology learning : methods

[Buitelaar & Cimiano, 2008] ”Bridging the gap between text andknowledge”

Natural Language Processing

[Buitelaar et al., 2003, Velardi et al., 2005]

Concept extraction

Taxonomy learning (is-a, part-of)

Population (information extraction)

Machine Learning

Clustering for taxonomy learning [Bisson et al., 2000]

Association rules for relation discovery [Madche & Staab,2000]

ILP for relation discovery [Rudolph et al., 2007]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 16/22

Page 31: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Outline ...

1 Bayesian networksBN definitionBN learning

2 OntologiesOntology definitionOntology learning

3 BNs and OntologiesExisting workOur proposal

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 17/22

Page 32: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Existing work

Bayesian Network =⇒ Ontology

BayesOWL [Ding & Peng, 2004]

OntoBayes [Yang & Calmet, 2005]

PR-OWL [Costa & Laskey, 2006]

Use of BNs for probabilistic modeling and reasoning (no learning)

Ontology =⇒ Bayesian Network

BN ”basic” construction using ontologies [Devitt et al., 2006]

Ontology-based semi-automatic construction of BN inE-health applications [Jeon & Ko, 2007]

Use of ontologies to manually build a BN (without data)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 18/22

Page 33: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Existing work

Bayesian Network =⇒ Ontology

BayesOWL [Ding & Peng, 2004]

OntoBayes [Yang & Calmet, 2005]

PR-OWL [Costa & Laskey, 2006]

Use of BNs for probabilistic modeling and reasoning (no learning)

Ontology =⇒ Bayesian Network

BN ”basic” construction using ontologies [Devitt et al., 2006]

Ontology-based semi-automatic construction of BN inE-health applications [Jeon & Ko, 2007]

Use of ontologies to manually build a BN (without data)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 18/22

Page 34: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Existing work

Bayesian Network =⇒ Ontology

BayesOWL [Ding & Peng, 2004]

OntoBayes [Yang & Calmet, 2005]

PR-OWL [Costa & Laskey, 2006]

Use of BNs for probabilistic modeling and reasoning (no learning)

Ontology =⇒ Bayesian Network

BN ”basic” construction using ontologies [Devitt et al., 2006]

Ontology-based semi-automatic construction of BN inE-health applications [Jeon & Ko, 2007]

Use of ontologies to manually build a BN (without data)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 18/22

Page 35: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Existing work

Bayesian Network =⇒ Ontology

BayesOWL [Ding & Peng, 2004]

OntoBayes [Yang & Calmet, 2005]

PR-OWL [Costa & Laskey, 2006]

Use of BNs for probabilistic modeling and reasoning (no learning)

Ontology =⇒ Bayesian Network

BN ”basic” construction using ontologies [Devitt et al., 2006]

Ontology-based semi-automatic construction of BN inE-health applications [Jeon & Ko, 2007]

Use of ontologies to manually build a BN (without data)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 18/22

Page 36: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Our proposal

”Bridging the gap between text and knowledge”

Ontology + data =⇒ Bayesian Network =⇒ Ontology

BN Structure Learning Ontology Evolution

2 variations

1 Causal BNs more

semantical causal discovery SemCaDo 1.0 [Ben Messaoud &al. 2009]causal discovery for ontology evolution SemCaDo 2.0 [BenMessaoud & al., 2009] more

2 Object-Oriented BNs

OOBN structure learning and ontology evolution O2C [BenIshak et al., 2011] more

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 19/22

Page 37: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Our proposal

”Bridging the gap between data and knowledge”

Ontology + data =⇒ Bayesian Network =⇒ Ontology

BN Structure Learning Ontology Evolution

2 variations

1 Causal BNs more

semantical causal discovery SemCaDo 1.0 [Ben Messaoud &al. 2009]causal discovery for ontology evolution SemCaDo 2.0 [BenMessaoud & al., 2009] more

2 Object-Oriented BNs

OOBN structure learning and ontology evolution O2C [BenIshak et al., 2011] more

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 19/22

Page 38: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Our proposal

”Bridging the gap between data and knowledge”

Ontology + data =⇒ Bayesian Network =⇒ Ontology

BN Structure Learning Ontology Evolution

2 variations

1 Causal BNs more

semantical causal discovery SemCaDo 1.0 [Ben Messaoud &al. 2009]causal discovery for ontology evolution SemCaDo 2.0 [BenMessaoud & al., 2009] more

2 Object-Oriented BNs

OOBN structure learning and ontology evolution O2C [BenIshak et al., 2011] more

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 19/22

Page 39: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Our proposal

”Bridging the gap between data and knowledge”

Ontology + data =⇒ Bayesian Network =⇒ Ontology

BN Structure Learning Ontology Evolution

2 variations

1 Causal BNs more

semantical causal discovery SemCaDo 1.0 [Ben Messaoud &al. 2009]causal discovery for ontology evolution SemCaDo 2.0 [BenMessaoud & al., 2009] more

2 Object-Oriented BNs

OOBN structure learning and ontology evolution O2C [BenIshak et al., 2011] more

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 19/22

Page 40: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Conclusion

BN structure learning is NP hard, need of prior knowledgeOntology evolution is difficult, based on text-miningCooperation can help for both tasks

Originality of our proposal

BN structure learning : use ontology instead of expertknowledge : separation between expert acquisition andstructure learning

Ontology evolution : use BN structure learning to directlydiscover relationships from data

Difficulties

No similar work or benchmark for a comparative study

SemCaDo : one concept = attribute = node

O2C : OOBN learning is too complexMontassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 20/22

Page 41: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Conclusion

BN structure learning is NP hard, need of prior knowledgeOntology evolution is difficult, based on text-miningCooperation can help for both tasks

Originality of our proposal

BN structure learning : use ontology instead of expertknowledge : separation between expert acquisition andstructure learning

Ontology evolution : use BN structure learning to directlydiscover relationships from data

Difficulties

No similar work or benchmark for a comparative study

SemCaDo : one concept = attribute = node

O2C : OOBN learning is too complexMontassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 20/22

Page 42: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Conclusion

BN structure learning is NP hard, need of prior knowledgeOntology evolution is difficult, based on text-miningCooperation can help for both tasks

Originality of our proposal

BN structure learning : use ontology instead of expertknowledge : separation between expert acquisition andstructure learning

Ontology evolution : use BN structure learning to directlydiscover relationships from data

Difficulties

No similar work or benchmark for a comparative study

SemCaDo : one concept = attribute = node

O2C : OOBN learning is too complexMontassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 20/22

Page 43: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Future works

What next ?

Creation of benchmarks

Generalize SemCaDo or O2C with more general models

Ideas : Multi Entity BNs [Laskey, 2006] Relational BNs[Getoor et al., 2007]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 21/22

Page 44: Ontological knowledge integration for Bayesian network structure learning

Bayesian networks Ontologies BNs and Ontologies

Thank you for your [email protected]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 22/22

Page 45: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Outline ...

4 Bayesian network Structure Learningconstraint-based methodsscore-based methodslocal search methods

5 Causal Bayesian Networksdefinitioncausal BN structure learning

6 SemCaDo algorithmdefinitionexperimental study

7 O2C algorithmdefinitionalgorithm

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 1/49

Page 46: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Constraint-based methods

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize a scoringfunction

Hybrid methods

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 2/49

Page 47: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Constraint-based methods

Two reference algorithms

Pearl et Verma : IC, IC*

Spirtes, Glymour et Scheines : SGS, PC, CI, FCI

Common principle

Build an undirected graph describing direct dependencesbetween variables (χ2 tests)

by adding edges (Pearl et Verma)by deleting edges (SGS)

Detect V-structures (from previous statistical tests)

Propagate some edge orientation (inferred edges) in order toobtain a CPDAG

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 3/49

Page 48: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

constraint-based methods

Some inconvenients

Reliability of CI test conditionally to several variables with alimited amount of data)

SGS heuristic : if df < N10 , then declare dependence

Combinatorial explosion of the number of tests

PC heuristic : begin with order 0 (XA⊥XB) then order 1(XA⊥XB | XC ), etc ...

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 4/49

Page 49: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

PC algorithm

Step 0 : undirected complete graph

Left : target BN used to generate 5000 samples

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/49

Page 50: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

PC algorithm

Step 1a : delete all order 0 independences discovered

χ2: S⊥A L⊥A B⊥A O⊥A X⊥A D⊥A T⊥S L⊥T O⊥B X⊥B

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/49

Page 51: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

PC algorithm

Step 1a : delete all order 1 independences discovered

χ2: T⊥A|O O⊥S |L X⊥S |L B⊥T |S X⊥T |O D⊥T |O ...

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/49

Page 52: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

PC algorithm

Step 1a : delete all order 2 independences discovered

χ2: D⊥S |{L,B} X⊥O|{T , L} D⊥O|{T , L}

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/49

Page 53: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

PC algorithm

Step 2 : research V-structures

χ2 : one V-structure T → O ← L is discovered

A S

T L B

O

X D

A S

T L B

O

X D

Step 3 : inferred edges

no one in this example

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/49

Page 54: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

PC algorithm

From CPDAG to DAG

Orientation of the remaining undirected edges(only constraint : do not create any new V-structure)

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/49

Page 55: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

PC algorithm

Obtained DAG versus target one

χ2 test with 5000 samples fails to discoverA→ T , O → X and O → D

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/49

Page 56: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Score-based methods

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize a scoringfunction

Hybrid methods

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 6/49

Page 57: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Notion of score

General principle : Occam razor

Pluralitas non est ponenda sine neccesitate(La pluralite (des notions) ne devrait pas etre posee sansnecessite) plurality should not be posited without necessity

Frustra fit per plura quod potest fieri per pauciora(C’est en vain que l’on fait avec plusieurs ce que l’on peutfaire avec un petit nombre) It is pointless to do with morewhat can be done with fewer

= Parcimony principle : find a model

Fitting the data D : likelihood : L(D|θ,B)

The simplest possible : dimension of B : Dim(B)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 7/49

Page 58: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Score examples

AIC and BIC

Compromise between likelihood and complexity

Application of AIC (Akaıke 70) and BIC (Schwartz 78) criteria

SAIC (B,D) = log L(D|θMV ,B)− Dim(B)

SBIC (B,D) = log L(D|θMV ,B)− 1

2Dim(B) log N

Bayesian scores : BD, BDe, BDeu

SBD(B,D) = P(B,D) (Cooper et Herskovits 92)

BDe = BD + score equivalence (Heckerman 94)

SBD(B,D) = P(B)n∏

i=1

qi∏j=1

Γ(αij)

Γ(Nij + αij)

ri∏k=1

Γ(Nijk + αijk)

Γ(αijk)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/49

Page 59: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Score properties

Two important properties

Decomposability

(Global)Score(B,D) =n∑

i=1

(local)score(Xi , pai )

Score equivalence

If two BN B1 and B2 are Markov equivalent thenS(B1,D) = S(B2,D)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 9/49

Page 60: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Score properties

Two important properties

Decomposability

(Global)Score(B,D) =n∑

i=1

(local)score(Xi , pai )

Score equivalence

If two BN B1 and B2 are Markov equivalent thenS(B1,D) = S(B2,D)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 9/49

Page 61: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Heuristic exploration of search space

Search space and heuristics

B space

restriction to tree space : Chow&Liu, MWST

DAG with node ordering : K2 algorithmgreedy searchgenetic algorithms, ...

E space

greedy equivalence search

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 10/49

Page 62: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Restriction to tree space

Principle

What is the best tree connecting all the nodes,i.e. maximizing a weight defined for each possible edge ?

Answer : maximal weighted spanning tree (MWST)

weight = mutual information [Chow & Liu, 1968]

W (XA,XB) =∑a,b

Nab

Nlog

NabN

Na.N.b

weight = any local score variation [Heckerman, 1994]

W (XA,XB) = score(XA,Pa(XA) = XB)− score(XA, ∅)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 11/49

Page 63: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Restriction to tree space

Principle

What is the best tree connecting all the nodes,i.e. maximizing a weight defined for each possible edge ?

Answer : maximal weighted spanning tree (MWST)

weight = mutual information [Chow & Liu, 1968]

W (XA,XB) =∑a,b

Nab

Nlog

NabN

Na.N.b

weight = any local score variation [Heckerman, 1994]

W (XA,XB) = score(XA,Pa(XA) = XB)− score(XA, ∅)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 11/49

Page 64: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Restriction to tree space

Remarks

MWST returns an undirected tree

This undirected tree = CPDAG of all the directed tree withthis skeleton

Obtain a directed tree by (randomly) choosing one root andorienting the edges with a depth first search over this tree

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 12/49

Page 65: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Example : obtained DAG vs. target one

A S

T L B

O

X D

A S

T L B

O

X D

MWST can not discover cycles neither V-structures (tree space !)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 13/49

Page 66: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Heuristic exploration of search space

Search space and heuristics

B space

restriction to tree space : Chow&Liu, MWSTDAG with node ordering : K2 algorithm

greedy search

genetic algorithms, ...

E space

greedy equivalence search

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 14/49

Page 67: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Greedy search

Principle

Exploration of the search space with traversal operators

add edgeinvert edgedelete edge

and respect the DAG definition (no cycle)

Exploration can begin from any given DAG

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/49

Page 68: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Example : obtained DAG vs. target one

A S

T L B

O

X D

A S

T L B

O

X D

start = empty graph. GS result = local optimum :-(

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 16/49

Page 69: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Example : obtained DAG vs. target one

A S

T L B

O

X D

A S

T L B

O

X D

start = MWST result. GS result is better

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 16/49

Page 70: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Heuristic exploration of search space

Search space and heuristics

B space

restriction to tree space : Chow&Liu, MWSTDAG with node ordering : K2 algorithmgreedy searchgenetic algorithms, ...

E space

greedy equivalence search

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 17/49

Page 71: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

What about changing our search space

Preliminaries

IC/PC result = CPDAG

MWST result = CPDAG

Score-based methods do not distinguish equivalent DAGs

Search in EE = CPDAG space

Better properties : YES

2 equivalent structures = 1 unique structure in EBetter size : NO

E size is quasi similar to DAG spaceasymptotic ratio is 3,7 : [Gillispie & Perlman, 2001]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 18/49

Page 72: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Greedy Equivalent Search

Principe [Chickering, 2002]

Greedy search in EPhase 1 : add edges until convergence

Phase 2 : delete edges until convergence

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 19/49

Page 73: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Add edge examples in E

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

E0

neighborhood of E0

E1

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

neighborhood of E1

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 20/49

Page 74: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Score-based methods

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize ascoring/fitness function

Hybrid methods

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 21/49

Page 75: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Hybrid methods = local search methods

Local search and global learning

Search one local neighborhood for a given node T

Reiterate for each T

Learn the global structure with these local informations

which neighborhood ?

PC (T ) : Parents and Children T (without distinction)

MB(T ) : Markov Blanket of T - Parents, children andspouses

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 22/49

Page 76: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Hybrid methods = local search methods

Local search and global learning

Search one local neighborhood for a given node T

Reiterate for each T

Learn the global structure with these local informations

which neighborhood ?

PC (T ) : Parents and Children T (without distinction)

MB(T ) : Markov Blanket of T - Parents, children andspouses

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 22/49

Page 77: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Local search identification

Identification of MB(T) or PC(T)

IAMB [Aliferis et al., 2002]

MMPC [Tsamardinos et al., 2003], ...

Hybrid structure learning algorithms

MMHC [Tsamardinos et al., 2006] = MMPC + Greedy search

back

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 23/49

Page 78: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Outline ...

4 Bayesian network Structure Learningconstraint-based methodsscore-based methodslocal search methods

5 Causal Bayesian Networksdefinitioncausal BN structure learning

6 SemCaDo algorithmdefinitionexperimental study

7 O2C algorithmdefinitionalgorithm

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 24/49

Page 79: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

A BN is not a causal model

Usual BN

A→ B does not imply direct causal relationship between Aand B,

Only edges from the CPDAG represent causal relationships ∗

Confusion

When the DAG is given by an expert, this graph is very oftencausal

When the DAG is learnt from data, no reason to be causal !

Causal BN

Each A→ B represents one direct causal relationship, i.e. A isthe direct cause which generates B

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 25/49

Page 80: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

A BN is not a causal model

Usual BN

A→ B does not imply direct causal relationship between Aand B,

Only edges from the CPDAG represent causal relationships ∗

Confusion

When the DAG is given by an expert, this graph is very oftencausal

When the DAG is learnt from data, no reason to be causal !

Causal BN

Each A→ B represents one direct causal relationship, i.e. A isthe direct cause which generates B

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 25/49

Page 81: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

A BN is not a causal model

Usual BN

A→ B does not imply direct causal relationship between Aand B,

Only edges from the CPDAG represent causal relationships ∗

Confusion

When the DAG is given by an expert, this graph is very oftencausal

When the DAG is learnt from data, no reason to be causal !

Causal BN

Each A→ B represents one direct causal relationship, i.e. A isthe direct cause which generates B

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 25/49

Page 82: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Intervention vs. Observation

Probabilistic inference :we observe B = b,we compute P(A|B = b)

Causal inference [Pearl 00]:we manipulate/intervene upon B : do(B = b)

example with A→ B

P(A|do(B = b)) = P(A),

P(B|do(A = a)) = P(B|A = a)

example with A← B

P(A|do(B = b)) = P(A|B = b),

P(B|do(A = a)) = P(B)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 26/49

Page 83: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Causal structure learning

Usual situation : observational datawhatever the method, the right result is the CPDAGpartial determination of the causal structure

How to find a full causal graph ?

Use only experimental data, and decide at every step themore interesting experiment to realize (active learning[Murphy, 2001], ...)

Use only observational data, for a very specific distribution(LiNGAM models [Hoyer et al., 2008])

Another idea: MyCaDo algorithm [Meganck et al., 2006]

Use (already existing) observational data to find the CPDAG

Complete the orientation with experimental data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 27/49

Page 84: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Causal structure learning

Usual situation : observational datawhatever the method, the right result is the CPDAGpartial determination of the causal structure

How to find a full causal graph ?

Use only experimental data, and decide at every step themore interesting experiment to realize (active learning[Murphy, 2001], ...)

Use only observational data, for a very specific distribution(LiNGAM models [Hoyer et al., 2008])

Another idea: MyCaDo algorithm [Meganck et al., 2006]

Use (already existing) observational data to find the CPDAG

Complete the orientation with experimental data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 27/49

Page 85: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Causal structure learning

Usual situation : observational datawhatever the method, the right result is the CPDAGpartial determination of the causal structure

How to find a full causal graph ?

Use only experimental data, and decide at every step themore interesting experiment to realize (active learning[Murphy, 2001], ...)

Use only observational data, for a very specific distribution(LiNGAM models [Hoyer et al., 2008])

Another idea: MyCaDo algorithm [Meganck et al., 2006]

Use (already existing) observational data to find the CPDAG

Complete the orientation with experimental data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 27/49

Page 86: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

MyCaDo algorithm

Donnéesd'observation

Donnéesexpérimentales

SystèmeAlgorithmed'apprentissage

de structured'un RB

CPDAG

Algorithme MyCaDo

Choixde l'exp.

Réalisationde l'exp.

Analysedes résultats

Réseau Bayésiencausal

(1) Choice of the experiment = what variable M manipulate ?

the one potentially orienting more edges

by taking into account experiment/observation costback

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 28/49

Page 87: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

MyCaDo algorithm

Donnéesd'observation

Donnéesexpérimentales

SystèmeAlgorithmed'apprentissage

de structured'un RB

CPDAG

Algorithme MyCaDo

Choixde l'exp.

Réalisationde l'exp.

Analysedes résultats

Réseau Bayésiencausal

(2) Experimentation

do(M = m) for all possible values m

observe all candidate variables C (C –M)back

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 28/49

Page 88: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

MyCaDo algorithm

Donnéesd'observation

Donnéesexpérimentales

SystèmeAlgorithmed'apprentissage

de structured'un RB

CPDAG

Algorithme MyCaDo

Choixde l'exp.

Réalisationde l'exp.

Analysedes résultats

Réseau Bayésiencausal

(3) Result analysis : P(C |M) (obs.) ' P(C |do(M)) (exp.) ?

if equal C ← M else M ← C

orient some other edges by applying specific rulesback

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 28/49

Page 89: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Outline ...

4 Bayesian network Structure Learningconstraint-based methodsscore-based methodslocal search methods

5 Causal Bayesian Networksdefinitioncausal BN structure learning

6 SemCaDo algorithmdefinitionexperimental study

7 O2C algorithmdefinitionalgorithm

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 29/49

Page 90: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

SemCaDo assumptions

BN ⇔ Ontology general assumptions

Nodes ⇐⇒ Concepts

Random variables ⇐⇒ Concept attributes

Causal dependencies ⇐⇒ Semantic causal relations

Data ⇐⇒ Concept-attribute instances

SemCaDo specific assumptions

Causal relations concern concepts sharing the same semantictype

Ontology continuity (evolution)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 30/49

Page 91: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

SemCaDo assumptions

BN ⇔ Ontology general assumptions

Nodes ⇐⇒ Concepts

Random variables ⇐⇒ Concept attributes

Causal dependencies ⇐⇒ Semantic causal relations

Data ⇐⇒ Concept-attribute instances

SemCaDo specific assumptions

Causal relations concern concepts sharing the same semantictype

Ontology continuity (evolution)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 30/49

Page 92: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Example

Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Causes

Causes Causes

CausesCausesCauses

Our causal BN will represent the grey part of the ontology

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 31/49

Page 93: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

SemCaDo three main steps

Obs.Data

1st stepStructure learning

3rd stepOntology evolutionChoice of experienceChoice of experience

Analyse the results

2nd stepCausal discovery

PDAG CBN

Perform the experience

Enriched ontology

Domainontology

Inter.Data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 32/49

Page 94: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

1st step: initial BN structure learning

Extraction of the causal relationships (Rc) from the ontology

Integration of these edge as constraints in the structurelearning algorithm [De Campos & al., 2007]

Continuity : these edges will not be ”questioned” duringlearning

Interest

Ontology helps in reducing the search task complexity

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 33/49

Page 95: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

1st step: initial BN structure learning

Extraction of the causal relationships (Rc) from the ontology

Integration of these edge as constraints in the structurelearning algorithm [De Campos & al., 2007]

Continuity : these edges will not be ”questioned” duringlearning

Interest

Ontology helps in reducing the search task complexity

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 33/49

Page 96: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

2nd step: serendipitous causal discovery

Experimentations are needed in order to find the causalorientation of some edges

Our solution : MyCaDo [Meganck & al., 2006], iterativecausal discovery process

Adaptation to take into account ontological knowledge : Radadistance on H between one set of concepts and their mostspecific common subsumer

Interest

Ontology helps in potentially orienting the more unexpected(serendipitous) links

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 34/49

Page 97: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

2nd step: serendipitous causal discovery

Experimentations are needed in order to find the causalorientation of some edges

Our solution : MyCaDo [Meganck & al., 2006], iterativecausal discovery process

Adaptation to take into account ontological knowledge : Radadistance on H between one set of concepts and their mostspecific common subsumer

Interest

Ontology helps in potentially orienting the more unexpected(serendipitous) links

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 34/49

Page 98: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

3rd Step: Ontology evolution process

[Stojanovic et al., 2002]

Interest

BN structure learning from data helps in discovering new relationsin the ontology

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 35/49

Page 99: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

3rd Step: Ontology evolution process

[Stojanovic et al., 2002]

Interest

BN structure learning from data helps in discovering new relationsin the ontology

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 35/49

Page 100: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Experimental study (1)

Benchmark : no existing real benchmark & system :-(

BN graph : random generation (50 to 200 nodes)

Ontology :

Causal relationships : BN edgesHierarchy of concept : generation by clustering BN nodes

Data is generated by using BN as a generative model

Experimental protocol

Hierarchy of concepts and 10% to 40% of existing causalrelationships are given as inputs

Semantic gain : cumulative Rada distance of the discoveredrelationships

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 36/49

Page 101: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Experimental study (1)

Benchmark : no existing real benchmark & system :-(

BN graph : random generation (50 to 200 nodes)

Ontology :

Causal relationships : BN edgesHierarchy of concept : generation by clustering BN nodes

Data is generated by using BN as a generative model

Experimental protocol

Hierarchy of concepts and 10% to 40% of existing causalrelationships are given as inputs

Semantic gain : cumulative Rada distance of the discoveredrelationships

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 36/49

Page 102: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Experimental study (2)

Ontology helps structure learning

SemCaDo performs better in less steps than MyCaDo

back

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 37/49

Page 103: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Experimental study (2)

Structure learning helps ontology evolution

Original causal discoveries are discovered first and can beadded to the ontology

back

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 37/49

Page 104: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Outline ...

4 Bayesian network Structure Learningconstraint-based methodsscore-based methodslocal search methods

5 Causal Bayesian Networksdefinitioncausal BN structure learning

6 SemCaDo algorithmdefinitionexperimental study

7 O2C algorithmdefinitionalgorithm

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 38/49

Page 105: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Object Oriented Bayesian Networks

An extension of BNs using the object paradigm

[Bangsø and Wuillemin, 2000a; Bangsø and Wuillemin,

2000b; Koller and Pfeffer, 1997]

Support several aspects of the object oriented modeling. (e.g.,inheritance, instantiation)

Designed to model large and complex domains

OOBN structure learning : OO-SEM

This algorithm [Langseth and Nielsen, 2003] is based on 2 steps

Generation of a prior OOBN based on a prior expert knowledge

Grouping nodes into instantiations and instantiations intoclassesGiving a prior information about the candidate interfaces

Adaptation of Structural EM algorithm to learn the final structure

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 39/49

Page 106: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Object Oriented Bayesian Networks

An extension of BNs using the object paradigm

[Bangsø and Wuillemin, 2000a; Bangsø and Wuillemin,

2000b; Koller and Pfeffer, 1997]

Support several aspects of the object oriented modeling. (e.g.,inheritance, instantiation)

Designed to model large and complex domains

OOBN structure learning : OO-SEM

This algorithm [Langseth and Nielsen, 2003] is based on 2 steps

Generation of a prior OOBN based on a prior expert knowledge

Grouping nodes into instantiations and instantiations intoclassesGiving a prior information about the candidate interfaces

Adaptation of Structural EM algorithm to learn the final structure

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 39/49

Page 107: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

OOBN structure learning

Drawbacks

This prior knowledge is not always obvious to obtain

The expert should be familiar with the object orientedmodeling

Our idea

Harness ontologies representation capabilities in order to generatethe prior OOBN structure

Ontologies OOBNsConcepts Cp Classes

Properties Pcpi Real nodes

Inheritance relations HR Class hierarchies

Semantic relations SR Links/ Interfaces

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 40/49

Page 108: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

OOBN structure learning

Drawbacks

This prior knowledge is not always obvious to obtain

The expert should be familiar with the object orientedmodeling

Our idea

Harness ontologies representation capabilities in order to generatethe prior OOBN structure

Ontologies OOBNsConcepts Cp Classes

Properties Pcpi Real nodes

Inheritance relations HR Class hierarchies

Semantic relations SR Links/ Interfaces

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 40/49

Page 109: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

The 2OC approach

Ontology

Prior OOBN

Final OOBN

it seems interesting to refine the ontology

used initially !

Morphing process

Learning process

Change detection process

Proposals of evolution

Data (1) ontology to prior OOBN

[Ben Ishak et al., 2011a]

(2) final OOBN to ontology

[Ben Ishak et al., 2011b]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 41/49

Page 110: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Onto2PriorOOBN algorithm

Ontology based generation of a prior OOBN

Ontology graph traversal and morphing into a prior OOBNstructure

3 steps

Initialization step: to generate the OOBN class and a classto each conceptDiscovery step: to define input, internal and output sets foreach class of the OOBNClosing step: to define instances to add to the global OOBNclass

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 42/49

Page 111: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Onto2PriorOOBN algorithm

Ontology based generation of a prior OOBN

Ontology graph traversal and morphing into a prior OOBNstructure

3 steps

Initialization step: to generate the OOBN class and a classto each conceptDiscovery step: to define input, internal and output sets foreach class of the OOBNClosing step: to define instances to add to the global OOBNclass

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 42/49

Page 112: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Onto2PriorOOBN algorithm

Ontology based generation of a prior OOBN

Ontology graph traversal and morphing into a prior OOBNstructure

3 steps

Initialization step: to generate the OOBN class and a classto each conceptDiscovery step: to define input, internal and output sets foreach class of the OOBNClosing step: to define instances to add to the global OOBNclass

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 42/49

Page 113: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Onto2PriorOOBN algorithm

Ontology based generation of a prior OOBN

Ontology graph traversal and morphing into a prior OOBNstructure

3 steps

Initialization step: to generate the OOBN class and a classto each conceptDiscovery step: to define input, internal and output sets foreach class of the OOBNClosing step: to define instances to add to the global OOBNclass

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 42/49

Page 114: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Illustrative example : initial ontology

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaiesconcerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 43/49

Page 115: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Illustrative example : prior OOBN

MedCost

ThisCarCostThisCarDam

PropCost

OtherCarCost

ILiCost

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CO:CarOwner

Global_Insurance

accident

A:Accident

accident

C:Car

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Theft

T: Theft

Theft

I:Insurance

SeniorTrain

D:Driver

DrivingSkill

RiskAversion DrivQuality

DrivHist

SeniorTrain DrivingSkill

RiskAversion DrivQuality

DrivHist

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 44/49

Page 116: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

FinalOOBN2Onto

Ontology enrichment based on OOBN learning

Part of the ontology evolution process

Consists in adding, removing or modifying concepts,properties and/or relations

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 45/49

Page 117: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Example : remove relations

No common interface identified between two classes⇒ their corresponding concepts should be independent

After morphing

After learning

Possible change

p1.1p1.2

Cp1p2.1P2.2

Cp2

I1: cCp1

I2: cCp2

p1.1

p1.2p2.1p2.2

cp2.2 cp2.1

p1.1p1.2

Cp1p2.1P2.2

Cp2

I1: cCp1

I2: cCp2

p1.1

p1.2p2.1p2.2

SR

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 46/49

Page 118: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Example : add concepts / relations

If ccp communicates with only one class → Add relation

Otherwise, check classes similarities → Add concepts /relations

p1.1p1.2

Cp1

p2.1p2.2

Cp2

p3.1

Cp3p4.1p4.2

Cp4

I1: cCp1I4: cCp4

p1.1 p1.2

p4.1p4.2

p4.2 p4.1

I3: cCp3

p3.2

p3.2

I2: cCp2

p2.1p2.2

p2.2 p2.1

I1: cCp1

p1.2

I2: cCp2

p2.1p2.2

p2.2

p1.1p1.2

Cp1

p3.1

Cp3

p super.1

CpSuper

p2.2

Cp2p4.1

Cp4

After morphing Possible changeAfter learning

is-a is-a

I4: cCp4

p4.1p4.2

p4.1

I3: cCp3

p3.2p3.2

p1.1

SR1 SR2

SR3

SR1 SR2

SR3

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 47/49

Page 119: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Example : concepts redefinition

If the class contains more than one component, then Thecorresponding concept may be deconstructed into morerefined ones

p1p2p3p4

Cp

Ccp

p1p2

p3p4

I1 I2

A disconnected graph

After learningp1p2p3

Cp1

p4

Cp2

Possible change

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 48/49

Page 120: Ontological knowledge integration for Bayesian network structure learning

Bayesian network Structure Learning Causal Bayesian Networks SemCaDo algorithm O2C algorithm

Concepts and relations identification

Semi-automated process

The possible changes are communicated to an expert

The expert semantically identify the discovered relations and /or concepts

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 49/49