24
1 PalGov © 2011 1 PalGov © 2011 فلسطينيةلكترونية الديمية الحكومة ا أكاThe Palestinian eGovernment Academy www.egovacademy.ps Tutorial 4: Ontology Engineering & Lexical Semantics Session 6.2 Knowledge Double-Articulation Dr. Mustafa Jarrar University of Birzeit [email protected] www.jarrar.info

Pal gov.tutorial4.session6 2.knowledge double-articulation

Embed Size (px)

Citation preview

Page 1: Pal gov.tutorial4.session6 2.knowledge double-articulation

1PalGov © 2011 1PalGov © 2011

أكاديمية الحكومة اإللكترونية الفلسطينيةThe Palestinian eGovernment Academy

www.egovacademy.ps

Tutorial 4: Ontology Engineering & Lexical Semantics

Session 6.2

Knowledge Double-Articulation

Dr. Mustafa Jarrar

University of Birzeit

[email protected]

www.jarrar.info

Page 2: Pal gov.tutorial4.session6 2.knowledge double-articulation

2PalGov © 2011 2PalGov © 2011

About

This tutorial is part of the PalGov project, funded by the TEMPUS IV program of the

Commission of the European Communities, grant agreement 511159-TEMPUS-1-

2010-1-PS-TEMPUS-JPHES. The project website: www.egovacademy.ps

University of Trento, Italy

University of Namur, Belgium

Vrije Universiteit Brussel, Belgium

TrueTrust, UK

Birzeit University, Palestine

(Coordinator )

Palestine Polytechnic University, Palestine

Palestine Technical University, PalestineUniversité de Savoie, France

Ministry of Local Government, Palestine

Ministry of Telecom and IT, Palestine

Ministry of Interior, Palestine

Project Consortium:

Coordinator:

Dr. Mustafa Jarrar

Birzeit University, P.O.Box 14- Birzeit, Palestine

Telfax:+972 2 2982935 [email protected]

Page 3: Pal gov.tutorial4.session6 2.knowledge double-articulation

3PalGov © 2011 3PalGov © 2011

© Copyright Notes

Everyone is encouraged to use this material, or part of it, but should

properly cite the project (logo and website), and the author of that part.

No part of this tutorial may be reproduced or modified in any form or by

any means, without prior written permission from the project, who have

the full copyrights on the material.

Attribution-NonCommercial-ShareAlike

CC-BY-NC-SA

This license lets others remix, tweak, and build upon your work non-

commercially, as long as they credit you and license their new creations

under the identical terms.

Page 4: Pal gov.tutorial4.session6 2.knowledge double-articulation

4PalGov © 2011

Tutorial Map

Topic Time

Session 1_1: The Need for Sharing Semantics 1.5

Session 1_2: What is an ontology 1.5

Session 2: Lab- Build a Population Ontology 3

Session 3: Lab- Build a BankCustomer Ontology 3

Session 4: Lab- Build a BankCustomer Ontology 3

Session 5: Lab- Ontology Tools 3

Session 6_1: Ontology Engineering Challenges 1.5

Session 6_2: Ontology Double Articulation 1.5

Session 7: Lab - Build a Legal-Person Ontology 3

Session 8_1: Ontology Modeling Challenges 1.5

Session 8_2: Stepwise Methodologies 1.5

Session 9: Lab - Build a Legal-Person Ontology 3

Session 10: Zinnar – The Palestinian eGovernmentInteroperability Framework

3

Session 11: Lab- Using Zinnar in web services 3

Session 12_1: Lexical Semantics and Multilingually 1.5

Session 12_2: WordNets 1.5

Session 13: ArabicOntology 3

Session 14: Lab-Using Linguistic Ontologies 3

Session 15: Lab-Using Linguistic Ontologies 3

Intended Learning ObjectivesA: Knowledge and Understanding

4a1: Demonstrate knowledge of what is an ontology,

how it is built, and what it is used for.

4a2: Demonstrate knowledge of ontology engineering

and evaluation.

4a3: Describe the difference between an ontology and a

schema, and an ontology and a dictionary.

4a4: Explain the concept of language ontologies, lexical

semantics and multilingualism.

B: Intellectual Skills

4b1: Develop quality ontologies.

4b2: Tackle ontology engineering challenges.

4b3: Develop multilingual ontologies.

4b4: Formulate quality glosses.

C: Professional and Practical Skills

4c1: Use ontology tools.

4c2: (Re)use existing Language ontologies.

D: General and Transferable Skills

d1: Working with team.

d2: Presenting and defending ideas.

d3: Use of creativity and innovation in problem solving.

d4: Develop communication skills and logical reasoning

abilities.

Page 5: Pal gov.tutorial4.session6 2.knowledge double-articulation

5PalGov © 2011 5PalGov © 2011

Outline and Session ILOs

This session will help student to:

4a2: Demonstrate knowledge of ontology engineering and

evaluation.

4b1: Develop quality ontologies.

4b2: Tackle ontology engineering challenges.

4b4: Formulate quality glosses.

4c2: (Re)use existing Language ontologies.

Page 6: Pal gov.tutorial4.session6 2.knowledge double-articulation

6PalGov © 2011 6PalGov © 2011

Reading

Mustafa Jarrar: Building A Formal Arabic Ontology (Invited Paper) . In proceedings of

the Experts Meeting On Arabic Ontologies And Semantic Networks. Alecso, Arab League.

Tunis, July 26-28, 2011.Article http://www.jarrar.info/publications/J11.pdf.htm

Slides: http://mjarrar.blogspot.com/2011/08/building-formal-arabic-ontology-invited.html

Mustafa Jarrar: Towards The Notion Of Gloss, And The Adoption Of Linguistic

Resources In Formal Ontology Engineering. In proceedings of the 15th International World

Wide Web Conference (WWW2006). Edinburgh, Scotland. Pages 497-503. ACM Press. ISBN:

1595933239. May 2006.http://www.jarrar.info/publications/J06.pdf.htm

Page 7: Pal gov.tutorial4.session6 2.knowledge double-articulation

7PalGov © 2011 7PalGov © 2011

Knowledge Double-Articulation

Ontology

Base

Commitment

Layer

Onto

logy

Particular

Application

Application-kind

Axiomatizations

Domain

AxiomatizationD

ouble

Artic

ula

tion

The meaning of a vocabulary should be doubly-articulated into domain

axiomatization and application axiomatization(s).

• Domain axiomatization (or a linguistic resource) is mainly concerned with

characterizing the “intended meaning/models” of a vocabulary at the

community/domain level.

• Application axiomatization is more concerned with the utility of these vocabularies

according to certain application/usability perspectives.

• Ontologies built in this way are easier to build, highly reusable and usable, easier

to integrate with other ontologies, and smoother to maintain.

A methodology to engineer ontologies

Page 8: Pal gov.tutorial4.session6 2.knowledge double-articulation

8PalGov © 2011 8PalGov © 2011

Knowledge Double-Articulation

Bibliotheek

Domain axiomatization Application-kind axiomatizations Particular Applications

Highly usable (application level)Highly reusable (domain/community level)

Page 9: Pal gov.tutorial4.session6 2.knowledge double-articulation

9PalGov © 2011 9PalGov © 2011

Knowledge Double-Articulation

Bibliotheek

Domain axiomatization Application-kind axiomatizations Particular Applications

Highly usable (application level)Highly reusable (domain/community level)

OntologyBase, holding

linguistic knowledge, such as

WordNet

Page 10: Pal gov.tutorial4.session6 2.knowledge double-articulation

10PalGov © 2011 10PalGov © 2011

Knowledge Double-Articulation

Bibliotheek

Domain axiomatization Application-kind axiomatizations Particular Applications

Highly usable (application level)Highly reusable (domain/community level)

OntologyBase, holding

linguistic knowledge, such as

WordNet

accounts for the

intended meaning of

domain vocabularies;

rooted at a human

language/community

conceptualization.

interpreted

intensionally;

a shared vocabulary

space for application

axiomatizations;

Page 11: Pal gov.tutorial4.session6 2.knowledge double-articulation

11PalGov © 2011 11PalGov © 2011

Knowledge Double-Articulation Theory

I I: The set of the intended models for concept C

e.g. “Book” at the a human language conceptualization level

A concept is a set of rules in our mind about a certain thing in reality.

IA

IA1: The set of the legal models (/possible extensions) of application CA

e.g. “Book” for museum applicationsIB

IA2: The set of the legal models (/possible extensions) of application CB

e.g. “Book” for public/university librariesICIA3: The set of the legal models(/possible extensions) of application CC

e.g. “Book” for bookstores

For concept C, the set I of “all possible” instances that comply with these rules

are called the intended models of the concept C.

An application A that is interested -according to its usability perspectives- in a

subset IAi of the set I, is supposed to provide some rules to specialize I, IAi is

called legal models.

Back

IAi I

Domain/Language Level

Application Level

Page 12: Pal gov.tutorial4.session6 2.knowledge double-articulation

12PalGov © 2011 12PalGov © 2011

Applying the Double-Articulation Theory

Rooting vocabulary: all vocabulary used in an application

axiomatization is linked with a vocabulary in the domain

axiomatization (which can be linguistic resources, e.g.,

WordNet). e.g., each concept in an ORM model/OWL file is linked with

a concept WordNet/ArabicOntology.

Glosses: If a certain vocabulary does not exist in the domain

axiomatization, then it must define introduced with gloss.

Context: Each application axiomatization must have a context,

as its scope of interpretation.

Modularize application axiomatization into several modules.

To apply the Double-Articulation Theory in practice you may assure that

your ontology is engineering in this way:

Page 13: Pal gov.tutorial4.session6 2.knowledge double-articulation

13PalGov © 2011 13PalGov © 2011

Rooting vocabulary

Each vocabulary in your ontology can be linked (e.g. though a

namespace) with a concept in a linguistic resource (e.g. a synset in

WordNet).

Page 14: Pal gov.tutorial4.session6 2.knowledge double-articulation

14PalGov © 2011 14PalGov © 2011

Example (Customer Complaint Ontology)

Central complaining portal

See http://www.jarrar.info/publications/mjarrar-CCFORM-chapter.pdf.htm

Page 15: Pal gov.tutorial4.session6 2.knowledge double-articulation

15PalGov © 2011 15PalGov © 2011

Example (Customer Complaint Ontology)

See http://www.jarrar.info/publications/mjarrar-CCFORM-chapter.pdf.htm

Page 16: Pal gov.tutorial4.session6 2.knowledge double-articulation

16PalGov © 2011 16PalGov © 2011

CC Ontology (Example)D

om

ain

Axio

mat

izat

ion

CCApplication1 CCApplication2 CCApplicationn

7 axiomatization Modules

ContractRecipient

Complaint

Problem

ComplianantResolution

Address

CC Glossary: 220 glosses

CC Ontology base: 300 lexons

CCcontext

Page 17: Pal gov.tutorial4.session6 2.knowledge double-articulation

17PalGov © 2011 17PalGov © 2011

Defining Glosses

An auxiliary informal (but controlled) account of the intended meaning of a

linguistic term, for the commonsense perception of humans.

(NOT) to catalogue general information and comments, as e.g. conventional

dictionaries and encyclopedias usually do, or as <rdfs:comment>.

A gloss is supposed to render factual knowledge that is critical to understand a

concept, but that e.g. is implausible, unreasonable, or very difficult to formalize

and/or articulate explicitly

Page 18: Pal gov.tutorial4.session6 2.knowledge double-articulation

18PalGov © 2011 18PalGov © 2011

What should and what should not be provided in a gloss:

1. Start with the principal/super type of the concept being defined.

E.g. „Search engine‟: “A computer program that …”, „Invoice‟: “A business document that…”,

„University‟: “An institution of …”.

2. Written in a form of propositions, offering the reader inferential knowledgethat help him to construct the image of the concept.

E.g. Compare „Search engine‟:

“A computer program for searching the internet, it can be defined as one of the most useful aspects

of the World Wide Web. Some of the major ones are Google, ….”;

A computer program that enables users to search and retrieves documents or data from a database

or from a computer network…”.

3. Focus on distinguishing characteristics and intrinsic prosperities that

differentiate the concept out of other concepts.

E.g. Compare, „Laptop computer‟:

“A computer that is designed to do pretty much anything a desktop computer can do, it runs for a

short time (usually two to five hours) on batteries”.

“A portable computer small enough to use in your lap…”.

The ontological notion of Gloss

Page 19: Pal gov.tutorial4.session6 2.knowledge double-articulation

19PalGov © 2011 19PalGov © 2011

4. Use supportive examples :

- To clarify cases that are commonly known to be false but they are true, or that are known to

be true but they are false;

- To strengthen and illustrate distinguishing characteristics (e.g. define by examples, counter-

examples).

Examples can be types and/or instances of the concept being defined.

5. Be consistent with formal definitions/axioms.

6. Be sufficient, clear, and easy to understand.

The ontological notion of Gloss

Page 20: Pal gov.tutorial4.session6 2.knowledge double-articulation

20PalGov © 2011 20PalGov © 2011

Specifying a Context

• Context: A scope of Interpretation • That is: An abstract identifier that refers to implicit (or maybe tacit)

assumptions, in which the interpretation of a term is bounded to a concept

In practice, we define context by referring to a source (e.g. a set of

documents, laws and regulations, informal description of “best practice”, etc.),

which, by human understanding, is assumed to “contain” those assumptions.

Concepts, relations and rules are assumed (by human understanding) to be

“true within their context‟s source”.

Page 21: Pal gov.tutorial4.session6 2.knowledge double-articulation

21PalGov © 2011 21PalGov © 2011

Context (Example)

Customer complaining Context

Page 22: Pal gov.tutorial4.session6 2.knowledge double-articulation

22PalGov © 2011 22PalGov © 2011

Ontology Modularization

Develop an application

axiomatization as a set of

modules and later compose

to form one module.

Page 23: Pal gov.tutorial4.session6 2.knowledge double-articulation

23PalGov © 2011 23PalGov © 2011

Ontology Modularization (why? How?)

Applic

atio

n

Axio

matizatio

n

Dom

ain

Axio

matizatio

n

Why to modularize?

Because Modules are:

1. Easier to reuse

2. Easier to build,

maintain, and replace

3. Enable distributed

development of

modules

4. Enable the effective

management and

browsing

When to modularize?

Modularity criteria:

1. Subject-oriented

2. Purpose/Task-oriented

3. Stability

Page 24: Pal gov.tutorial4.session6 2.knowledge double-articulation

PalGov © 2011 24

References

• Mustafa Jarrar: Building A Formal Arabic Ontology (Invited Paper) . In

proceedings of the Experts Meeting On Arabic Ontologies And Semantic

Networks. Alecso, Arab League. Tunis, July 26-28,

2011.Article http://www.jarrar.info/publications/J11.pdf.htm

Slides: http://mjarrar.blogspot.com/2011/08/building-formal-arabic-ontology-

invited.html

• Mustafa Jarrar: Towards The Notion Of Gloss, And The Adoption Of

Linguistic Resources In Formal Ontology Engineering. In proceedings of

the 15th International World Wide Web Conference (WWW2006). Edinburgh,

Scotland. Pages 497-503. ACM Press. ISBN: 1595933239. May

2006.http://www.jarrar.info/publications/J06.pdf.htm

• Mustafa Jarrar: Towards methodological principles for ontology engineering.

PhD Thesis. Vrije Universiteit Brussel. (May 2005)