Towards a Model of Social Coherence in Multi-Agent Organizations

Towards a Model of Social CoherenceTowards a Model of Social CoherenceIn Multi-Agent OrganizationsIn Multi-Agent Organizations

Erick MartErick MartííneznezIvan KwiatkowskiIvan KwiatkowskiPhilippe PasquierPhilippe Pasquier

{emartinez, pasquier}@sfu.ca{emartinez, pasquier}@sfu.ca

Contributions

● ModelModel

● Operational model where agent behaviour is driven by Operational model where agent behaviour is driven by tractable tractable coherence calculuscoherence calculus

● Local cLocal coherenceoherence-driven agent behaviour drives the dynamics of -driven agent behaviour drives the dynamics of multi-agent organizations; from where multi-agent organizations; from where social coherencesocial coherence emerges emerges

● Local Local coherence calculuscoherence calculus of agents incorporates of agents incorporates sanction policiessanction policies

● ImplementationImplementation

● Java-based Java-based simulation frameworksimulation framework for studying the dynamics of for studying the dynamics of social systemssocial systems

● ExperimentsExperiments

● We illustrate our model by running some We illustrate our model by running some preliminary experimentspreliminary experiments, , and contrasting two different and contrasting two different sanction policiessanction policies

22 / 21 / 21 {emartinez, pasquier}@sfu.ca{emartinez, pasquier}@sfu.ca

Social Modelling

Pizza Delivery ExamplePizza Delivery Example

Role =⟨ Actions , SocCommitmentSchema⟩Ag =⟨RolesAg , Agenda , ProbAgaction⟩Org =⟨Roles , Agents , f assign : AgentsRoles⟩

PizzeriaPizzeria(Organization)(Organization)

DeliveryDeliveryCookCook

TechTech

CustomerCustomer


Actions & Exogenous Events

ActionsActions::● Performed by agentsPerformed by agents

● DiscreetDiscreet, , instant-basedinstant-based, , sequential model of timesequential model of time

● Duration time > 0Duration time > 0

⟨orderPizza x , 1⟩⟨cleanOvenx , 5⟩⟨repairOvenx , 30⟩⟨cookPizza x , 7⟩⟨deliverPizzax , 20⟩⟨ payOrder x , 1⟩

Exog. EventsExog. Events::● Not necessarily performed Not necessarily performed

by agentsby agents

● Periodicity > 0, max. Periodicity > 0, max. period within which the period within which the event will occur once event will occur once

⟨becomeHungryexogx , 5⟩

⟨makeOvenDirtyexog x , 100⟩⟨breakOvenexogx , 200 ⟩


Relationships Between Actions

Relationships inspired by TRelationships inspired by TÆÆMS' taxonomy MS' taxonomy [H[Höörling et al., 1999]rling et al., 1999]

breakOvenexogx disables cookPizza x

becomeHungry exog x enables orderPizza x makeOvenDirtyexog x hinders cookPizza x cleanKitchenVent x facilitates cookPizza x

orderPizza x enables cookPizza x cookPizza x enables deliverPizza x deliverPizza x enables payOrder xcleanOvenx disables cookPizza x repairOvenx disables cookPizza x


Social (Action) Commitments

● Oriented responsibilities Oriented responsibilities contracted by contracted by debtordebtor towards towards creditorcreditor

● Dynamics formalized as Dynamics formalized as a finite state machine a finite state machine (FSM) (FSM) [Pasquier et al., 2006][Pasquier et al., 2006]

● Commitments can be Commitments can be manipulated (state / manipulated (state / transitions)transitions)


Social Commitment Schema (SCS)

Role =⟨ Actions , SocCommitmentSchema⟩

action0SC debtor , creditor , action1, duration , Sanctionsdebtor , Sanctionscreditor

orderPizza x SC cook , delivery , cookPizza x , 8, Scok , Sdelivery

breakOvenexogx SC tech , cook , repairOvenx , 31, Stech , Scook

makeOvenDirtyexog x SC cook , tech , cleanOvenx , 6, Scook , Stech

Role Role CookCook


Pizza Delivery (SCS) Work-flow

becomeHungryexogx SC customer , cook , orderPizzax , 2, Scustomer , Scook

orderPizza x SC cook , delivery , cookPizza x , 8, Scook , Sdelivery

cookPizza x SC delivery , customer , deliverPizza x , 21, Sdelivery , Scustomer

deliverPizza x SC customer , delivery , payOrder x , 2, Scustomer , Sdelivery

breakOvenexogx SC tech , cook , repairOvenx , 31, Stech , Scook

makeOvenDirtyexogx SC cook , tech , cleanOvenx , 6, Scook , Stech

becomeHungry exogx orderPizza x ...

cookPizza x deliverPizzax ...payOrder x

Main work-flowMain work-flowcaptured by SCScaptured by SCS


Instantiated Soc. Commitments (ISC)

orderPizza x SC cook , delivery , cookPizza x , 8, Scook , Sdelivery

[ t inst , t inst duration]

ISC Yves :cook , Tom :delivery , deliverPizza α i , [13, 21] , {0, 0, 0}yves , {0}tom

breakOvenexogx SC tech , cook , repairOvenx , 31, S tech , Scook

ISC Lee : tech, Yves :cook , repairOvenα j , [18, 49] , {0, 0, 0}lee , {0}yves


Social Control Mechanisms

ISC Yves :cook , Tom :delivery , deliverPizza α i , [13, 21] , {0F ,−1C ,−1V }yves , {0C}tom

ISC Lee : tech , Yves :cook , repairOvenα j , [18, 49] , {1F ,−1C , −1V}lee , {−1C

}yves

● Sanction-based: Sanction-based: positivepositive & & negativenegative incentives, decided incentives, decided a prioria priori, static, , static, centralizedcentralized enforcement, applied at the enforcement, applied at the time of violation time of violation

● SanctionsSanctions are embedded into the life-cycle of are embedded into the life-cycle of social social commitmentscommitments, e.g.,, e.g.,

Sanctions Sanctions DebtorDebtorSanctions Sanctions CreditorCreditor


Sanction Policy

σ SC : T [−1, 1]

T set of transitions FSM

σSC t ={st=5V , st=2

CD , st=2CC , st=7

F }

● Determines what Determines what sanction gets associated sanction gets associated to what transitionto what transition


Constraint Between ISC● Constraints over ISC generated Constraints over ISC generated automaticallyautomatically from from

relationships between actions and time interval relationships between actions and time interval overlapping between ISCoverlapping between ISC

Hard constraintsHard constraints Soft constraintsSoft constraints

DisablingDisabling (w = 3)(w = 3) HinderingHindering (w = 1)(w = 1)

OverlappingOverlapping (w = 2.5)(w = 2.5) FacilitatingFacilitating (w = 1)(w = 1)

EnablingEnabling (w = 2)(w = 2)

ISC0 Yves :cook , Tom :delivery , deliverPizzaα i , [13, 21] , {0F , −1C , −1V}yves , {0C

}tom

ISC1Lee : tech , Yves :cook , repairOvenα j , [18, 49] , {1F , −1C , −1V}lee , {−1C

}yves

repairOvenx disables cookPizza x generates neg.constraint C−ISC1, ISC0


Time Overlap Constraint (ISC)

● Agent's Agent's llevel of activityevel of activity: # of : # of acceptedaccepted ISCs in its ISCs in its agenda at any given timeagenda at any given time

● Agents cannot do more than one thing at the timeAgents cannot do more than one thing at the time

ISC0 Yves :cook , Tom :delivery , deliverPizza α i , [13, 21] , {0F , −1C , −1V}yves , {0C

}tom

ISC1Yves :cook , Liz :delivery , deliverPizza α j , [18, 26] , {0F , −1C , −1V}yves , {0C

}liz

ISC0 « ISC1 time overlapping constraint


Coherence Degree

● ISCs can have ISCs can have weighted constraintsweighted constraints between them between them

● An agent will do An agent will do constraint optimizationconstraint optimization over the network over the network of ISCs (of ISCs (agendaagenda) it's involved in) it's involved in

● Coherence degreeCoherence degree: total weight of : total weight of satisfied constraintssatisfied constraints between ISC in agent's between ISC in agent's agendaagenda, divided by total weight of , divided by total weight of overall constraintsoverall constraints

CoherenceDegree Agenda = ∑ x , y∈Sat Agenda

Weight x , y / ∑x , y∈ConAgenda

Weight x , y


Expected Utility Function

G W ' = CoherenceDegree W ' − CoherenceDegree W − ResToChange x , T

where : ResToChange x , T ≡− σ SC T

● The expected utility for an agent to attempt to reach state The expected utility for an agent to attempt to reach state W' from state from state W (which (which only differsonly differs by the change of state by the change of state of a of a singlesingle ISC ISC x))

● For now, no probabilities. Decision making is myopic as For now, no probabilities. Decision making is myopic as agents only consider cancellation penaltiesagents only consider cancellation penalties

● Utility function can be improved by incorporating Utility function can be improved by incorporating uncertainty. E.g., considering probability of failure, uncertainty. E.g., considering probability of failure, rewards & penaltiesrewards & penalties

SanctionSanctionPolicyPolicy


Social Coherence

● In order to In order to maximizemaximize the the coherence degreecoherence degree of its agenda of its agenda (i.e., ISCs) an agent tries to do (i.e., ISCs) an agent tries to do constraint optimizationconstraint optimization

● Agent cycle:Agent cycle: 1. Calculate CoherenceDegree Agenda2. For each active ISC x do3. Calculate utility of flipping ISC x4. End For5. Return ISC x with higher utility gain if any

● Recursive local searchRecursive local search algorithm, algorithm, no backtrackingno backtracking, worst-case , worst-case complexity is polynomial:complexity is polynomial: O(mn2)

● n is the # of ICsis the # of ICs● m is the # of constraints between ICsis the # of constraints between ICs


SC-JSim Simulator


Experimental Setting

● Pizza delivery organization, with 4 agents:Pizza delivery organization, with 4 agents: 1 cook,, 2 delivery,, && 1 technician; plus severalplus several customers

● Simulation parameters: Simulation parameters: periodicityperiodicity & & sanction policysanction policy● Changed Changed periodicityperiodicity of event of event <<becomeHungrybecomeHungryexogexog(x)(x), , p>>, , withwith

p = 80, 40, 20, 10, 5, 2, 1 time steps time steps →→ increases frequency increases frequency of ordersof orders

● Two Two sanction policiessanction policies::

SPol0 Sdebtor = {0F , 0C , 0V}; Screditor = { 0C}

SPol1 Sdebtor = {0F , −1C , −1V }; Screditor = {−1C}

● Metric: Metric: overall overall % of ISCs fulfilled% of ISCs fulfilled ( (efficiencyefficiency))


Observations

Observation 3Observation 3. Under . Under SPolSPol11

the organization was the organization was more more efficientefficient than without any than without any sanctionssanctions (i.e., (i.e., SpolSpol

00). This is ). This is

because the sanction policy because the sanction policy acts as deterrence for easy acts as deterrence for easy cancellations.cancellations.

Observation 1Observation 1. Desirable . Desirable agent behaviour results from agent behaviour results from local local coherence coherence maximizationmaximization. Macro-level . Macro-level social coherencesocial coherence does does emerge from local emerge from local coherence maximizationcoherence maximization..


Observation 2Observation 2. The . The efficiencyefficiency of the organization of the organization degradeddegraded from nearly from nearly optimal as optimal as frequencyfrequency of of orders and agent's orders and agent's level of level of activityactivity was increased. was increased.

Future Work

● ModelModel (extensions): (extensions):● Introducing Introducing uncertainty reasoninguncertainty reasoning into the into the coherence coherence

calculuscalculus; reasoning about time and actions; reasoning about time and actions● Modelling agents with no knowledge, with partial knowledge, Modelling agents with no knowledge, with partial knowledge,

or with complete/shared knowledgeor with complete/shared knowledge● Machine learning mechanisms would allow agents to Machine learning mechanisms would allow agents to

progressively learn these probabilitiesprogressively learn these probabilities

● ExperimentsExperiments::● Impact of different organizational structures (e.g., Impact of different organizational structures (e.g.,

hierarchies, holarchies, societies, federations)hierarchies, holarchies, societies, federations)● Investigate other sanction policiesInvestigate other sanction policies


Acknowledgements

● National Sciences & Engineering Research National Sciences & Engineering Research Council of Canada (NSERC)Council of Canada (NSERC)

● Marek Hatala (SIAT, SFU)Marek Hatala (SIAT, SFU)● Anonymous ReviewersAnonymous Reviewers


Technology

Towards a Model of Social Coherence in Multi-Agent Organizations