bachrachb08.ppt

Download Report

Transcript bachrachb08.ppt

Distributed Multiagent Resource
Allocation In Diminishing
Marginal Return Domains
Yoram Bachrach(Hebew University)
Jeffrey S. Rosenschein (Hebrew University)
Outline

Multiagent Resource Allocation (MARA)

General problem
 Applications

Centralized and decentralized mechanisms


Specific restricted domain


VCG solution in restricted domain
Allocation by interaction


Selfish behavior challenge
Market motivation behind method
Allocation protocol and suggested strategies

Convergence to optimal allocation
 Strategic and selfish behaviour
 Expected time to convergence

Conclusions and future research
Multiagent Resource Allocation

Allocating resources to users
 Scarce
resources
 Selfish agents with private information


Both users and resource owners
An allocation maps resources to users
MARA Applications
Industrial procurement
 Satellite resources
 Tasks in manufacturing systems
 Grid computing
 RF spectrum and coverage
…

MARA Domain Properties

Divisible / Indivisible


Sharable / Non-Sharable


Can a resource be allocated to several agents simultaneously?
Single-Unit / Multi-Unit


Can parts of a single resource be allocated to several agents?
Are there bundles of identical resources?
Transferable / Non Transferable Utility

Can agents compensate by transferring utility among them?
MARA Approaches

Attempt to maximize social welfare



Centralized mechanisms


A central mechanism gets the agents’ preferences and chooses
an outcome
Decentralized approaches


Other possible goals – Maximin, fairness, …
There may be more than one optimal allocation
Agents actively participate in choosing the outcome
Problem – agents are selfish and attempt to maximize
their own utility
Centralized Mechanisms

The mechanism must elicit the agents’ private information
about allocations


We are interested in incentive compatible mechanisms



But agents may manipulate to increase their own utility
Agents reply truthfully, under a certain rational behavior
Rational behavior captured in a game theoretic solution concept
Vickery-Clarke-Groves (VCG) approach


Tax agents to make truth telling is a dominant strategy
Strategyproof, allocatively efficient but only weakly budget balanced
Distributed Mechanisms

Central mechanisms may not be
appropriate in distributed environments
 Hard
to establish a trusted central authority
 Scalability concerns – the central mechanism
may be a performance bottleneck

Have agents interact among themselves to
choose the allocation
 Need
to define the protocol for interaction
 Selfish agents may still manipulate
Specific Domain

Set of identical agents


Set of resources



Each agent only requires a single resource, and does not benefit
from being allocated more than one resource
Cannot be divided among agents
Can be shared among agents
Diminishing marginal production

The total utility of the agents who are allocated a certain
resource drops as more agents use that resource
Diminishing Marginal Return
10
14
15
5
10
7
5
7
5
Diminishing Marginal Return
Total production is 10
10
14
15
5
10
7
5
7
5
Diminishing Marginal Return
Total production increases to 14
10
14
15
5
10
7
5
7
5
Diminishing Marginal Return
Total production increased by 4 when adding a single agent
Marginal production of 4
10
14
15
5
10
7
5
7
5
Diminishing Marginal Return
Total production increased by 1 when adding a single agent
Marginal production of 1
10
14
15
5
10
7
5
7
5
What needs to be decided?

A mechanism must decide:
 An

allocation – which agent gets which resource
We want to maximize the social welfare – total production
 Utility

Agents gain utility due to the allocation


Resource owners receive nothing
Resource owners hold the private information



transfers
Eliciting this information requires incentivizing the resource
owners to report their production function
Requires giving resource owners some of the utility
We assume the total production across all the
resources can be redistributed in any way
VCG in Restricted Domain

Easy to compute an optimal allocation
 Resources
report total production functions
 Find maximal social welfare by a greedy algorithm


Assign to the resource with maximal marginal production
Induce truthfullness by VCG tax
 Requires


establishing a trusted central authority
Trust and security issues, central bottleneck, …
Weakly budget balanced – some of the total production is
kept in the mechanism and not distributed
Allocation by Interaction

Define a protocol for interaction between agents and
resource owners


Interaction proceeds in discrete time rounds



Simulate a market for services
Each round determines both an allocation and transfers
Design protocol and suggest interaction strategies
so that the optimal allocation is always reached
Challenges


Achieve the optimal allocation despite selfishness
Make sure the optimal allocation is reached quickly
Interaction Protocol
R1
Currently on R1,
getting utility 5
R2
Round Payment (5)
R3
Interaction Protocol
R1
Currently on R1,
getting utility 5
R2
Resource Request
R3
Interaction Protocol
R1
R2
Payment Bid (10)
R3
Interaction Protocol
Switch to R2 with
utility 10
R1
R2
Accept
R3
Interaction Protocol
Stay on R1, with
utility 5
R1
R2
Decline
R3
Interaction Protocol
R1
Currently on R2
with utility 10
R2
Round Payment 10
R3
Interaction Protocol
R1
Currently on R2
with utility 5
R2
Payment Change (5)
The Resource Owner’s
Perspective
Production – 12
Payments – 10
Utility – 2
12
Production – 13
Payments – 12
Utility – 1
13
4
5
4
5
4
Chosen Allocation

The interaction decides both the allocation and
redistribution of the utility





Agents are allocated the last resource whose bid they accepted
Agents get the utility as in the last payment bid they accepted
Resource owners keep the reminder of the production on the
resource not redistributed to the agents
The allocation may change at the end of every round
An allocation is stable if once reached it never changes

Depends on the strategies of the participants

Agents and resource owners
Suggested Strategy - Agents

Each round, randomly choose a resource
and request using the resource
 If
the bid in that resource is better than the
current bid, switch to that resource (accept)
 If the bid is lower than the current resource
offers, stay with current resource
Suggested Strategy –
Resource Owners



Keep the agents’ share of the utility in the level
of the marginal production on the resource
On round start, offer all the agents allocated to
the resource the current last marginal production
Answer resource requests with bid of the next
marginal production on the resource
 If
accepted, set the bid for all the agents to the new
marginal production by a Payment Change message
 If declined – do nothing
Resource Owners - Example
MP = 4
10
MP = 1
14
15
1
10
4
1
4
1
Protocol Stable Allocation

Given a set of strategies for the agents and resource owners, a
protocol stable allocation is one that, once reached, never changes


Under these strategies, no interaction results in an agent switching to a
different resource
Protocol stable under the suggested strategies

No agent is ever given a bid higher than what he is currently getting on
his current resource
 Resource owners bid the next marginal production
 There is no resource where the next marginal production is greater than
the current marginal production on other resources

Similar to greedily allocating agents to resources according to marginal production
Convergence to Optimum

Under the suggested strategies, the chosen allocation
always converges to the optimal allocation

Monotonic improvement


Stability in optimum


The optimal allocation is protocol stable
No “local” optima – protocol stable is optimal


If an agent switches resources, the social welfare increases
If a non optimal allocation is chosen, there is a possible round
where an agent switches resources
What about strategic behavior?
Strategic Behavior

Agents and resource owners have to follow the
protocol, but not the suggested strategies
 Might
obtain higher utility by choosing a different
strategy


Agents may accept a bid lower than what they currently have
Resource owners may suggest a bid different than the
current marginal production



Higher, to attract more agents
Lower, to give a lower share of utility to the agents
Is such strategic behavior rational for self
interested agents?
Strategic Agents (Our domain)

If an agent gained from strategic behavior,
we still reach an optimal allocation
 If
a single agent has deviated from the
suggested strategy and gained utility

Gained utility: a protocol stable allocation has been
reached, in which the agent gets a higher utility
 Then
the reached protocol stable allocation is
also optimal
Strategic Resource Owners

Resource owners who set too high a bid


Attract more agents but pay more and lose utility
Resource owners who set too low a bid

Pay less, but lose agents to competing resources



who offer higher bids
When the domain is competitive for resource owners, such a
manipulation is irrational
Highly competitive settings

Condition that occurs mostly in environments where there are
many resources with similar marginal production values

Similar resources or slight changes in marginal production
Strategic Resource Owners

In our specific domain
 Diminishing
marginal return
 Highly competitive for resource owners

If a resource owner gained from strategic
behavior, we still reach an optimal allocation
 If
a single resource owner has deviated from the
suggested strategy and gained utility

Gained utility: a protocol stable allocation has been reached,
in which the resource owner gets a higher utility
 Then
the reached protocol stable allocation is optimal
Convergence Time

When agents and resource owners behave
rationally, we converge to an optimal allocation
 But

how quickly is the optimal allocation reached?
Under the suggested strategies
 Expected
time to convergence:
 Bound on convergence time:

Quick polynomial convergence
Related Work

TFG-MARA survey


Distributed mechanism design approaches


J. Feigenbaum and S. Shenker. Distributed algorithmic mechanism design: Recent results and
future directions.
Scheduling domains


Y. Chevaleyre, P. E. Dunne, U. Endriss, J. Lang, M. Lemaître, N. Maudet, J. Padget, S. Phelps, J. A.
Rodríguez-Aguilar, and P. Sousa. Issues in Multiagent Resource Allocation.
B. Heydenreich, R. Muller, and M. Uetz. Decentralization and mechanism design for online machine
scheduling.
Negotiations over resources


U. Endriss, N. Maudet, F. Sadri, and F. Toni. Negotiating socially optimal allocations of resources.
T. W. Sandholm. Contract types for satisficing task allocation.
Conclusions

A distributed approach to resource allocation in
a specific domain
 Achieves
optimal allocation (maximal social welfare)
 No central authority required
 All utility divided among agents and resource owners

“Strongly budget balanced”
 Quick

convergence
Can a similar approach be applied to other
domains (or more general domains)?