Characterizing distribution rules for cost sharing games

Download Report

Transcript Characterizing distribution rules for cost sharing games

Characterizing distribution rules for cost sharing games
Raga Gopalakrishnan
Caltech
Joint work with Jason R. Marden & Adam Wierman
Cost sharing games:
Self-interested agents make decisions, and
share the incurred cost among themselves.
Key Question: How should the cost be shared?
Lots of examples:
Network formation games
Facility location games
Profit sharing games
Cost sharing games:
Self-interested agents make decisions, and
share the incurred cost among themselves.
Key Question: How should the cost be shared?
Lots of examples:
Network formation games
Facility location games
Profit sharing games
D1
S1
S2
D2
Cost sharing games:
Self-interested agents make decisions, and
share the incurred cost among themselves.
Key Question: How should the cost be shared?
Lots of examples:
Network formation games
Facility location games
Profit sharing games
Cost sharing games:
Self-interested agents make decisions, and
share the incurred cost among themselves.
Key Question: How should the cost be shared?
Lots of examples:
Network formation games
Facility location games
Profit sharing games
Cost sharing games:
Self-interested agents make decisions, and
share the incurred cost among themselves.
Key Question: How should the cost be shared?
Lots of examples:
Network formation games
[Jackson 2003][Anshelevich et al. 2004]
Facility location games
[Goemans et al. 2000] [Chekuri et al. 2006]
Profit sharing games
Huge literature
in Economics
Growing
literature in CS
[Kalai et al. 1982] [Ju et al. 2003]
New application: Designing for distributed control
[Gopalakrishnan et al. 2011][Ozdaglar et al. 2009][Alpcan et al. 2009]
Cost sharing games (more formally):
set of resources
𝐺 = ( 𝑁, 𝑅, π’œπ‘–
set of agents/players
utility function
of agent 𝑖
𝒰𝑖 (π‘Ž1 , π‘Ž2 , … , π‘Žπ‘› )
welfare function
𝒲(π‘Ž1 , π‘Ž2 , … , π‘Žπ‘› )
π‘–βˆˆπ‘ , 𝒲,
𝒰𝑖
π‘–βˆˆπ‘
action set of agent 𝑖
π’œπ‘– βŠ† 2𝑅
)
D1
S1
Example:
S2
D2
Cost sharing games (more formally):
set of resources
𝐺 = ( 𝑁, 𝑅, π’œπ‘–
set of agents/players
utility function
of agent 𝑖
𝒰𝑖 (π‘Ž1 , π‘Ž2 , … , π‘Žπ‘› )
welfare function
𝒲(π‘Ž1 , π‘Ž2 , … , π‘Žπ‘› )
π‘–βˆˆπ‘ , 𝒲,
𝒰𝑖
π‘–βˆˆπ‘
)
action set of agent 𝑖
π’œπ‘– βŠ† 2𝑅
Assumption: 𝒲 is separable across resources
𝒲 π‘Ž =
π’²π‘Ÿ ({π‘Ž}π‘Ÿ )
π‘ŸβˆˆR
set of agents choosing
resource π‘Ÿ in allocation π‘Ž
Cost sharing games (more formally):
set of resources
welfare function
at resource π‘Ÿ
π’²π‘Ÿ : 2𝑁 β†’ ℝ
𝐺 = ( 𝑁, 𝑅, π’œπ‘–
set of agents/players
π‘–βˆˆπ‘ ,
π’²π‘Ÿ
utility function
of agent 𝑖
𝒰𝑖 (π‘Ž1 , π‘Ž2 , … , π‘Žπ‘› )
π‘Ÿβˆˆπ‘… ,
𝒰𝑖
π‘–βˆˆπ‘
action set of agent 𝑖
π’œπ‘– βŠ† 2𝑅
Assumption: π’²π‘Ÿ is scalable
π’²π‘Ÿ β‹… = π‘£π‘Ÿ π‘Š β‹…
π‘£π‘Ÿ ∈ ℝ++
common base
welfare function
)
Cost sharing games (more formally):
set of resources
resource-specific
coefficients
𝐺 = ( 𝑁, 𝑅, π’œπ‘–
set of agents/players
π‘–βˆˆπ‘ ,
π‘£π‘Ÿ
utility function
of agent 𝑖
𝒰𝑖 (π‘Ž1 , π‘Ž2 , … , π‘Žπ‘› )
π‘Ÿβˆˆπ‘… , π‘Š,
action set of agent 𝑖
π’œπ‘– βŠ† 2𝑅
𝒰𝑖
π‘–βˆˆπ‘
welfare
function
)
Cost sharing games (more formally):
resource-specific
coefficients
set of resources
𝐺 = ( 𝑁, 𝑅, π’œπ‘–
set of agents/players
π‘–βˆˆπ‘ ,
π‘£π‘Ÿ
utility function
of agent 𝑖
𝒰𝑖 (π‘Ž1 , π‘Ž2 , … , π‘Žπ‘› )
π‘Ÿβˆˆπ‘… , π‘Š,
action set of agent 𝑖
π’œπ‘– βŠ† 2𝑅
𝒰𝑖
π‘–βˆˆπ‘
)
welfare
function
Assumption: Utility functions are also separable/scalable
𝒰𝑖 π‘Ž =
π‘£π‘Ÿ 𝑓(𝑖, π‘Ž π‘Ÿ )
π‘Ÿβˆˆπ‘Žπ‘–
common base distribution rule
(portion of welfare at π‘Ÿ to agent 𝑖)
Cost sharing games (more formally):
set of resources
resource-specific
coefficients
𝐺 = ( 𝑁, 𝑅, π’œπ‘–
set of agents/players
π‘–βˆˆπ‘ ,
π‘£π‘Ÿ
distribution
rule
π‘Ÿβˆˆπ‘… , π‘Š, 𝑓
action set of agent 𝑖
π’œπ‘– βŠ† 2𝑅
Goal: Design the distribution rule 𝑓
)
welfare
function
Requirements on the distribution rule
The distribution rule should be:
(i) Budget-balanced
(ii) β€œStable” and/or β€œFair”
(iii) β€œEfficient”
Requirements on the distribution rule
The distribution rule should be:
(i) Budget-balanced
(ii) β€œStable” and/or β€œFair”
(iii) β€œEfficient”
𝑓 𝑖, 𝑆 = π‘Š(𝑆)
π‘–βˆˆπ‘†
Requirements on the distribution rule
The distribution rule should be:
(i) Budget-balanced
(ii) β€œStable” and/or β€œFair”
(iii) β€œEfficient”
Lots of work on characterizing
β€œstability” and β€œfairness”
Core
[Gillies 1959]
[Devanur et al. 2003]
[Chander et al. 2006]
Nash
equilibrium
[von Neumann et al. 1944] [Nash 1951]
[Moulin 1992]
[Albers et al. 2006]
Requirements on the distribution rule
The distribution rule should be:
(i) Budget-balanced
(ii) β€œStable” and/or β€œFair”
(iii) β€œEfficient”
Lots of work on characterizing
β€œstability” and β€œfairness”
Core
[Gillies 1959]
[Devanur et al. 2003]
[Chander et al. 2006]
Nash
equilibrium
[von Neumann et al. 1944] [Nash 1951]
[Moulin 1992]
[Albers et al. 2006]
Requirements on the distribution rule
The distribution rule should be:
(i) Budget-balanced
(ii) β€œStable” and/or β€œFair”
(iii) β€œEfficient”
Has good Price of Anarchy and
Price of Stability properties
The Shapley value [Shapley 1953]
A player’s share of the welfare should depend on their
β€œaverage” marginal contribution
|𝑇| !
𝑓 𝑖, 𝑆 =
π‘‡βŠ†π‘†\{𝑖}
𝑆 βˆ’ 𝑇 βˆ’1 !
π‘Š 𝑇βˆͺ 𝑖
|𝑆| !
βˆ’ π‘Š(𝑇)
Example: If players are homogeneous, 𝑓 𝑖, 𝑆 = π‘Š(𝑆)/|𝑆|
Note: There is also a weighted Shapley value
Players are assigned β€˜weights’ πœ”π‘–
Properties of the Shapley value
+ Guaranteed to be in the core for β€œbalanced” games
[Shapley 1967]
+ Results in a potential game
[Ui 2000]
+ Guarantees the existence of a Nash equilibrium
- Often intractable to compute
approximations are
often tractable
[Conitzer et al. 2004]
- Not β€œefficient” in terms of social welfare
e.g. Price of Anarchy/Stability β‰₯ 2
[Marden et al. 2011]
[Castro et al. 2009]
Research question:
Are there distribution rules besides the (weighted) Shapley
value that always guarantee a Nash equilibrium?
If so: can designs be more efficient and/or more tractable?
If not: we can optimize over πœ”π‘– to determine the best design!
Research question:
Are there distribution rules besides the (weighted) Shapley
value that always guarantee a Nash equilibrium?
Our (surprising) answer:
NO, for any submodular welfare function.
β€œdecreasing marginal returns”
natural way to model
many real-world problems
The inspiration for our work
Theorem (Chen, Roughgarden, Valiant):
There exists a welfare function π‘Š, for which no
distribution rules other than the weighted Shapley
value guarantee a Nash equilibrium in all games.
[Chen et al. 2010]
Our result
A game is specified by (𝑁, 𝑅, π’œπ‘–
Theorem:
For any submodular welfare function π‘Š , no
distribution rules other than the weighted Shapley
value guarantee a Nash equilibrium in all games.
π‘–βˆˆπ‘ ,
πœˆπ‘Ÿ
π‘Ÿβˆˆπ‘… , π‘Š, 𝑓)
The inspiration for our work
Theorem (Chen, Roughgarden, Valiant):
Given 𝑁, 𝑓, all games (𝑁, βˆ’, βˆ’, βˆ’, βˆ’, 𝑓) posses a
Nash equilibrium if and only if 𝑓 is a weighted
Shapley value.
[Chen et al. 2010]
Our result
Theorem:
Given 𝑁, 𝑓, and any submodular π‘Š, all games
(𝑁, βˆ’, βˆ’, βˆ’, π‘Š, 𝑓) posses a Nash equilibrium if and
only if 𝑓 is a weighted Shapley value.
Our result
Theorem:
For any submodular welfare function π‘Š , no
distribution rules other than the weighted Shapley
value guarantee a Nash equilibrium in all games.
Consequences
 Can obtain the best distribution rule by optimizing the
player weights, πœ”π‘–
 Can always work within a potential game
β€’ Small, well-defined class of games
β€’ Several learning algorithms for Nash equilibrium
 Fundamental limits on tractability and efficiency
Proof Sketch
First step: Represent π‘Š using a linear basis
– Define a 𝑇-welfare function: π‘Š 𝑇
1
π‘‡βŠ†π‘†
𝑆 =
0 otherwise
– Given any π‘Š, there exists a set 𝒯 βŠ† 2𝑁 , and a sequence
𝒬 = {π‘žπ‘‡ } π‘‡βˆˆπ’― of weights indexed by 𝒯, such that:
π‘žπ‘‡ π‘Š 𝑇
π‘Šβ‰”
π‘‡βˆˆπ’―
β€œcontributing
coalition”
β€œmagnitude of
contribution”
Proof technique: Establish a series of necessary conditions on 𝑓
Proof Sketch (A single T-Welfare Function)
Proof technique: Establish a series necessary conditions on 𝑓
𝑇 is not formed in 𝑆
What is required
of 𝑓 𝑖, 𝑆 ?
𝑇 is formed in 𝑆
Don’t allocate welfare
to any player
Allocate welfare only
to players in 𝑇,
independent of others
⇓
⇓
𝑓 is completely specified by 𝑓(𝑖, 𝑇)
πœ”π‘– =
𝑓(𝑖, 𝑇)
π‘–βˆˆπ‘‡
π‘žπ‘‡
arbitrary 𝑖 βˆ‰ 𝑇
𝑓 is a weighted Shapley value
Proof Sketch (General Welfare Functions)
Proof technique: Establish a series necessary conditions on 𝑓
What is required
of 𝑓 𝑖, 𝑆 ?
no coalition from
𝒯 is formed in 𝑆
Don’t allocate welfare
to any player
a coalition from
𝒯 is formed in 𝑆
Allocate welfare only
to players in these
formed coalitions,
independent of others
⇓
𝑓 𝑇 is the basis
weighted Shapley
value corresponding
to π‘Š 𝑇 , with weights πŽπ‘‡
π‘žπ‘‡ 𝑓 𝑇
𝑓≔
π‘‡βˆˆπ’―
Key challenge: Each basis 𝑓 𝑇 might use different πŽπ‘‡ !
Proof Sketch (General Welfare Functions)
Proof technique: Establish a series necessary conditions on 𝑓
What is required
of 𝑓 𝑖, 𝑆 ?
no coalition from
𝒯 is formed in 𝑆
Don’t allocate welfare
to any player
a coalition from
𝒯 is formed in 𝑆
Allocate welfare only
to players in these
formed coalitions,
independent of others
⇓
Weights of common players
in any two coalitions must
be linearly dependent
⇓
𝑓 is a weighted
Shapley value
π‘Š is submodular
π‘žπ‘‡ 𝑓 𝑇
𝑓≔
π‘‡βˆˆπ’―
Cost Sharing Games
Research question:
Are there distribution rules besides the (weighted) Shapley
value that always guarantee a Nash equilibrium?
Our answer:
NO, for any submodular welfare function.
what about for other welfare functions?
Understand what causes this fundamental
restriction – perhaps some structure of action sets?
Characterizing distribution rules for cost sharing games
Raga Gopalakrishnan
Caltech
Joint work with Jason R. Marden & Adam Wierman
References
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
[von Neumann et al. 1944]
[Nash 1951]
[Shapley 1953]
[Gillies 1959]
[Shapley 1967]
[Kalai et al. 1982]
[Moulin 1992]
[Goemans et al. 2000]
[Ui 2000]
[Devanur et al. 2003]
[Jackson 2003]
[Ju et al. 2003]
[Anshelevich et al. 2004]
[Conitzer et al. 2004]
[Albers et al. 2006]
[Chander et al. 2006]
[Chekuri et al. 2006]
[Alpcan et al. 2009]
[Ozdaglar et al. 2009]
[Chen et al. 2010]
[Gopalakrishnan et al. 2011]
[Marden et al. 2011]