行为运筹学和格式运筹学简介

Transcript 行为运筹学和格式运筹学简介

数学建模讲座（太原，2013年5月）
行为运筹学建模
Modeling with Behavioral
Operations Research (BOR)
清华大学数学科学系
办公室：理科楼A-202#
谢金星
电话：62787812
E-mail: [email protected]
http: //faculty.math.tsinghua.edu.cn/~jxie
提
纲
• 什么是行为运筹学/运作管理？
– 传统OR/OM：存在的主要问题
– BOR/BOM的兴起
• BOR/BOM：几个例子
– 公平模型
– 报童模型
– 认知层次模型
• BOR/BOM：知识要点及研究方法
运筹学（OR）与运作管理（OM）
运筹学(OR-Operations Research)：
应用数学方法研究经济、民政和国防等部门在
内外环境的约束下合理分配人力、物力、财力等资
源，使实际系统有效运行的技术科学。
（中国大百科全书）
运作管理(OM-Operations Management)：
对生产和服务系统进行设计、运行和维护的管
理，属于管理范畴。
存在的主要问题
初衷：
运筹学作为理论工具解决运作管理中的实际问题
尴尬：
有多少真正成功的应用？（成功的极少、
不成功的很多）
学员（如MBA）怀疑
企业管理者不信任
公式无用论不绝于耳
OR迟早要死掉
……
存在的主要问题
反思：
问题到底出在哪儿!?
实践者的素质太低?
所有的定量分析工具先天性地实用价值差?
……
与其他学科对比：
为什么定量分析方法在计算机系统、通信网络系
统中能得到坚实的应用!?
为什么优化算法在芯片制造中可发挥巨大作用!?
还有，在自动仓储系统中堆垛机的优化运行、柔
性制造系统的调度与控制也很成功!?
……
存在的主要问题
系统分类
机械系统：
系统中的所有元素无主观能动性，系统的行为
是完全理性的。
含人系统：
系统中包含有人，系统的行为可以是完全理性、
有限理性、非理性。
主要特征：有限理性
存在的主要问题
对于不含人的“机械”系统，系统的行为是“完全
理性”，经典的运筹学对这类系统进行了令人满意
的理论分析
但对于含人的系统，人的行为的最基本特征是“有
限理性”，（如损失厌恶、参照依赖、现状偏见、
不等值贴现、锚定心理、从众现象等），导致人们
在行为上并不总是追求“效用最大”，而是会根据
对环境的认知和自己有限的思维，做出“让自己满
意的选择”
存在的主要问题
对于以人为中心的系统，由于人并不等同于机械，这就导致了
经典的运筹学不能被直接应用。
等待成本
W
平均等待时间
平均队长 L
例如：排队模型，运筹学的经典分支，可用于研究
1、生产系统、通信系统、计算机系统、交通系统
2、以及服务系统（银行、邮局、医疗、公交等）
但对系统的规划应有很大的差别
0.2
0.4
0.6
利用率
0.8
1
临界点
等待时间
存在的主要问题
对于以人为中心的系统，由于人并不等同于机械，这就导致了
经典的运筹学不能被直接应用。
例如：对策论，运筹学的经典分支，可用于研究
经济、营销、金融、生产等
基本假设：参与博弈的各方的行为是理性的
现实情况：通常参与博弈的各方的行为并不是理性的
结果：产生了行为经济学、行为营销学、行为金融学等新领域
存在的主要问题
对于以人为中心的系统，由于人并不等同于机械，这就导致了
经典的运筹学不能被直接应用。
例如：库存论，运筹学的经典分支，可用于研究
生产系统、以及服务系统等
目标：最小化成本（订货成本、持货成本、缺货惩罚成本）
决策：最优订货策略
现实情况：1、决策者对持货成本和缺货惩罚成本并不同等看待
2、缺货等待的顾客通常不会有长久耐心
3、有国外学者实证研究发现几乎没有管理者按照最优
订货策略订货
存在的主要问题
对于以人为中心的系统，由于人并不等同于机械，这就导致了
经典的运筹学不能被直接应用。
例如：供应链管理，运筹学的经典分支，可用于研究
采购、合同、协调机制等
方法：以VMI（供应商管理库存）为例，有收益返还、批发价格、
回购买等协调机制
理论上可以达到双赢目标
现实情况：有国外学者实证研究发现几乎没有哪家企业对实施VMI
感到满意，而且通常是越获利的一方越不满意，总抱
怨对方获得了更多的利益
存在的主要问题
例如：三门问题（蒙提霍尔问题、蒙提霍尔悖论）
出自美国的电视游戏节目Let’s Make a Deal，该
节目的主持人为蒙提·霍尔（Monty Hall）。参赛
者会看见三扇关闭了的门，其中一扇的后面有一辆
汽车，选中后面有车的那扇门可赢得该汽车，另外
两扇门后面则各藏有一只山羊。当参赛者选定了一
扇门，但未去开启它的时候，节目主持人开启剩下
两扇门的其中一扇，露出其中一只山羊。主持人其
后会问参赛者要不要换另一扇仍然关上的门？
隐含的基本假设：
主持人知道门的“秘密”，总是打开“山羊门”
存在的主要问题
两类建模思路：
规范方法(Normative)
实证方法(Positive)
决策者应该如何做---不管他实际如何做
（他可能犯错误）
决策者实际如何做---不管他应该如何做
（他可能犯错误，但
对他所犯的错误也要
解释、预测和控制）
行为运筹学与行为运作管理
BOR(BOM) = OR/OM + 行为科学：研究受到个体、社会
和文化影响下的人的行为和认知规律对运作系统和运作
过程的影响。
行为科学：
美国管理百科全书的定义：“行为科学是运用自然科学的实
验和观察方法，研究自然和社会环境中人的行为的科学，已经确
认的学科包括心理学、社会学、社会人类学和其它学科类似的观
点和方法。”
狭义行为科学的研究通过研究生产和工作环境中人的行为，
旨在提高生产和工作效率。（组织行为学）
行为运筹学与行为运作管理
What’s about BOR?
• Behavioral operations research (BOR)
examines the behavior of actual human
agents in complex decision problems.
BOR is the operations management
analog of experimental economics and
behavioral finance, and is part of the
field known as management science.
行为运筹学与行为运作管理
• M. F. Shakun, Management Science and Management:
Implementing Management Science via Situational
Normativism, Management Science, Vol. 18, No. 8,
(Apr. 1972), pp. B367-B377
Specifically, training in "behavioral operations
research" is recommended where management scientists
study the behavioral sciences and learn how to use this
knowledge in MS work. This implies: (1) knowing how
to incorporate behavioral variables and knowledge into
MS models as needed and (2) explicit consideration of
change and implementation as an integral part of model
building.
行为运筹学与行为运作管理
服务系统与机械系统的本质不同：服务系统含人
人的行为：有限理性、社会偏好、文化冲突、…
在国外，行为运筹学/运作管理（BOR/OM）已成为OR
（OM）中最活跃的热点之一
JOM（2006）
出版了专辑，
共刊登了9篇
专题论文，
分别介绍行
为运作管理
中的实证研
究工作
MSOM（2008）
出版了专辑，
共刊登了7篇专
题论文，分别
介绍行为运筹
学与行为运作
管理中不同分
支的研究工作
行为运筹学与行为运作管理
国家自然科学基金委员会“双清论坛”
2009年8月22日~23日，中国科大
主题：行为运作管理
行为运筹学与行为运作管理
International Workshop on
Behavioral OR/OM
Tsinghua University
(Department of Industrial Engineering
And Department of Mathematical Sciences)
时间：2009-2011每年12月18日前后
地点：清华大学
2012年12月：西南交通大学
2013年12月：大连理工大学
提
纲
• 什么是行为运筹学/运作管理？
– 传统OR/OM：存在的主要问题
– BOR/BOM的兴起
• BOR/BOM：几个例子
– 公平性模型
– 报童模型
– 认知层次模型
• BOR/BOM：知识要点及研究方法
不患寡而患不均
（“不平则鸣”）
背景与问题
背景
最后通牒博弈(Ultimatum Game)
• 甲乙两人就分配１笔钱（如100元）进行博弈.
• 甲首先提出分配方案 (分给乙的钱: s).
• 如果乙接受，则按此分配；否则双方什么也得不到.
• 完全信息动态博弈：
均衡结果是(s=0,乙接受);
如果要求严格均衡，则s=１分钱.
• 现实中的情况果真如此吗？
实验
China
结果
背景与问题
• 现实(实验)中的情况:
•多数s＝总额的40~50%
•s越小，越容易被乙拒绝
自私：
理性／
非理性？
公平：
利他／
互惠？
“不患寡而患不均”！
模型假设与建立
财富总额为1
接受提议：甲乙所得x1=1-s, x2= s；否则：x1=x2=0
1. 每个参与者都喜欢对所有参与者公平的结果；
2. 每个参与者自己受到不公平对待时的“愤怒”，
胜过其他参与者受到不公平对待时的“愧疚”．
效用函数 U i ( x1 , x2 )  xi   i max{x j  xi ,0}  i max{xi  x j ,0}
 i  i  0
i  1,2, j  3  i
i  1 / 2
否则，xi>xj=1-xi时，
Ｕi(x)＝xi -βi (xi -xj)= βi -(2βi -1)xi
关于xi的系数非正（过分“愧疚” ）
模型求解
Ui ( x1, x2 )  xi  i max{x j  xi ,0}  i max{xi  x j ,0}
乙的最优反应（给定s）
如果不接受，则x1=x2=0; U1(s)=U2(s)=0 .
如果接受，则x1=1-s, x2=s.
2  1/ 2
• 若s≥1/2,则x2≥ x1
U 2 (s)  s   2 (2s  1) ≥1/2≥0
• 若s≤1/2,则x2≤x1
U 2 (s)  s   2 (1  2s)  (1  22 )s   2
U2(s)≥0
(s＝1/2, 两者一致)
乙的最优反应
s ≥ s( 2 )   2 /(1  2 2 )
易知
0  s( 2 )  1/ 2
当 s ≥ s(2 ) 接受; 否则，不接受
模型求解：甲的决策(只需考虑乙接受情形)
Case 1: 甲知道乙的α2
• 若s≥1/2,则x2≥ x1
U1 (s)  1  s  1 (2s  1)
s=1/2时达到最大值1/2
• 若s≤1/2,则x2≤ x1
但 s ≥ s(2 )
甲的决策
U1 (s)  1  s  1 (1  2s)  1  1  (21 1)s
1  1 / 2
s  s (2 )  2 /(1  22 )
均衡: (s*,接受)
*
s*严格小于50%;
是乙的“愤怒”系数α2的增函数.
模型求解：甲的决策
Case 2: 甲不知道乙的α2,
但知道α2的分布F(α2)
• 若s≥1/2,则x2≥ x1
  max{ F ( )  0}
  min{ F ( )  1}
U1(s)=1-s-α1(2s-1) 同前
• 若s≤1/2,则x2≤ x1
乙接受概率
期望效用
甲的决策
s  s ( )
0,

p   F ( s /(1  2 s )), s ( )  s  s ( )
1,
s  s ( )

s  s ( )
0,

EU1 ( s )  [1  1  (21  1) s ]F ( s /(1  2s )), s ( )  s  s ( )
1    (2  1) s,
s  s ( )
1
1

max [1  1  (21  1) s]F ( s /(1  2s))
s ( ) s  s ( )
s*
模型解释
• 甲永远不会提出大于１/２的方案s
• 乙拒绝过小的方案s
• 乙接受概率随s增加不减
很好地解释了实际中的最后通牒博弈.
主要参考文献
公平行为的其他模型
• 公平理论，又称社会比较理论:“按劳分配”
– J. S. Adams (1965). Inequity in social exchange: Advances in
Experimental Social Psychology, Vol. 2, 267–299
– 劳动(投入)包括数量、质量等；分配的对象可以是有形的，如工
资、利润等，也可以是无形的，如表扬、赞赏等；
– 基本模型：报酬/投入
–
–
–
–
OP/IP = OC/IC （或 OH/IH ）
OP——对自己报酬的感觉
IP——对自己投入的感觉
OC （OH）——对他人（或本人从前）报酬的感觉
IC （IH）——对他人（或本人从前）投入的感觉
– 局限性：
–
–
–
–
公平与个人的主观判断有关，与个人所持的评判标准相关；
绩效难以评定；不完全信息往往使“比较”脱离客观实际；
人们的公平感还取决于自己的期望值和对组织的“认同感”；
…
公平行为的其他模型
• Rabin (1993,2002)：基于友好意图的互惠模型
ai represents player i's strategy, bj
represents player i's belief about what
player j's strategy will be, and ci
represents what player i's beliefs about
player j's beliefs about player i's strategy.
公平行为的其他模型
• 其他公平模型
• Kim & Mauborgne (1997)：fair process;
• Bolton & Ockenfels (2000)：基于比较的公平-互惠-竞
争模型
• 纵向公平性和横向公平性
• Ho & Su (2009). Peer-induced fairness
公平行为与供应链管理
• 供应链/营销渠道中的公平行为的实证研究
– Olmstead and Rhode 1985, Kumar 1996, Scheer et al. 2003,
Loch and Wu 2008：一些生产商和零售商会牺牲一部分自
己的利润来提高渠道中其他企业的利润。
– Kumar (1996)发表于《哈佛商业评论》的文章指出，如果
希望渠道上下游双方进一步协调，取得更高的渠道利润，
则双方需要增加信任（Trust）。而取得信任的途径有以下
几点：
– 1) 双方的交流（Bilateral communication）,
– 2) 公平公正（Impartiality）,
– 3) 可反驳性（Refutability）,
– 4) 解释（Explanation），
– 等等。
公平行为与供应链管理
• 公平与渠道协调
– Cui, Raju and Zhang (2007): Fairness and Channel Coordination
• 单一生产商单一零售商渠道，如果下游考虑公平，
批发价合同在一定条件下可以达到渠道协调
• Retailer:
– Demirag, Chen, & Li (2010): 上文模型中如果需求函数形式不
是线性的，在某些情况下也可以达到渠道协调；
– Pavlov and Katok (2009): 研究在公平性和其他有限理性行为共同
存在的情况下，渠道不能达到协调的原因，以及当公平性模型的
参数是私有信息的时候，对供应链协调带来的影响.
公平行为与供应链管理
• 公平与市场定价
– Chen and Cui (2010)
– “The Benefit of Uniform Price for Branded Variants”
– incorporate consumers’ concerns of peer-induced price fairness into a
model of price competition
– explain the extensive adoption of uniform pricing by firms
• Guo (HKUST, 2012)
– “Inequity Aversion and Fair Selling”
– Firm: cost disclosure?
– Customer : willing to buy?
公平行为与供应链管理
• 公平与销售激励
• Cui, Raju & Shi (2011)
– “S-shaped Incentive Schemes and Pay Caps ”
– incorporating salespeople’s aversion to pay inequity into the standard
agency model
公平行为与供应链管理
• 公平与投资和定价
– Cui and Mallucci (2012) ：
– “Fairness Ideals in Distribution Channels”
– both investments and pricing decisions
– They analytically show that firms are more willing to invest in a channel
in order to affect the formation of the equitable payoff, when firms are
fair-minded and make pricing decisions to assure their pecuniary payoffs
reflect their respective contributions to the channel.
– Embed the fairness utility function of channel members in a quantal
response (QRE) model (bounded rationality).
– Suggest that a newly proposed sequence-aligned fairness ideal,
according to which the sequence of moving determines the formation of
equitable payoff for channel members, significantly outperforms other
fairness ideals, including strict egalitarianism, liberal egalitarianism, and
libertarianism.
公平行为与供应链管理
• 过程公平与努力程度
– Wu, Loch & Heyden (2008) ：
– develop a model of fair process in
a principal-agent (i.e., manageremployee) context.
– show that indeed fair process will
not always be used.
• 公平与供货渠道
– Chen, Zhao and Shen (2012)
– “The Horizontal Fairness Concern of Backup Supplier in a Triadic Supply
Chain”
公平行为与供应链管理
• 公平、合作广告与渠道协调（Yang et al. 2013）
• 市场需求函数 S(A) 由区域性广告 A 决定， S ( A)  T  kA
– T 0
代表市场潜力
– k > 0 代表零售商的广告转化为利润的比例系数
• 假设短期内销售价格给定，从而除去广告成本之外的边际利润率
给定
• 令 M、R 的边际利润率分别为 ρM 和 ρR；假设 M   R 2
M 采用合作广告策略；
– 广告成本满足二次函数形式： A2/2 。
制造商M
零售商R
二级分销渠道结构图
市场需求S
，保证
公平行为与供应链管理
• 合作广告合同
– 制造商 M 对零售商 R 的每一单位区域性广告支出 A 给予 t 比例
的补贴；
• M、R 以及整个渠道的利润函数可写为
合作广
告策略
 M ( A, t )   M S ( A)  tA2 / 2   M (T  kA)  tA2 / 2,
 R ( A, t )   R S ( A)  (1  t ) A2 / 2   R (T  kA)  (1  t ) A2 / 2.
C ( A)  (M  R )S ( A)  A2 / 2  (M  R )(T  kA)  A2 / 2
• 不关注公平情形 -- 分散决策(Stackelberg 博弈)
M 首先决定其参与率 t; R 随后决定广告水平 A
–结论：
分散决策下合作广告策略达不到渠道协调；
合作广告策略占优于非合作广告策略。
公平行为与供应链管理
• 零售商关注公平的模型
– R 最大化自己的效用，效用函数不仅与自己的利润相关，还与
其他人的利润相关（ Fehr & Schmidt 1999 ）
U R ( A, t )   R ( A, t )   max( M ( A, t )   R ( A, t )), 0
  max( R ( A, t )   M ( A, t )), 0,
•   0称之为愤怒系数
1]
•   [0,称之为怜悯系数
  0称之为零售商的公平比例系数
•
 
• 假设
– 定义 R 的三种状态：
• 愤怒不公平状态：零售商考虑公平性，且  R ( A, t )   M ( A, t )
• 怜悯不公平状态：零售商考虑公平性，且  R ( A, t )   M ( A, t )
• 公平状态：零售商考虑公平性，且  R ( A, t )   M ( A, t )
公平行为与供应链管理
•
模型主要结果：
–
–
存在渠道协调的区域（I），如下图算例所示；
渠道均衡条件下渠道利润不能任意分配，M 和 R 的利润只能按照
的比例分配。
1: 
公平的代价
简要评述
• 关于公平性，学者们还提出了其他模型（略）。
• 公平性的模型广泛用于分析许多现实问题
•纵向公平性：上下游企业间的渠道协调问题
•横向公平性：各类销售商；各类供应商
•……
报童模型
背景与问题
http://www.cnr.cn/gundong/20120
4/t20120403_509376939.shtml
报童面临的
关键决策之一
“20世纪40年代巴菲特的报
童生涯为他赚进5000美元,
他用这笔钱开始投资，最终
成立伯克夏公司，成为全球
最富有的人之一。”
每天需求量是随机的，
应该购进多少份报纸？
购进太多卖不完退回赔钱
购进太少不够销售赚钱少
存在一个合
适的购进量
一个具体例子
每份报纸：进货价c=6元，
报童零售价 p=12元，处理价 s=3元.
历史销售
量统计
销量
概率
累计概率
1000
2000
0.00
0.05
0.00
0.05
3000
4000
5000
0.15
0.40
0.30
0.20
0.60
0.90
6000
0.10
1.00
报童进多少份？
报童问题的特点
•
•
•
•
•
相对较短的销售期（明确的起止时刻）
销售期开始前要确定购买量
需求分布已知（或可以估计）
失去销售机会的成本较高
剩余（库存）的成本较高
例如：季节性产品（易逝品）
• 报纸，杂志，时装，纪念品，玩具，…
• 鲜花，海鲜，蔬菜，水果，…
• 血库，应急药物，救灾物质，…
• ……
模型假设与分析
假已知需求量D的随机规律——每天（单位时
设间）需求量为 x 的概率是 f(x), x=0,1,2,…
报童售报： p (零售价) > c(购进价) > s(处理价)
分
析售出一份赚 p-c=(p-s)-(c-s)；处理一份赔 c-s
每天需求量是随机的
每天利润是随机的
优化问题的目标函数(效用)应是——
长期的日平均利润
（每天利润的期望值）
风险态度：
风险中性！
建模
• 设每天购进 q 份，日平均利润为 E(π)
• 售出一份赚 p-c =(p-s)-(c-s)；降价一份赔 c-s
x  q  售出x  赚( p  c) x
 降价q  x  赔(c  s)(q  x)
x  q  售出q  赚( p  c)q

q
E( (q, D))  [( p  c) x  (c  s)(q  x)] f ( x) 
x 0
 ( p  c)qf ( x)
x q 1
求 q 使 G(q)=E(π(q,D)) 最大
f (x)  概率密度
求解将 x 视为连续变量
q

0
q
G(q)   [( p  c) x  (c  s)(q  x)] f ( x)dx   ( p  c)qf ( x)dx
q
dG
 ( p  c)qf (q)  0 (c  s) f ( x)dx
dq

 ( p  c)qf (q)   ( p  c) f ( x)dx
q
q

0
q
 (c  s) f ( x)dx  ( p  c) f ( x)dx
dG
0
dq


q
0

q
pc

cs
f ( x) dx
f ( x) dx

结果解释


q
0

q
取q使
q
0

f ( x)dx  P1 ,  f ( x)dx  P2
pc

cs
f ( x)dx
f ( x)dx
q
f
P1
pc

P2
cs
P1
pc


1  P1
cs
pc
 P1 
ps
P1
0
P2
Q
p-c ~售出一份赚的钱
c-s ~降价一份赔的钱
( p  c)  q , (c  s)  q 
x
具体例子的解
报童： p-c = 12 - 6 = 6
p-s = 12 - 3 = 9
P(x ≤ q) = 0.667
若c=9： p-c = 12 - 9 = 3
p-s = 12 - 3 = 9
P(x ≤ q) = 0.333
Probability P(x<Q)
1.00
0.667
0.80
0.60
0.333
0.40
0.20
需求 D
0.00
1000
2000
3000
3,208
4000
Quantity (Q)
5000
4,222
6000
模型是否反映了现实？
来源: M. E. Schweitzer and G. P. Cachon.
Decision Bias in the Newsvendor Problem
with a Known Demand Distribution:
Experimental Evidence. Management
Science, 46/3 (2000), pp. 404-420
模型的检验/验证
实际调查（有一定困难）
设计可控的（实验室）实验
离散均匀分布： D~U[1,300]，均值μ=150
p=12, s=0，c=3 或 9
高利润：c=3
低利润：c=9
P(x ≤ q) = 1/4<1/2
P(x ≤ q) = 3/4>1/2
qn=300*1/4=75<μ
qn=300*3/4=225>μ
实验主体：来自Duke大学的34名学生，每人实验30轮
（高低利润情形各半，事前学生不知道轮数）
模型与现实的差距
高利润：P(x ≤ q) = 0.75
P(x ≤ q) = 0.25
低利润：
模型与现实的差距
如何解释？
其他偏好：效用（utility）、目标
•风险中性==〉风险规避/追逐（risk averse/seeking）
•前景理论（Prospect Theory）、损失规避（loss averse）
•浪费规避（waste averse）
•缺货规避（stockout averse）
•低估机会成本（ Underestimated Opportunity Costs ）
•事后误差最小（ Minimizing Ex-Post Inventory Error ）
锚定且调整不足（ Anchoring and Insufficient Adjustment）
•锚定于平均需求μ，并向最优订货量方向调整
•锚定于上次订货量，并向上次需求方向调整
模型与现实的差距
设初始财富为w0, 效用函数为u, 则期望效用为：
u
us
un
ua
风险中性（risk-neutral）：u ==〉un
un(w)=w 
qn (同前，经典模型)
风险规避（risk-averse）：u ==〉ua

qa < qn
(Eeckhoudt et al. 1995)
风险追逐（risk-seeking）：u ==〉us

qs > qn (Eeckhoudt et al. 1995)
w
模型与现实的差距
设初始财富为w0, 效用函数为u, 则期望效用为：
up
前景理论（Prospect Theory）：u ==〉up
参照依赖（如以w0为参照）
w0
w
如果所有结果都是收益  qp < qn
如果所有结果都是损失  qp > qn
否则  qp < qn或 qp > qn
未严格分析
模型与现实的差距
设初始财富为w0, 效用函数为u, 则期望效用为：
ul
损失规避（loss-averse）：u ==〉ul
(前景理论的特例)

ql < q n
w0
w
模型与现实的差距
设初始财富为w0, 效用函数为u, 则期望效用为：
浪费规避（waste-averse）：u ==〉ut

ql < q n
缺货规避（stockout-averse）：u ==〉um

qm > q n
模型与现实的差距
设初始财富为w0, 效用函数为u, 则期望效用为：
低估机会成本（ Underestimated Opportunity Costs ）：u ==〉uo
当D>q:
机会成本：p-c
um：p-c ==〉β(p-c),
0≤β<1

qo < qn
模型与现实的差距
设初始财富为w0, 效用函数为u, 则期望效用为：
事后误差最小（ Minimizing Ex-Post Inventory Error ）: u ==〉ue
“后悔成本”
假设需求分布对称：
f(μ+y)=f(μ-y)
μ为均值
高利润产品  μ ≤ qe ≤ qn
低利润产品  μ > qe > qn
模型与现实的差距
锚定且调整不足（ Anchoring and Insufficient Adjustment）
•锚定于平均需求μ，并向最优订货量方向调整
√
 高利润参品订货不足，反之订货偏高
• 锚定于上次订货量，并向上次需求方向调整
√
 无论高低利润产品，均可能订货偏低/高
注：可建立多阶段模型严格分析（如考虑学习效应）
可参考：AJ A. Bostian，Charles A. Holt，Angela M. Smith. Newsvendor “Pull-toCenter” Effect: Adaptive Learning in a Laboratory Experiment, MANUFACTURING
& SERVICE OPERATIONS MANAGEMENT, Vol. 10, No. 4, Fall 2008, pp. 590–608
模型与现实的差距
小结：
√

√

√

注：
可能存在
组合效应
模型与现实的差距
其他发展：
最近若干年，还有其他相关实验和解释
例如：最近，有人通过实验
发现：
男女决策者提交的订货量
有差别，如何解释？
认知层次模型
Example: p-Beauty Contest (p-BC)
 n players
 Every player simultaneously chooses a number from
0 to 100
 Compute the group average
 Define Target Number (Beauty Number) to be 0.7
times the group average （p=0.7）
 The winner is the player whose number is the closet
to the Target Number
 The prize to the winner is US$20
Results in various p-BC games
Subject Pool
CEOs
80 year olds
Economics PhDs
Portfolio Managers
Game Theorists
Group Size Sample Size
20
20
33
33
16
16
26
26
27-54
136
Mean
37.9
37.0
27.4
24.3
19.1
答案：如果所有实验参与者都是“经济人”(完全理性)，
则每个人都应该选0。
评价：据说博弈论专家不选0的理由之一是----“我当然
知道理论结果是0，但其他人都会选0吗？”
69
行为科学的要点之一：有限理性
西蒙：真实行为对客观理性的违反
1、按照理性的要求，行为主体必须完全了解并
预期每项决策产生的结果。而实际上，人们对决策
结果的了解总是零零碎碎、不完整的；
2、由于决策产生的结果未来才会发生，所以在
给他们赋值时就必须用想象力来弥补缺乏真实体验
的不足；
3、按照理性的要求，行为主体要在所有可行的
备选行为中做出选择。而在真实情况下，主体只可
能想到有限的几个可行方案。
认知层次模型的要点
CH (Cognitive Hierarchy)模型: 决策者的异质性
(Camerer, Ho, and Chong, 2004)
The Quarterly Journal of Economics, August 2004
认知层次模型的要点
CH (Cognitive Hierarchy)模型: 决策者的异质性
(Camerer, Ho, and Chong, 2004)
1、决策者的认知（如聪明程度、计算能力等）
不同，可以依次分层：0，1，2，…；
2、不同层的决策者具有不同的决策规则。
具体模型（模型假设）：
1、第0层决策者随机选择行动；
2、第k(>0)层决策者认为其他所有人位于0~k-1
层并可推测其比例，从而对其行动做出最优反映；
3、位于不同层的决策者数量服从Poisson分布。
（实证数据：通常τ=1~2之间拟合较好）
The Cognitive Hierarchy (CH) Model
People are different and have different
decision rules.
Modeling heterogeneity (i.e., distribution of
types of players). Types of players are
denoted by levels 0, 1, 2, 3,…,
Modeling decision rule of each type.
73
Assumptions
• Denote a k-step player’s belief about the proportion
of h-step players by gk(h). (But the frequency is f(k)).
• Overconfidence: players doing k>=1 steps do not
realize that others are using k or more than k steps
of thinking (that is, gk(h) = 0 for h >= k ).
• k-step players have an accurate guess about the
relative proportions of players who are doing less
thinking than they are.
• “Increasingly rational expectations”: As k increases,
the total absolute deviation between the actual
frequencies f(h) and the beliefs gk(h) shrinks.
74
Modeling Decision Rule
Frequency of k-step is f(k)
Step 0 choose randomly: |Si|=mi, P0(sij) = 1/mi
k-step thinkers know proportions f(0),...f(k-1)
Form
based on beliefs
beliefs g k (h) 
f ( h)
(h<k) and best-respond
k 1
 f (h
'
)
h ' 0
Iterative and no need to solve a fixed point
75
Modeling Decision Rule
Given the k-step thinker’s beliefs, the expected payoff
to a k-step thinker from choosing strategy si j is
For simplicity, we assume that players best-respond (Pk(si*)
= 1 iff si* = argmax Ek(πi(si j))), and randomize equally if
two or more strategies have identical expected payoffs.
• Starting with 0-step player behavior and iterating to
compute P1(si j ), P2(si j ),. . . . (In practice, we
truncate the recursion at a k large enough that the
remaining frequencies, f(k’) for k’ > k, are tiny.)
76
Poission-CH: mean=1.55
ROW
K
0
1
2
3
>3
Proportion, f(k)
0.212
0.329
0.255
0.132
0.072
T
L
0,0
COLUMN
C
10,-10
R
-5,5
M
-15,15
15,-15
25,-25
B
5,-5
-10,10
0,0
K's
Level (K) Proportion
0
0.212
Aggregate
0
0.212
1
0.329
Aggregate
0
0.212
1
0.329
2
0.255
Aggregate
K+1's
Belief
1.00
1.00
0.39
0.61
1.00
0.27
0.41
0.32
1.00
T
0.33
0.33
0.33
0
0.13
0.33
0
0
0.09
ROW
M
0.33
0.33
0.33
1
0.74
0.33
1
0
0.50
B
0.33
0.33
0.33
0
0.13
0.33
0
1
0.41
L
0.33
0.33
0.33
1
0.74
0.33
1
1
0.82
COL
C
0.33
0.33
0.33
0
0.13
0.33
0
0
0.09
R
0.33
0.33
0.33
0
0.13
0.33
0
0
0.09
77
Theoretical Implications
Exhibits “increasingly rational expectations”
 Normalized gk(h) approximates f(h) more closely as
k ∞ (i.e., highest level types are
“sophisticated” (or "worldly") and earn the most.
Highest level type actions converge as k ∞
 marginal benefit of thinking harder 0
78
Alternative Specifications
Overconfidence:
k-steps think others are all one step lower (k-1) (Stahl, GEB,
1995; Nagel, AER, 1995; Ho, Camerer and Weigelt, AER,
1998)
“Increasingly irrational expectations” as k ∞
Has some odd properties (e.g., cycles in entry games)
Self-conscious:
k-steps think there are other k-step thinkers (gk(k)>0).
Similar to Quantal Response Equilibrium/Nash
Fits worse
79
Plausibility of Poisson Distribution
We focus on the one-parameter Poisson distribution for f(k)
because
• The simpler one-parameter Poisson form fits almost as well
as a seven-parameter model (with frequencies f(k) up to k
=7)—allowing each f(k) to be independent results in less than a
1 percent decrease in log likelihood—in four of the five data
sets we examined.
•The Poisson model is also easier to compute and estimate, and
easier to work with theoretically.
80
Modeling Heterogeneity, f(k)
A1:
f (k )


f (k  1) k
f(k)/f(k-1) declines (proportionally to 1/k)
sharp drop-off due to increasing difficulty in
simulating others’ behaviors
A2: f(0) + f(1) = 2f(2)
81
Implications
 A1 Poisson distribution with mean and variance = 
f (k )  e


k
k!
A1,A2  Poisson, 1.618... (golden ratio Φ)
82
Poisson Distribution
 f(k) with mean step of thinking :
f (k )  e


k
k!
frequency
Poisson distributions for
various 
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
1
1.5
2
0
1
2
3
4
5
6
number of steps
83
Existence and Uniqueness:
CH Solution
 Existence: There is always a CH solution in
any game
 Uniqueness: It is always unique
84
Theoretical Properties of CH Model
 Dominance-Solvable Games
• When f(k) is Poisson-distributed, the relative proportions of
types one step below and two steps below a k-step thinker,
f(k-1)/f(k -2) = τ/(k-1), puts overwhelming weight on the k-1
types if τ is very large (i.e., k<<τ). In that case, a k-step
thinker acts as if almost all others are using k -1 steps.
• This property of the Poisson distribution provides a simple
way to link thinking steps to iterated deletion of dominated
strategies.
– 1-step thinkers will never choose weakly dominated strategies,
because those strategies are never best responses to the random
strategies of 0-step types
– τ is large: 2-step thinkers, 3-step thinkers, …
85
Theoretical Properties of CH Model
Advantages over Nash equilibrium
Can “solve” multiplicity problem (picks one
statistical distribution)
Sensible interpretation of mixed strategies (de
facto purification)
Theory:
τ∞ converges to Nash equilibrium in (weakly)
dominance solvable games
86
Example: Entry games
 Market entry with many entrants:
Industry demand D (as % of # of players) is announced
Prefer to enter if expected %(entrants) < D;
Stay out if expected %(entrants) > D
All choose simultaneously
 Experimental regularity in the 1st period:
Consistent with Nash prediction, %(entrants) increases with
D
“To a psychologist, it looks like magic”-- D. Kahneman ‘88
87
Example: Entry games (data)
How entry varies with industry demand
D, (Sundali, Seale & Rapoport, 2000)
1
0.9
0.8
% entry
0.7
entry=demand
experimental data
0.6
0.5
0.4
0.3
0.2
0.1
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
Demand (as % of number of
players )
88
Behaviors of Level 0 and 1 Players
( =1.25)
Level 1
Level 0
Demand (as % of # of players)
89
Behaviors of Level 0 and 1 Players
( =1.25)
Level 0 + Level 1
Demand (as % of # of players)
90
Behaviors of Level 2 Players
( =1.25)
Level 2
Level 0 + Level 1
Demand (as % of # of players)
91
Behaviors of Level 0, 1, and 2 Players
( =1.25)
Level 2
Level 0 + Level 1 +
Level 2
Level 0 +
Level 1
Demand (as % of # of players)
92
CH Predictions in Entry Games
( = 1.25)
How entry varies with demand (D),
experimental data and thinking model
1
0.9
0.8
% entry
0.7
entry=demand
0.6
experimental data
0.5
1.25
0.4
0.3
0.2
0.1
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
Demand (as % of # of players)
93
Results in various p-BC games
94
Results in various p-BC games
Subject Pool
CEOs
80 year olds
Economics PhDs
Portfolio Managers
Game Theorists
Group Size Sample Size
20
33
16
26
27-54
20
33
16
26
136
Mean
37.9
37.0
27.4
24.3
19.1
Error
(Nash)
-37.9
-37.0
-27.4
-24.3
-19.1
Error
(CH)
-0.1
-0.1
0.0
0.1
0.0

1.0
1.1
2.3
2.8
3.7
95
Summary
CH Model:
Discrete thinking steps
Frequency Poisson distributed
One-shot games
Fits better than Nash and adds more economic
value
Explains “magic” of entry games
Sensible interpretation of mixed strategies
Can “solve” multiplicity problem
 Initial conditions for learning
96
提
纲
• 什么是行为运筹学/运作管理？
– 传统OR/OM：存在的主要问题
– BOR/BOM的兴起
• BOR/BOM：几个例子
– 公平性模型
– 报童模型
– 认知层次模型
• BOR/BOM：知识要点及研究方法
行为运筹学与行为运作管理
行为科学的要点
来源： Loch, C.H. and Wu, Y.
Behavioral Operations Management,
2007
Introduction
Individual Biases
• 1.Reference Dependence and
Prospect Theory
• 2. Immediacy, Salience and
Hyperbolic Discounting
• 3. Ambiguity and Complexity
Effects
• 4. Regret Theory
• 5. Heuristics and Biases
• 6. Emotions and the “Affect
Heuristic”
Social Preferences
• 1. Emotions
• 2. Status
• 3. Reciprocity and
Relationships
• 4. Group Identity
• 5. Motivation and Group
Performance
• 6. Fair Process
Behavioral OM Models of
Culture
行为运筹学与行为运作管理
行为科学的要点
• 来源： E. Bendoly, et al, Bodies of
knowledge for research in Behavioral
Operations – POM, 19(4), 2010
行为运筹学与行为运作管理
行为科学的要点
认知心理学：
感知、使用语言、推理、解决问题、判断和决
策的心理过程。
人的行为只能部分地由先前的环境事件或行为
结果所决定，一些最重要的行为是从全新的思维方
式中产生的。
人的认知过程：决策中的启发式和认知偏差
当人们面临决策时，只能采用简单的启发式方
式，在有限的时间、有限的知识和有限的认知能力
的条件下进行。
行为运筹学与行为运作管理
行为科学的要点
要点1：过于自信（Overconfidence）
例如：overprecision (高估自己掌握的信息精准度)
overestimation (高估自己的能力)
要点2：锚定、调整不充分（Anchoring and
Insufficient Adjustment）
例如：过分“沉溺”于过去的经验和教训
要点3：损失规避、反射效应、框架效应（Loss
Aversion, Reflection Effect, and Framing）
例如：前景理论（Prospect Theory）
行为运筹学与行为运作管理
行为运筹学与行为运作管理
前景理论
价值函数(Value function)
效用
获利
（ⅰ）以参照点为原点
（ⅱ）收益部分为凹的、损失部分
为凸的
（ⅲ）损失部分比收益部分更陡峭
财富
损失
行为运筹学与行为运作管理
前景理论
权重函数(Weighting function)
非常不可能的事件要么被忽视
要么被权重过度
大概率与确定性之间的差异要
么被忽视要么被夸大
因而，在接近端点处的情形是
不正常的
行为运筹学与行为运作管理
行为科学的要点
社会心理学：
研究个体和群体的社会心理现象。个体社会心理现
象指受他人和群体制约的个人的思想、感情和行为，
如人际知觉、人际吸引、社会促进和社会抑制、顺
从等。群体社会心理现象指群体本身特有的心理特
征，如群体凝聚力、社会心理气氛、群体决策，以
及全体之间的相互关系和相互影响等。
-- “关系学”（Guanxi）
人的社会属性：
信任，权力；平等，公正；奉献，互惠；…
行为运筹学与行为运作管理
行为科学的要点
要点1：目标设定（Goal Setting Theory）
例如：目标清晰比尽力而为的绩效更好；
兼顾个体与群体目标、短期与长期目标；
关注平等、公正，奉献、互惠
要点2：反馈与控制(Feedback & Control Theory)
例如：及时、准确的反馈 ==〉及时调整、控制
要点3：相关性（Interdependence）
例如：搭便车《==》来自同事的压力；
努力程度向“中等”看齐
行为运筹学与行为运作管理
行为科学的要点
群体动力学(group dynamics)：
亦称 “团体动力学”，试图通过对群体现象的动
态分析发现其一般规律的理论。它以群体的性质、
群体发展的规律、群体和个人的关系，群体和群体
的关系等作为研究对象。这一理论对社会心理学、
组织管理心理学的形成和发展有很大影响，特别是
对研究群体行为作出了重要贡献。
--（产业与）组织心理学、组织行为学（OB）
人的行为受到环境（“场”）的影响：
B = f (P, E)
B: 行为；P:个人；E: 环境；f: 函数
行为运筹学与行为运作管理
行为科学的要点
要点1：群体思维与Abilene悖论（Groupthink and
Abilene Paradox）
例如：从众心理--导致次优甚至很坏结果
要点2：相互责备(Blame in Groups)
例如：“内耗”--内部“摩擦”
要点3：衰竭的螺旋（Breakdown Spirals）
例如：集体主义（文化）《==》个体主义（文
化）
行为运筹学与行为运作管理
行为科学的要点
系统动力学(system dynamics)：
对整体运作本质的思考方式，把结构的方法、功能
的方法和历史的方法融为一个整体，其目的在于提
升人类组织的“群体智力”。
常用工具：
因果映射—描述系统的动态复杂性
仿真建模—对复杂系统的结果进行仿真
行为运筹学与行为运作管理
行为科学的要点
要点1：对反馈结构的错误感知（Misperceptions
of Feedback Structure）
例如：关注具体事件，而不是导致事件的系统
结构；
对非线性的影响估计不足
要点2：对反馈动力学机制的错误感知(Misperceptions
of Feedback Dynamics)
例如：有限理性；
有限的计算能力（即使是很简单的问题）
BOR/BOM研究方法
BOR（BOM）研究四部曲
实验设计
行为科学
实验与
实证
行为特
征认知
建模与
优化
运筹学
行为科学
统计学
行为科学
还原
机制
运作管理
行为科学
BOR/BOM可能的主要研究方向
来源：A. Goldfarb, et al. (12位学者) 2011. Behavioral Models of Managerial
Decision-Making. http://ssrn.com/abstract=2011559
BOR/BOM可能的主要研究方向
研究对象（理论与实验）：
博弈论(对策论)、算法博弈论
排队论：排队系统、服务系统
决策论：决策分析
存储论：生产、库存与供应链管理
网络、图论
动态规划、组合优化
线性规划、非线性规划、整数规划，等等（？）
基本方法：在建模中，引入人的行为特征
BOR/BOM可能的主要研究方向
OM研究对象(应用与实证):
产品研发
质量管理
生产系统
外包战略
等等
OM交叉领域:
OM与营销管理
OM与金融(风险管理)
OM与应急管理
OM与收益管理
等等
更多研究对象:
学习效果：作业一次性还是每周布置、检查？
生产计划与排序中人的行为（见下页）
等等
BOR/BOM可能的主要研究方向
2011年出版：
本书特点：
This book explores the role of
human behavior in planning
and scheduling decisions in
industry. From a wide range of
disciplinary perspectives, we
investigate how human
behavior determines
performance, and how this
interacts with the formal
organizational processes and
the decision support systems.
BOR/BOM可能的主要研究方向
跨文化背景决策差异：
已有结果（西方人），中国人的决策行为规律如何？
投资偏好对比（来源：http://efinance.org.cn/cn/1966/investigation.pdf）
总人数
64
64
63
64
64
64
64
64
64
64
64
64
63
64
64
64
64
64
选择A （中国）
72%
23%
68%
14%
19%
73%
78%
30%
27%
23%
64%
27%
27%
70%
22%
28%
81%
20%
选择B（中国）
28%
77%
32%
86%
81%
27%
22%
70%
73%
77%
36%
73%
73%
30%
78%
72%
19%
80%
A（美国）
83% 1
18%
65%
20%
14%
73%
92%
42%
8%
30%
20%
22%
16%
69%
18%
70%
72%
17%
B（美国）
7%
82%
35%
80%
86%
27%
8%
58%
92%
70%
80%
78%
84%
31%
82%
30%
28%
83%
主要参考文献
• A. Goldfarb, et al. 2011. Behavioral Models of Managerial
Decision-Making. http://ssrn.com/abstract=2011559
• E. Bendoly, R. Croson, P. Goncalves, and K. Schultz, “Bodies of
knowledge for research in Behavioral Operations” – Production
and Operations Management 19(4), 2010
• E. Bendoly, K. Donohue and K. Schultz, “Behavior in Operations
Management: Assessing Recent Findings and Revisiting Old
Assumptions”, JOM 24(6), 2006
• Gino, F., G. Pisano. 2008. Toward a theory of Behavioral
Operations. Manuf. Serv. Oper. Manage. 10(4): 676–691.
• Loch, C.H. and Wu, Y. Behavioral Operations Management.
Foundations and Trends in Technology, Information and
Operations Management. 2007.
• Camerer（贺京同等译），行为博弈：对策略互动的实验研究，
中国人民大学出版社，2002。
主要参考文献
•
•
•
•
•
•
•
Kahneman D. and Tversky, A. Prospect theory: An analysis of
decision under risk. Econometrica, 1979, 47, 263-291.
Tversky, A. and Kahneman, D. Advances in prospect theory:
Cumulative representation of uncertainty. Journal of Risk and
Uncertainty, 1992, 5, 297-232.
Camerer, C.F.; Ho, T.H. and Chong, K.J. A cognitive hierarchy
model of games. The Quarterly Journal of Economics, 2004, 119,
861-898.
投资偏好问卷调查分析，
http://efinance.org.cn/cn/1966/investigation.pdf
Hans, N. and Groson, R. Introduction to the Special Issue on
Behavioral Operations. MSOM, 2008, 10, 563-565.
Bendoly, E. Incorporating behavioral theory in OM empirical
models. Journal of Operations Management, 2006, 24, 735-736.
Loewenstein, G. and Prelec, D. Anomalies in intertemporal choice
evidence and an interpretation. The Quarterly Journal of
Economics, 1992, 107, 573-598.
主要参考文献
• Simon（黄涛译），西蒙选集，首都经贸大学出版社，2002。
• 金雪军，杨晓兰，行为经济学，首都经贸大学出版社，2009。
• 董志勇，行为经济学，北京大学出版社，2006 & 2008。
• 刘凤良，周业安，陈彦斌，于泽，行为经济学：理论与扩展，
中国经济出版社，2008。
• Roth（聂庆译），经济学中的实验室实验，中国人民大学出
版社，2002。
• 叶德珠，行为经济学利率期限结构理论研究，经济科学出版
社，2008。
• 李爱梅，心理帐户与非理性经济决策行为研究，经济科学出
版社，2007。
• Sen（李风华译），理性与自由，中国人民大学出版社，2002。
• 孙绍荣，宗利永，鲁虹，理性行为与非理性行为，上海财经
大学出版社，2007。
• 胡冶岩，行为管理学，经济科学出版社，2006。
• ……

行为运筹学和格式运筹学简介

Transcript 行为运筹学和格式运筹学简介

Directory