Transcript מצגת ההרצאה
Sarit Kraus
Bar-Ilan University
[email protected]
Multi-issue Negotiation
Employer and job
candidate.
Objective: reach an
agreement over
hiring terms after
successful interview.
Chat based
negotiations
3
Training People in Negotiations
4
The Training Methods
Classical role playing with another human
counterpart:
simple
scheduling,
requires other people
Agent role playing
accessible and available 24/7
modeling different counterparts
no significant differences were
found between the different training
methods (148 human subjects)
5/19
Virtual Suspect to Train Investigators
7
Culture Sensitive Agents for
Culture related Studies
The development of standardized agents to be used in
the collection of data for studies on culture and
negotiation
Bargaining
Automated Mediators for
Resolving Conflicts
Automated Speech Therapist
First place of the TedMed 2014 innovation competition for startups in medicine
11
Language Tele-Rehabilitation:
Monitoring and Intervention
טפוך
Patient:
Patient:תפוח
השניה
Cyndi:
האותיות גם
נכונות
והשלישית
זה פרי
Currently in use by patients.
12
Agent Supports Deliberation
Past
deliberations
accumulative
data
Agent
Update
Current
deliberation
Offer arguments
=Obtains information
13
Supporting Robots-Human Teams
15
Sustainability: Reducing Fuel
Consumption
Persuasion
Why not use only game theory?
Game theory is a study of strategic decision making:
"the study of mathematical models of conflict and
cooperation between intelligent rational decisionmakers“
Results from the social sciences suggest people do not
follow game theory strategies.
People Often Follow Suboptimal
Decision Strategies
Irrationalities attributed to
sensitivity to context
lack of knowledge of own preferences
the effects of complexity
the interplay between emotion and cognition
the problem of self control
Why not Only Behavioral Science
Models?
There are several models that describe human
decision making
Most models specify general criteria that are context
sensitive but usually do not provide specific
parameters or mathematical definitions
Why not Only Machine Learning?
Machine learning builds models based on data
It is difficult to collect human data
Collecting data on specific user is very time consuming.
Human data is noisy
“Curse” of dimensionality
Methodology
Human
behavior
models
Human
specific
data
machine
learning
Human
Prediction
Model
Data
(from
specific
culture)
Game Theory
Optimization
methods
Take action
22
22
An Experimental Test-Bed
Interesting for people to play:
analogous to task settings;
vivid representation of strategy
space (not just a list of outcomes).
Possible for computers to play.
Can vary in complexity
repeated vs. one-shot setting;
availability of information;
communication protocol;
Negotiations and teamwork
25
25
CT Game
100 point bonus for
getting to goal
10 point bonus for each
chip left at end of game
Agreement are not
enforceable
Collaborators: Gal, Haim,
Gelfand
26
26
An Influence DiagramTwo rounds interaction
Probability
of
acceptance
Probability
of transfer
Methodology
Human
behavior
models
Human
specific
data
machine
learning
Human
Prediction
Model
Data
(from
specific
culture)
Game Theory
Optimization
methods
Take action
Prediction of Acceptance and Reliability
Data Collection:
Human playing against adaptive agent (PURB)
Reliability Measure
People
(Lebanon)
People
(US)
Co-dep
Task
indep.
Task
dep.
Average
0.96
0.94
0.87
0.92
0.64
0.78
0.51
0.65
29
Personality, Adaptive Learning
(PAL) Agent In thisDatadatafrom
set the
“Nasty Agent”: Machine
Less reliable when Learning
fulfilling its
agreement
People
adapt their
behavior to
their
counterparts
.
Lebanon
people
specific
almost
always kept
culture
the agreements
PAL never kept
agreements
Optimization
methods
Human
Prediction
Model
Take action
30
Pal vs Humans
The model’s used by PAL had a very
low accuracy prediction of people
actually played against PAL
250
200
150
PAL
Human
100
50
0
U.S
Lebanon
Israel
31
Multi-issue Negotiation: The
Negotiation Scenario
Employer and job
candidate
Objective: reach
an agreement over
hiring terms after
successful
interview
32
Chat-Based
Negotiation
33
NegoChat’s Offers
Find a “good” offer to use as anchoring
Predict which offers will be accepted using on past
negotiation sessions and create clusters of possible
offers per time period
Aspiration Adaptation Theory (AAT)
proposes issues sequentially based on aspiration
scale learned from data
retreat from previous values
Collaborators: Avi Rosenfeld, Zukerman,
Dagan, Gelfand
[email protected]
34
34
600
1200
500
1000
400
800
300
600
200
400
100
200
0
0
KBAgent
Time to Reach Agreement
Agent Score
NegoChat vs KBagent in Israel
Agent Score
NegoChat
Time (sec)
35
NegoChat in Egypt
Collect new data of human vs human (negotiations
much longer)
Build classifier
Extract aspiration list
Got many complaints on the agent.
Challenges in running experiments.
Results: slower than in Israel; lower score; almost the
same happiness and fairness; females scored much
lower.
36
One operator – Multiple robots
Search And Rescue (SAR)
Warehouse operation
Automatic air-craft towing
Fire-Fighting
Military applications
Etc..
Semi-Autonomous Robots
Noisy signals
Controls the robots
Agent
Controls the robots
Agent design
Data on
human
behavior
30 human
Operators in
simulation
Data on robots
performance
Machine
Learning
Human model
Optimization
Provide
Advice
150 hours of
simulations (no
human operator).
Robot model
Evaluation: Three Environments
12 subjects
16 subjects
16 subjects
Objects found per condition
Simulated office
Physical office
Simulated
warehouse yard
Computer Games for Learning
Relatively low impact
Possible Reason: lack involvement of the teachers
Better results with a human teachers in the loop.
Capabilities for Agents in Learning
Planning
Monitoring
Intervention
Encouragement
45
Human-agent Working Together
Planning: human forms the basic learning plan;
agent adapts it to the student’s progress.
Monitoring & Intervention: Agent performs M&I
and asks the human for help when it is not sure.
Encouragement: point up to the human
that a student needs encouragements.
46
Team of Agents and Teacher
Many automated agents provide service to many
students.
One human teacher supervises many students.
Special agent
supports the
teacher.
47
Reducing Energy Consumption
by Advising People on how to set the
Climate Control System
GM Chevrolet Volt 2011
Presenting Advice to User
Presenting Advice to User
~80% of drivers explicitly accepted.
Methodology
Collecting data on 38 subjects for an hour session in
the car)
Predicting drivers reactions to offers
Modeling the car and environment
Integrating into an MDP
Solving the MDP
51
Evaluation
45 drivers - 15 per condition, 3 rounds.
The lower the better.
Why Did MDP Outperform the
SAP?
Agent
Avg. go eco %
Avg. save %
Avg. consumption
MACS
0.835
23.1
0.174
SAP
agent
0.641
33.7
0.237
SAP was aggressive.
Some subjects stopped clicking on the advice.
This does not look
as a real person!!
Remove the
Avatar!!
AniMed evaluation
Agents interacting proficiently
with people is important
Human
Data
behavior
(from
Challenges:
models
How to integrate machinespecific
culture)
learning
and
behavioral
[email protected]
Human
machine
models?
How to use in
learning
specific
data
agent’s strategy?
Challenges:
Game Theory
Optimization
methods
Human
Experimenting with people
Prediction
is very difficult !!!
Model
Working with people from
other disciplines is
challenging.
Take action
56