Transcript Document

Modeling and Planning with Robust Hybrid Automata

Cooperative Control of Distributed Autonomous Vehicles in Adversarial Environments

2001 MURI: UCLA, CalTech, Cornell, MIT

Dahleh/Feron/Williams May 14, 2001 UCLA

Brief update on MIT status

Investigators • Dahleh • Feron • Massaquoi • Williams Students • Z.-H. Mao (PhD) • G. Kotsalis (PhD) • K. Santarelli (PhD) • T. Schouwenaars (PhD) • M. Valenti (PhD) • A. Walcott (PhD)

Outline • Robust Hybrid Automaton concepts • Model-Based Programming of autonomous explorers • Game-theoretic concepts

Problem Formulation

• •

Basic problem for autonomous vehicles/robots:

Generate and execute a (sub) optimal motion plan, satisfying given boundary conditions, flight envelope and obstacle avoidance constraints , in a dynamic and uncertain environment

– Nonlinear control • Steering of underactuated, non-holonomic systems • Stabilization/tracking for nonlinear systems • Flight envelope protection – Robotics/Artificial Intelligence • Path planning (obstacle avoidance) for non-holonomic dynamical systems – Computer science/Software Engineering • Hard real-time constraints

Research supported by AFOSR, Draper, ONR

Hierarchical decomposition

• Need to introduce a

hierarchical structure

to achieve computational tractability, e.g. (Stengel, 93): – – –

“Strategic layer”:

Task scheduling, goal planning

“Tactical layer”: “Reflexive layer”:

Guidance, navigation Tracking, control, estimation • General hierarchical systems, derived from arbitrary decompositions, can be

extremely hard to analyze and verify

• • Design a hierarchical system such that it offers safety and performance guarantees

by construction

– Analysis and verification:

robustness analysis

problem

Consistent hierarchical system

System Quantization

• Quantization of

feasible

trajectories into

trajectory primitives

– formalization of the concept of “maneuver” –

Consistent abstraction

of the system dynamics • Hierarchical decomposition of the control tasks: – Maneuver sequencing

(guidance, trajectory planning)

– Maneuver execution

(control, trajectory tracking)

• Control synthesis: – Build a “maneuver library” (with feedback control) –

Behavioral programming

space : Solve a mixed-integer program on a “small” – Hybrid control system with performance and safety guarantees

by design

.

Maneuver Automaton

• Two classes of trajectory primitives ( trim trajectories + maneuvers ) • Construct a “Maneuver Library”, with a finite number of primitives • Generate trajectories by

sequencing

such primitives – All generated trajectories are solutions of the system’s diff. equations – All generated trajectories satisfy the flight envelope constraints (assuming F(x,u)=F( Y h x,u)) Steady left turn Hover Forward flight Steady right turn

0 -100 -200 -300 0

Example of planning in a free environment

400 300 200 actual position actual velocity commanded position "maneuver switch" 100 5 10 15 20 25 30 35 40

Model-based Autonomy

• How do we program explorers that reason quickly and extensively from commonsense models?

• How do we coordinate heterogeneous teams of robots -- in space, air and land - to perform complex exploration? • How do we couple reasoning, adaptivity and learning to create robust agents?

• How do we incorporate model-based autonomy into every day, ubiquitous computing devices?

Model-based Autonomy

Programmers generate breadth of functions from commonsense models in light of mission goals.

• Model-based Reactive Programming • Programmer guides state evolution at strategic levels.

• Commonsense Modeling • Programmer specifies commonsense, compositional models of spacecraft behavior.

• Model-based Execution Kernel • Reason through system interactions on the fly, performing significant search & deduction within the reactive control loop.

Model-based Programming of Cooperating Explorers

Managing Interactions for Cooperation

Programmers and operators must reason through system-wide interactions to : • select among redundant procedures • Evaluate outcomes • Plan contingencies • select deadlines • select timing constraints • allocate resources

Model-based Cooperative

Programming

Model-based Programs • Specify team behaviors as concurrent programs.

  • Specify options using decision theoretic choice.

   • Specify timing constraints between activities.

  c If c next A Unless c next A A, B Always A Choose reward A in time [t ,t + ] • Model-based Execution • Achieves correctness and economy  Pre-plans threads of execution that are optimal and temporally consistent .

• Responds at reactive timescales  Perform planning as graph search

Decision-theoretic Temporal Planner

HOME Station: ABC Mission Scenario

TWO

RENDEZVOUS

ONE Enroute

RESCUE AREA Station: XYZ Diverge RESCUE LOCATION MEETING POINT

Enroute Activity:

Enroute

Corridor 2 Rendezvous Corridor 1 Corridor 3 Rescue Area

Enroute Activity: • Least cost threads of execution generated by extended auction algorithm price = 425 [450,540] 1

0 0

3 price = 425 0 4 price = 425 6 price = 440

425

440 5 price = 0

0 0

8 0 price = 0

0

7 price = 0 9 price = 30 11 price = 1

30 1

price = 0 2

Extend Path 0

10 price = 0

0

13

0

price = 0 12 price = 0

Path P = 1

3

4

5

8

Start Node : 1 End Node: 2

9

10 11

12 13

2

x init

4 Temporal planning is combined with randomized path planning to find a collision free corridor

Path 1

X obs

x goal

5

Game-theoretic concepts

(Feron and DeMot) Problem: •Navigation of a number of vehicles to a target •Target located at a position that is known with respect to the vehicles or in a known region with a certain known probability distribution •Vehicles have visual information about a local part of the environment •Adversarial, unknown environment Issues: • Many cooperating vehicles vs. single vehicle missions •Continuously updating available information Approach: •Game theory

Illustrative Example

Adversary Target Two-agent game Requires mixed strategy Obstacle ?

Initial Observations

• Multiple vehicles yield pure strategies whereas for single vehicles a mixed strategy is optimal • Continuously information updates? Applicability of certainty equivalence principles (eg Basar & Bernhardt, Birkhauser, 1991) • More general setting: nature chooses the position of an arbitrary amount of obstacles in the unexplored areas - Need for well-defined models