Transcript: Lecture 10

Today’s Topics
• Tradeoffs in BFS, DFS, and BEST
• Dealing with Large OPEN and CLOSED
– A Clever Combo: Iterative Deepening
– Beam Search (BEAM)
– Hill Climbing (HC)
– HC with Multiple Restarts
– Simulated Annealing (SA)
Tradeoffs
(d = tree depth, b = branching factor)

Method: Breadth
  Positives: Guaranteed to find a soln if one exists;
             finds the shortest path (in # arcs traversed)
  Negatives: OPEN can become big, O(b^d)
             (all possible solutions generated for each depth
             before increasing depth); can be slow

Method: Depth
  Positives: OPEN grows more slowly, O(b * d);
             might find a long solution quickly
  Negatives: Might not get the shortest solution path;
             can get stuck in infinite spaces

Method: Best
  Positives: Provides a means of using domain knowledge
  Negatives: Requires a good heuristic function
Memory Needs for OPEN
(d = tree depth, b = branching factor)

Breadth: the tree's levels hold 1, b, b^2, b^3, …, b^d nodes,
so at depth d OPEN holds the entire frontier, O(b^d) nodes.

Depth: each level contributes b - 1 nodes to OPEN (the last
level contributes b), so OPEN holds O(b * d) nodes.

[Figure: search tree with the nodes currently in OPEN shown in yellow]
DFS with Iterative Deepening

Combines strengths of BFS and DFS

Algo (Fig 3.18)
Let k = 0
Loop
  let OPEN = { startNode }  // Don't use CLOSED (the depth limit handles infinite loops)
  do DFS, but limit depth to k  // If depth = k, don't generate children
  if goal node found, return solution
  else if we never reached the depth bound of k,
    return FAIL  // Searched the finite space fully
  else increment k
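A minimal Python sketch of the loop above, assuming hypothetical problem-specific callbacks successors(node) and is_goal(node) (these names are ours, not from the text):

# Sketch of DFS with iterative deepening. `successors` and `is_goal`
# are assumed problem-specific callbacks.
def depth_limited_dfs(node, k, successors, is_goal, path):
    """DFS below `node`, never expanding past depth k.
    Returns (solution_path_or_None, hit_depth_bound)."""
    if is_goal(node):
        return path, False
    if k == 0:                        # At the depth bound: don't generate children
        return None, True
    hit_bound = False
    for child in successors(node):
        soln, hit = depth_limited_dfs(child, k - 1, successors, is_goal, path + [child])
        if soln is not None:
            return soln, False
        hit_bound = hit_bound or hit
    return None, hit_bound

def iterative_deepening(start, successors, is_goal):
    k = 0
    while True:                       # No CLOSED list; recompute from scratch each pass
        soln, hit_bound = depth_limited_dfs(start, k, successors, is_goal, [start])
        if soln is not None:
            return soln               # Goal found
        if not hit_bound:
            return None               # Searched the finite space fully: FAIL
        k += 1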
Iterative Deepening Visualized
• See Figure 3.19 of text (use doc camera)
• WE SAVE NO INFORMATION BETWEEN
ITERATIONS OF THE LOOP!
• RECOMPUTE, rather than STORE
(a space-time tradeoff; common in CS)
• At first glance, seems stupid, but …
Computing the Excess Work

• Number of nodes generated in a depth-limited search to
  depth d with branching factor b:
  totalWork(d, b) = b^0 + b^1 + b^2 + … + b^(d-2) + b^(d-1) + b^d

• Number of nodes generated in an iterative-deepening search:
  iterDeep_totalWork(d, b) = Σ totalWork(i, b), for i from 0 to d
Example: Computing Excess Work
totalWork(5, 10) = 1 + 10 + 100 + 1,000 + 10,000 + 100,000 = 111,111
iterDeep_totalWork(5, 10)
  = totalWork(0, 10) + totalWork(1, 10) + totalWork(2, 10)
    + totalWork(3, 10) + totalWork(4, 10) + totalWork(5, 10)
  = 1 + 11 + 111 + 1,111 + 11,111 + 111,111
  = 123,456

Excess work = (123,456 - 111,111) / 111,111
            ≈ 11% - not bad!
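The arithmetic above is easy to verify in a few lines of Python (the function names follow the slide's totalWork / iterDeep_totalWork):

# Check the excess-work numbers for d = 5, b = 10.
def total_work(d, b):
    """Nodes generated by one depth-limited search to depth d."""
    return sum(b ** i for i in range(d + 1))

def iter_deep_total_work(d, b):
    """Nodes generated across all iterations of iterative deepening."""
    return sum(total_work(i, b) for i in range(d + 1))

plain = total_work(5, 10)              # 111111
iterd = iter_deep_total_work(5, 10)    # 123456
print((iterd - plain) / plain)         # 0.1111..., i.e. about 11% excess work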
BEAM Search:
Another Way to Deal with Large Spaces
• Simple idea
– Never let OPEN get larger than some constant,
called the ‘beam width’
– Insert children in OPEN, then reduce OPEN to
size ‘beam width’ (only need 1 new line in our basic code:
open ← discardFromBackEndIfTooLong(open, beamWidth); see the sketch below)
– Makes most sense with BEST first search,
since most promising nodes at front of OPEN
• The above is a variation of what the text calls
“local beam search” (pg 125-126)
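A minimal sketch of the BEAM variation above; successors, score, and is_goal are hypothetical problem callbacks (higher scores assumed better):

def beam_search(start, successors, score, is_goal, beam_width):
    open_list = [start]
    while open_list:
        node = open_list.pop(0)          # Most promising node is at the front
        if is_goal(node):
            return node
        open_list.extend(successors(node))
        # The one extra line: sort best-first, then discard from the back
        # end so OPEN never exceeds `beam_width`.
        open_list.sort(key=score, reverse=True)
        open_list = open_list[:beam_width]
    return None                          # The beam may have discarded every solution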
What if No Explicit Goal Test?
• Sometimes we don’t have an explicit
description of the GOAL
• If so, we simply aim to maximize (or
minimize) the scoring function
• E.g., design a factory with maximal
expected profit, or with the cheapest assembly cost
• When no explicit goal, assume goal?(X)
always returns FALSE
Hill Climbing (HC)
• Only keep ONE child in OPEN, and ONLY IF that
child has a better score than the current node (see the sketch below)
(recall the ‘greedy’ d-tree pruning algo)
• Like BEAM with beam-width = 1,
but don’t keep nodes worse than the current one
• Will stop at LOCAL maxima rather than GLOBAL
• Sometimes we do ‘valley [gradient] descending’
if lower scores are better, but ideas are identical
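A sketch of HC under the same assumed callbacks (successors, score); note that no goal test is needed, matching the previous slide:

def hill_climb(start, successors, score):
    current = start
    while True:
        children = successors(current)
        if not children:
            return current
        best = max(children, key=score)      # Consider only the best-scoring child
        if score(best) <= score(current):
            return current                   # No child is better: a local maximum
        current = best                       # OPEN is effectively this one node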
HC with Multiple Restarts
(simple but effective, plus runs in parallel)
For some tasks, we can start in various
initial states (e.g., slide the 8-puzzle pieces
randomly for 100 moves)
Repeat N times
• Choose a random initial state
• Do HC; record score and final state if best so far
Return best state found

[Figure: Score vs. State Space, with x's marking the local optima found by several HC runs]
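A sketch of the restart wrapper, reusing hill_climb from the previous sketch; random_state is a hypothetical callback producing a random initial state (e.g., 100 random 8-puzzle moves):

def hc_with_restarts(n, random_state, successors, score):
    best_state, best_score = None, float('-inf')
    for _ in range(n):                   # Repeat n times (independent, so parallelizable)
        final = hill_climb(random_state(), successors, score)
        if score(final) > best_score:    # Record score and final state if best so far
            best_state, best_score = final, score(final)
    return best_state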
Simulated Annealing (SA)
• HC but sometimes allow downhill moves
• Over time, the prob of allowing downhill moves gets smaller
Let Temperature = 100
X = StartNode // Call the current node X for short
LOOP
If X is a goal node or Temperature = 0, return X
Randomly choose a neighbor, Y
If score(Y) > score(X), move to Y // Accept since uphill
Else with prob e^((score(Y) - score(X)) / Temperature) go to Y
Reduce Temperature // Need to choose a ‘cooling schedule’
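A Python sketch of the SA pseudocode above; the linear cooling schedule is just one possible choice, and the callbacks are again assumptions:

import math, random

def simulated_annealing(start, successors, score, is_goal, temperature=100.0):
    x = start                                # Call the current node X for short
    while temperature > 0:
        if is_goal(x):
            return x
        y = random.choice(successors(x))     # Randomly choose a neighbor
        if score(y) > score(x):
            x = y                            # Uphill move: always accept
        elif random.random() < math.exp((score(y) - score(x)) / temperature):
            x = y                            # Downhill move: accept with prob e^(delta/T)
        temperature -= 1                     # One possible cooling schedule
    return x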
SA Example
(Let Temp = 10; scores NEGATED since originally lower was better)

[Figure: Start (score = -9) with neighbors B (score = -11), C (score = -4), and D (score = -8)]

• Assume we are at Start and randomly choose B
  – What is the prob we will move to B?
    Prob = e^((-11 - (-9)) / 10) = e^(-0.2) ≈ 0.82
• Assume we are at Start and randomly choose C
  – What is the prob we will move to C?
    Prob = 1.0, since an UPHILL (i.e., good) move
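A quick check of the move-to-B arithmetic:

import math
# score(B) = -11, score(Start) = -9, Temp = 10
print(math.exp((-11 - (-9)) / 10))   # e^(-0.2) = 0.818..., i.e. about 0.82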
Case Analysis of SA

1) Temp >> |score(Y) - score(X)|
   prob ≈ e^0 = 1
   so most moves are accepted when the temperature is high

2) Temp ≈ |score(Y) - score(X)|
   prob ≈ e^(-1) ≈ 0.37 // Since score(Y) < score(X)

3) Temp << |score(Y) - score(X)|
   prob ≈ e^(-∞) = 0
   so few moves are accepted when the temperature is low
Dealing with Large
OPEN Lists
• Iterative Deepening
– Keep OPEN small by doing repeated work
– Still get shortest solution (in #arcs)
• BEAM
– Limit OPEN to a max size (the beam-width)
– Might discard the best (or only!) solution
• HC (Hill Climbing)
– Only go to better-scoring nodes
– Good choice when no explicit GOAL test
– Stops at local (rather than global) optimum
• HC with Random Restarts
– K times start in random state, then go uphill; keep best
– Might not find the global optimum, but works well in practice
• SA (Simulated Annealing)
– Always accept good moves, with some prob make a bad move
– In the theoretical limit, finds global optimum
If OPEN Can Get Too Large,
What about CLOSED?
• If the branching factor is b, we add
b items to OPEN whenever one item is moved
from OPEN to CLOSED - so OPEN grows faster
• Items in CLOSED can be hashed, approximated,
etc., while items in OPEN need to store more info
• But CLOSED growing too large can still be
a problem
– so often it isn’t used and we live with some repeated work
and risk of infinite loops
– CLOSED not needed in Iter. Deepening and HC; WHY?
(Partially) Wrap Up
• Search spaces can grow rapidly
• Sometimes we give up on optimality and seek
satisfactory solutions
• But sometimes we’re unaware of more powerful
search methods and are too simplistic!
• Various ‘engineering tradeoffs’ exist and best
design is problem specific
• As technology changes, choices change