Slide 1

Transcript Slide 1

Regular Expressions

Chapter 6

Regular Languages

Regular Language Regular Expression Accepts Finite State Machine 2

Regular Expressions

The regular expressions over an alphabet  are all and only the strings that can be obtained as follows: 1.  2.  4. If  5. If  6. If  7.  is a regular expression.

is a regular expression.

3. Every element of ,  ,   is a regular expression.

are regular expressions, then so is  .

are regular expressions, then so is  .

is a regular expression, then so is  *.

is a regular expression, then so is  + .

8. If  is a regular expression, then so is (  ).

Regular Expression Examples

If  = { a , b }, the following are regular expressions:   a ( a  abba b )*   4

Regular Expressions Define Languages

Define

, a

semantic interpretation function

for regular expressions: 1.

(  ) =  .

(  ) = {  }.

(

), where

 

(  ) =

(  )

(  ). = {

(    ) =

(  ) 

(  *) = (

(  ))*.

(  ).

(  + ) =

(  *) =

(  ) (

(  ))*. If

(  ) is equal to  , then

(  + ) is also equal to  . Otherwise

(  + ) is the 8.

language that is formed by concatenating together one or more strings drawn from

(  ).

((  )) =

(  ). 5

The Role of the Rules

• Rules 1, 3, 4, 5, and 6 give the language its power to define sets. • Rule 8 has as its only role grouping other operators. • Rules 2 and 7 appear to add functionality to the regular expression language, but they don’t.

2.  is a regular expression.

7.  is a regular expression, then so is  + .

Analyzing a Regular Expression

(( a  b )* b ) =

(( a  b )*)

( b ) = (

(( a  b )))*

( b ) = (

( a ) 

( b ))*

( b ) = ({ a }  { b })* { b } = { a , b }* { b }.

Examples

( a * b * ) =

( ( a  b )* ) =

( ( a  b )* a * b * ) =

( ( a  b )* abba ( a  b )* ) = 8

Going the Other Way

= {

 { a , b }*: |

| is even} 9

Going the Other Way

= {

 { a , b }*: |

| is even} (( a  b ) ( a  b ))* ( aa  ab  ba  bb )* 10

Going the Other Way

= {

 { a , b }*: |

| is even} (( a  b ) ( a  b ))* ( aa  ab  ba  bb )*

= {

 { a , b }*:

contains an odd number of a ’s} 11

Going the Other Way

= {

 { a , b }*: |

| is even} (( a  b ) ( a  b ))* ( aa  ab  ba  bb )*

= {

 { a , b }*:

contains an odd number of a ’s} b * ( ab * ab *)* a b * b * a b * ( ab * ab *)* 12

More Regular Expression Examples

( ( aa *)   ) =

( ( a   )* ) =

= {

 { a , b }*: there is no more than one b in

}

= {

 { a , b }* : no two consecutive letters in

are the same} 13

(    ) ( a  b )*

Common Idioms

optional   *, where  = {a, b} 14

Operator Precedence in Regular Expressions Highest Lowest Regular Expressions

Kleene star concatenation union

Arithmetic Expressions

exponentiation multiplication addition a b *  c d * x y 2 + i j 2 15

The Details Matter

a *  b *  ( a  b )* ( ab )*  a * b * 16

Kleene’s Theorem

Finite state machines and regular expressions define the same class of languages. To prove this, we must show: To prove A = B, we have to prove: 1. A  B and 2. B  A

Theorem:

Any language that can be defined with a regular expression can be accepted by some FSM and so is regular.

Theorem:

Every regular language (i.e., every language that can be accepted by some DFSM) can be defined with a regular expression.