Classes of association rules short overview Jan Rauch, Department of Knowledge and Information Engineering University of Economics, Prague.
Download
Report
Transcript Classes of association rules short overview Jan Rauch, Department of Knowledge and Information Engineering University of Economics, Prague.
Classes of association rules
short overview
Jan Rauch,
Department of Knowledge and Information
Engineering University of Economics, Prague
1
Classes of association rules – overview
Introduction, classes of rules and quantifiers
Implicational quantifiers
Deduction rules for implicational quantifiers
Tables of critical frequencies for implicational quantifiers
- double implication 4ft quantifiers
- equivalence 4ft quantifiers
4ft quantifiers with F-property
2
Classes of association rules – Introduction
Simple intuitive definition
Each class contains both simple association rules and comlex
association rules corresponding to statistical hypothesis tests
Important both theoretical and practical properties
Examples:
imlicational association rules
double imlicational association rules
-double imlicational association rules
equivalency association rules
- equivalency association rules
rules with F-property
3
Literature
Hájek, P. - Havránek T.: Mechanising Hypothesis Formation –
Mathematical Foundations for a General Theory. Berlin –
Heidelberg - New York, Springer-Verlag, 1978, 396 pp,
http://www.cs.cas.cz/~hajek/guhabook/
Rauch, J.: Logic of Association Rules. Applied Intelligence, 2005, No. 22, 9-28
Rauch, J.: Classes of Association Rules, An Overview. In: LIN, T.Y. Ying,
X.(Ed.): Foundation of Semantic Oriented Data and Web Mining. Proceedings of an
ICDM 2005 Workshop, IEEE Houston 2005. pp 68 – 74.
http://www.cs.sjsu.edu/faculty/tylin/ICDM05/proceeding.pdf
4
Classes of 4ft-quantifiers
Association rule belongs to the class of association rules
if and only if the 4ft-quantifier belongs to the class of 4ft-quantifiers
Examples:
association rule is implicational iff is implicational
association rule is -double implicational iff is -double
implicational
association rule is - equivalency iff is - equivalency
5
* is implicational quantifier
M
M’
a
c
b
d
a’
c’
b’
d’
M’ is better from the point of view of implication:
a’ a b’ b
If *(a, b, c, d) = 1 and
*(a’, b’, c’, d’) = 1
a’ a b’ b
then
Truth Preservation Condition for implicational quantifiers:
TPC : a’ a b’ b
* is implicational:
If *(a, b, c, d) = 1 and TPC then *(a’, b’, c’, d’) = 1
6
Implication quantifiers – examples (1)
Founded implication: p,B (a,b,c,d) = 1 iff
a’ a b’ b:
a
p a Base
ab
a´
a´
a
p a´ a Base
a´b´ a´b a b
Founded 2b - implication: p,B (a,b,c,d) = 1 iff
a
p a Base
a 2b
7
Implication quantifiers – examples (2)
Lower critical implication for 0 < p 1, 0 0.5:
!p; (a,b,c,d) = 1
a+b
iff
(a+b)!
i!( a+bi)!
* pi *(1 p) abi
i a
The rule !p; corresponds to the statistical test (on the level ) of the null
hypothesis H0: P( | ) p against the alternative one H1: P( | ) > p.
Here P( | ) is the conditional probability of the validity of under the
condition .
a +b
' '
a’ a b’ b:
ia
(a '+b ')!
i!( a '+b 'i )!
* p * (1 p)
i
a ' b 'i
a +b
i a
(a+b) !
i!( a+bi )!
* p i * (1 p) abi
8
Deduction rules (1)
Is the deduction rule
M
A
A
E
a
c
A 0.9,50 E
A 0.9,50 E F
E
b
d
we see:
correct?
M EF (EF)
a’
b’
A
c’
d’
A
a’ a b’ b and TPC
thus if 0.9,50(a,b,c,d) = 1 then also 0.9, 50(a,b,c,d) = 1
Yes, the deduction rule
A 0.9,50 E
A 0.9,50 E F
is correct.
9
Deduction rules (2)
A !0.95,0.05 E
Is the deduction rule
A !0.95,0.05 E F
M
A
A
E
a
c
E
b
d
we see:
correct?
M EF (EF)
a’
b’
A
c’
d’
A
a’ a b’ b and it is TPC
and thus if !0.95,0.05(a,b,c,d) = 1 then also !0.95, 0.05(a,b,c,d) = 1
Yes, the deduction rule
A !0.95,0.05 E
A !0.95,0.05 E F
is correct.
10
Deduction rules (3)
Additional correct deduction rules (prove it home):
A B 0.9,50 E
A 0.9,50 E B
A B !0.95,0.05 E
A !0.95,0.05 E B
Question:
* implication quantifier:
X*Y
X' * Y '
iff ???
11
Deduction rules – two notions
Associated propositional formula ( ) associated to Boolean attribute :
Rule
p,B
e.g.
A B C p,B D E F
A, B, C, B, D, E, F are Boolean attributes
( ): Boolean attributes propositional variables
() = A B C
() = D E F
A, B, C, D, E, F are propositional variables,
we can decide if () is a tautology
12
Deduction rules – two notions
Implicational quantifier is interesting:
I is a – dependent , b – dependent and (0,0,c,d) = 0
is a - dependent if exists
a, a’, b, c, d :
(a,b,c,d) (a’, b, c, d)
0.9, 50, !0.9, 0.05 are interesting implication quantifiers
13
Correct Deduction Rules
X*Y
X' * Y'
is the correct deduction rule iff 1) or 2) are satisfied:
1) both (X) (Y) (X’) (Y’)
and (X’) (Y’) (X) (Y)
are tautologies
2) (X) (Y) is a tautology
14
Correct Deduction Rules
Example:
A B !0.95,0.05 E
A !0.95,0.05 E B
A B E A (E B)
is correct because of
and
A ( E B) A B E
are tautologies
15
Table of Critical Frequencies
implication quantifier:
if *(a, b, c, d) = 1 and a’ a b’ b
then *(a’, b’, c’, d’) = 1
* is c, d independent, thus *(a, b) instead of *(a, b, c, d)
Table of maximal b for *: Tb*(a) = min {e|*(a, e) = 0}
*(a, b)= 1 iff b < Tb* (a)
p, (a,b) 1
!
a +b
iff
i a
!p, (a,b) 1 iff
(a+b) !
i!( a+b i )!
* p i * (1 p) a b i
b Tb! p , (a )
16
Table of maximal b
b
a +b
ia
(a + b) !
i
a bi
*
p
*
(
1
p
)
i! ( a + b i )!
a+b
(a+b)!
i!( a+bi)!
* pi *(1 p) abi
i a
a
17
Class of - double implication 4ft quantifiers
M
X
X
Y
a
c
M’
X
X
Y
b
d
True Preservation Condition:
example: X p Y
Y
a’
c’
Y
b’
d’
a’ a b’ + c’ b + c
a/(a + b + c) p
TCF: Tb*(a) = min{b+c| *(a, b, c) = 0}
*(a, b, c)= 1 iff b + c < Tb* (a)
X *Y
X' * Y'
is correct iff ...
18
Class of - equivalence 4ft quantifiers
M
X
X
Y
a
c
M’
X
X
Y
b
d
True Preservation Condition:
example: X p Y
TCF:
Y
a’
c’
Y
b’
d’
a’ + d’ a + d b’ + c’ b + c
(a + d)/(a+b+c+d) p
Tb*(F) = min {b+c | *(a,b,c,d)=0 a+d=F}
*(a, b,c,d)= 1 iff b + c < Tb*(a + d)
X *Y
X' * Y'
is correct iff ...
19
4ft quantifiers with F-property
has the F-property if it satisfies
1) If (a,b,c,d) = 1 and b c – 1 0 then (a,b+1,c-1,d) = 1
2) If (a,b,c,d) = 1 and c b – 1 0 then (a,b -1,c+1,d) = 1
If is symmetrical and has the F-property then there is a function T(a,d,n) such
that for a+b+c+d = n is (a,b,c,d) = 1 iff | b-c | T(a,d,n)
Fisher’s quantifier and 2 quantifier have the F-property
20
AA - quantifier has F-property
21