Decomposed Process Mining: The ILP Case

Download Report

Transcript Decomposed Process Mining: The ILP Case

Decomposed Process
Mining: The ILP Case
Eric Verbeek and Wil van der Aalst
A Problem
/ department of mathematics and computer science
17-7-2015
PAGE 1
A Solution
/ department of mathematics and computer science
17-7-2015
PAGE 2
Regular Discovery
/ department of mathematics and computer science
17-7-2015
PAGE 3
Decomposed Discovery: Divide
/ department of mathematics and computer science
17-7-2015
PAGE 4
Decomposed Discovery: Conquer
/ department of mathematics and computer science
17-7-2015
PAGE 5
Regular Replay
/ department of mathematics and computer science
17-7-2015
PAGE 6
Decomposed Replay: Divide
/ department of mathematics and computer science
17-7-2015
PAGE 7
Decomposed Replay: Conquer
/ department of mathematics and computer science
17-7-2015
PAGE 8
Example Model (Accepting Petri Net)
/ department of mathematics and computer science
17-7-2015
PAGE 9
Example Event (Activity) Log
/ department of mathematics and computer science
17-7-2015
PAGE 10
Divide and Conquer Framework
/ department of mathematics and computer science
17-7-2015
PAGE 11
Divide and Conquer Framework
See http://www.promtools.org/
/ department of mathematics and computer science
17-7-2015
PAGE 12
Decomposed ILP Discovery Algorithm
/ department of mathematics and computer science
17-7-2015
PAGE 13
Filter (First Cluster)
Filter In/Out
/ name of department
Replace
17-7-2015
PAGE 14
Decomposed ILP Discovery Algorithm
/ department of mathematics and computer science
17-7-2015
PAGE 15
Discovered Model
/ department of mathematics and computer science
17-7-2015
PAGE 16
Decomposed ILP Replay Algorithm
/ department of mathematics and computer science
17-7-2015
PAGE 17
Strategy (Third Cluster)
Filter
/ department of mathematics and computer science
Replace
17-7-2015
PAGE 18
Decomposed ILP Replay Algorithm
/ department of mathematics and computer science
17-7-2015
PAGE 19
Replay Cost Factor
• Implementation issue:
• The ILP-based replayer takes integer costs.
• If an activity occurs in, say, 3 clusters, then the replay
costs of this activity in a single cluster should be a
third of the usual replay costs in the entire model.
• Solution:
• Take the greatest common divisor of all activity cluster
counts, multiply all replay costs by that factor, and later
on divide all replay costs by this factor again.
/ department of mathematics and computer science
17-7-2015
PAGE 20
Decomposed ILP Replay Algorithm
/ department of mathematics and computer science
17-7-2015
PAGE 21
Case Study Setting
• Mode
Event log based
on BPI Challenge
2012 log
/ department of mathematics and computer science
Model discovered
in earlier work
Event log aligned
on discovered
model
17-7-2015
PAGE 22
Case Study Model
/ department of mathematics and computer science
17-7-2015
PAGE 23
Case Study Results
/ department of mathematics and computer science
17-7-2015
PAGE 24
Conclusions
• General framework for decomposed process mining
• Objects with imports, exports, and visualizers
− Accepting Petri Net
− Causal Activity Matrix
− Causal Activity Graph
− Activity Cluster Array
− Event Log Array
− Accepting Petri Net Array
− Log Alignment
− Log Alignment Array
• Many algorithms
/ department of mathematics and computer science
17-7-2015
PAGE 25
Conclusions
• ILP-based decomposed discovery and ILP-based
decomposed replay
• Discovery can result in the same model in a fraction of
the time
• Replay can result in less costs in less time (trade-off)
/ department of mathematics and computer science
17-7-2015
PAGE 26
Future Work
• Non-maximal decompositions
• Grouping maximally-decomposed clusters may be
beneficial (work of Bart Hompes)
• Splitting large maximally-decomposed clusters may
also be beneficial (cf. original BPI Challenge 2012 log)
• Support for different discovery and replay algorithms
• Merging nets
• Merging alignments
/ department of mathematics and computer science
17-7-2015
PAGE 27