Generating Corpus for Evaluating Performance of Process Matching Techniques
DOI:
https://doi.org/10.21015/vtcs.v15i3.525Abstract
Business process management (BPM) plays a vital role in organizations management. A central piece to that is the collection of business process models. Depending upon the size of the organization, the collection may have many process models in the process repository. A key feature to such a repository is searching of process models which requires computing similarity between a pair of process models. For a given pair of process models, similarity refers to finding whether the two process models that form the pair are similar or not. To compute the similarity between process models, several techniques have been established however a rigorous evaluation of these techniques has either not been conducted on numerous occasions or the evaluation has not been sufficiently rigorous. A key reason to that is the absence of a benchmark set of queries and their relevant process models, as a judge by human experts. In this study, we argue, the fewer queries used for evaluation may not have the necessary diversity to challenge the abilities of the matching techniques. This work is usually not done due to a large number of manual comparisons. It is thus required a pool of queries. A related challenge is to identify a pool of process models that are declared as relevant to the query models. To address these challenges, we have suggested a technique.References
. Houy, C., Fettke, P., & Loos, P. (2010). Empirical research in business process management – analysis of an emerging field of research. Business Process Management Journal, 16(4), 619–661. https://doi.org/10.1108/14637151011065946
. Weske, M. (2010). Business Process Management: Concepts, Languages, Architectures (1st ed.). Springer Publishing Company, Incorporated.
. White, S. A. (2008). BPMN Modeling and Reference Guide: Understanding and Using BPMN. Future Strategies Inc.
. AjmoneMarsan, M., Balbo, G., Conte, G., Donatelli, S., &Franceschinis, G. (1995). Modelling with generalized stochastic Petri nets.
. van der Aalst, W. M. P., &terHofstede, A. H. M. (2005). YAWL: Yet Another Workflow Language. Inf. Syst., 30(4), 245–275. https://doi.org/10.1016/j.is.2004.02.002
. Leopold, H., Mendling, J., &Polyvyanyy, A. (2012). Generating Natural Language Texts from Business Process Models. In Proceedings of the 24th International Conference on Advanced Information Systems Engineering (pp. 64–79). Berlin, Heidelberg: Springer-Verlag. https://doi.org/10.1007/978-3-642-31095-9_5
. Phios Process Repository. (n.d.). Retrieved March 9, 2018, from http://process.mit.edu/
. Ceri, S., Abid, A., Helou, M. A., Barbieri, D., Bozzon, A., Braga, D., … Valle, E. D. (2010). Search Computing: Managing complex search queries. IEEE Internet Computing, 14(6), 14–22.
. Antunes, G., Bakhshandeh, M., Borbinha, J., Cardoso, J., Dadashnia, S., Francescomarino, C. D., …Ghidini, C. (2015). The process model matching contest 2015. Enterprise Modelling and Information Systems Architectures.
. Dijkman, R., Dumas, M., &García-Bañuelos, L. (2009). Graph Matching Algorithms for Business Process Model Similarity Search. In Proceedings of the 7th International Conference on Business Process Management (pp. 48–63). Berlin, Heidelberg: Springer-Verlag. https://doi.org/10.1007/978-3-642-03848-8_5
. Weidlich, M., Dijkman, R., &Mendling, J. (2010). The ICoP Framework: Identification of Correspondences Between Process Models. In Proceedings of the 22Nd International Conference on Advanced Information Systems Engineering (pp. 483–498). Berlin, Heidelberg: Springer-Verlag. Retrieved from http://dl.acm.org/citation.cfm?id=1883784.1883832
. Dijkman, R., Dumas, M., van Dongen, B., Käärik, R., &Mendling, J. (2011). Similarity of Business Process Models: Metrics and Evaluation. Inf. Syst., 36(2), 498–516. https://doi.org/10.1016/j.is.2010.09.006
. Luu, V.-T., Ripken, M., Forestier, G., Fondement, F., & Muller, P.-A. (2016). Using Glocal Event Alignment for Comparing Sequences of Significantly Different Lengths (Vol. 9729, pp. 58–72). https://doi.org/10.1007/978-3-319-41920-6_5
. Allison, L., & Dix, T. I. (1986). A bit-string longest-common-subsequence algorithm. Information Processing Letters, 23(5), 305–310. https://doi.org/10.1016/0020-0190(86)90091-8
. Mullin, R. (1985). Time warps, string edits, and macromolecules: The theory and practice of sequence comparison. edited by D. Sankoff and J. B. Kruskal. Addison-Wesley Publishing Company, Inc., Advanced Book Program, Reading, Mass., Don Mills, Ontario, 1983. 300 pp. U. S. $31.95. ISBN 0-201-07809-0. Canadian Journal of Statistics, 13(2), 167–168. https://doi.org/10.2307/3314879
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC-By) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
This work is licensed under a Creative Commons Attribution License CC BY