1,721,005 research outputs found
Heuristics Miner for Time Intervals
Process Mining attempts to reconstruct the workflow of a
business process from logs of activities. This task is quite important in
business scenarios where there is not a well understood and structured
definition of the business process performed by workers. Activities logs
are thus mined in the attempt to reconstruct the actual business process.
In this paper, we propose the generalization of a popular process mining
algorithm, named Heuristics Miner, to time intervals. We show that the
possibility to use, when available, time interval information for the per-
formed activities allows the algorithm to produce better workflow models
Automatic determination of parameters' values for Heuristics Miner++
The choice of parameters' values for noise-tolerant Process Mining algorithms is not trivial, especially for users that are not expert in Process Mining. Exhaustive exploration of all possible set of values is not feasible, since several parameters are real-valued. Selecting the “right” values, however, is important, since otherwise the control-flow network returned by the mining can be quite far from the correct one. Here we face this problem for a specific Process Mining algorithm, i.e. Heuristics Miner++. We recognize that the domain of real-valued parameters can be actually partitioned into a finite number of equivalence classes and we suggest exploring the parameters space by a local search strategy driven by a Minimum Description Length principle. We believe that the proposed approach is sufficiently general to be used for other Process Mining algorithms. Experimental results on a set of randomly generated process models show promising results
PLG: a Framework for the Generation of Business Process Models and their Execution Logs
Evaluating process mining algorithms would require the avail-
ability of a suite of real-world business processes and their execution logs,
which hardly are available. In this paper we propose an approach for the
random generation of business processes and their execution logs. The
proposed approach is based on the generation of process descriptions
via a stochastic context-free grammar whose definition is based on well-
known process patterns. An algorithm for the generation of execution
instances is also proposed. The implemented tools are publicly available
Ectopic secretion of growth hormone-releasing hormone (GHRH) in neuroendocrine tumors: relevant clinical aspects.
Process Mining Meets Statistical Model Checking: Towards a Novel Approach to Model Validation and Enhancement
We propose a novel research line integrating Statistical Model Checking (SMC), a family of simulation-based analysis techniques from quantitative formal methods, with Process Mining (PM), a collection of data-driven process-oriented techniques. SMC and PM are complementary. SMC focuses on performing the right number of simulations to obtain statistically-reliable estimations (e.g., the probability of success of an attack). PM focuses on reconstructing a model of a system using logs of its traces. Nevertheless, both approaches aim at providing evidence of issues/guarantees of the system, and at proposing enhancements. We aim at enriching SMC by explaining why it produced specific estimates. This might help, e.g., identifying issues in the model (validation) or suggesting improvements (enhancement). Given that SMC uses statistics to decide what is the correct number of simulations (or traces), we avoid by-construction the complex issue of under-representation of system behavior in the logs crucial to many PM exercises. This work-in-progress paper demonstrates the proposed methodology and its usefulness using a simple example from the security threat modeling domain. We show how PM helps highlighting both mistakes in the model, and possibilities for improvement
A Purpose-Guided Log Generation Framework
Process mining is a prominent discipline that collects a variety of techniques fulfilling different mining purposes by gathering information from event logs. This involves the continuous necessity of event logs suitable for testing mining techniques with respect to different purposes. Unfortunately, event logs are hard to find and usually contain noise that can influence the results of a mining technique. In this paper, we propose a framework for generating event logs tailored for different mining purposes, e.g., process discovery and conformance checking. Event logs generation and tuning are made out through business model simulations guided by the mining purpose under consideration. Beyond defining the framework, we implemented it as a tool, which has been also used for the validation of the approach we propose
Going Beyond Counting First Authors in Author Co-citation Analysis
The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation
counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings
are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that
only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into
account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed
Variations on the Author
“Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship
Appropriate Similarity Measures for Author Cocitation Analysis
We provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authors’ cocitation profiles is not very satisfactory. We then discuss what kind of similarity measures may be used as an alternative to the Pearson correlation. We consider three similarity measures in particular. One is the well-known cosine. The other two similarity measures have not been used before in the bibliometric literature. Finally, we show by means of an example that our findings have a high practical relevance.information science;Pearson correlation;cosine;similarity measure;author cocitation analysis
- …
