1,721,063 research outputs found
Using interactions and dynamics for mining groups of moving objects from trajectory data
Advances in tracking technology enable the gathering of spatio-temporal data in the form of trajectories, which when analysed can convey useful knowledge. In particular, discovering groups of moving objects is a valuable means for a wide class of problems related to mobility. The task of group mining has been investigated by considering mostly the spatial closeness and similarity of the trajectories, while little attention has been paid to the relationships between the trajectories and time-changing nature of the trajectories. The relationships may provide evidence of interactions between the moving objects. The time-changing nature may provide evidence of dynamics of the movements. Therefore, interactions and dynamics can be sources of information to be considered in order to discover new forms of groups. Motivated by this, we introduce the concept of crews and propose a method to discover crews. A crew gathers moving objects with similar interactions and similar dynamics. The proposed method relies on i) new movement parameters, which explicitly consider interactions and dynamics, and ii) a distance-free clustering algorithm, which groups objects based on the similarity of the movement parameters. We conduct extensive experiments, which include a quantitative evaluation of the quality of the crews and comparison with alternative solutions
Mining Temporal Associations Between Air Pollution and Effects on the Human Health
The task of monitoring and improving the urban air quality has attracted a great deal of interest both from national governments and scientific communities. In order to implement policies for the environmental protection, the recent
urban planning decisions are often based on the results produced by several research
fields. An important research direction aims at understanding the pollution phenomenon by means of data mining approaches, which support decision makers with
information extracted directly from data. In this work we investigate the effect of air pollution on human health by taking into account the temporal variability of environmental data. Since the repercussions of air pollution on humans are perceived
only after a certain lapse of time, we propose to discover temporal associations
which relate a change at time tj of the population health conditions with a change
at time ti (tj>ti) of the polluting emissions. Information conveyed by discovered temporal associations could be exploited both to support policies for environmental
protection and to adopt strategies for the reduction of human health risks
Samples_for_NoDistance
This dataset contains the samples builts on 10 iterations with the version of the systems which use no weighting schem
Samples_for_DescriptiveandCollectiveDistances
This dataset contains the sampels builts on 10 iterations with the version of the system which combines Descriptive Distance and Collective Distance as weighting schem
Original datasets with the full training data
This dataset contains the original real-world data network
Samples_for_DescriptiveDistance
This dataset contains the samples built on 10 iterations with the version of the system which uses a Descriptive Distance weighting schem
Samples_for_CollectiveDistance
This dataset contains the samples built on 10 iterations with the version of the system which uses the collective distance weighting schem
Dataset_folder
The folder has the seventeen network datasets we used for a network regression task. The folder contains the files used for a 5-cross validation experiment replicated by 5 trials. The datasets are organized in 4 categories of files: namefile_arffTest_N (the testing file for the trial N-th), namefile_arffTrain_M (the training file for the trial M-th), namefile_arffTrain_P_arffSampleQ (the sample for Q-th trial at P-th fold). Also, a file of the distances of the edges is associated to each dataset
Dataset_folder
The folder has the seventeen network datasets we used for a network regression task. The folder contains the files used for a 5-cross validation experiment replicated by 5 trials. The datasets are organized in 4 categories of files: namefile_arffTest_N (the testing file for the trial N-th), namefile_arffTrain_M (the training file for the trial M-th), namefile_arffTrain_P_arffSampleQ (the sample for Q-th trial at P-th fold). Also, a file of the distances of the edges is associated to each dataset
- …
