Elsevier

Information Sciences

Volume 179, Issue 13, 13 June 2009, Pages 2218-2231
Information Sciences

Mining frequent trajectory patterns in spatial–temporal databases

https://doi.org/10.1016/j.ins.2009.02.016Get rights and content

Abstract

In this paper, we propose an efficient graph-based mining (GBM) algorithm for mining the frequent trajectory patterns in a spatial–temporal database. The proposed method comprises two phases. First, we scan the database once to generate a mapping graph and trajectory information lists (TI-lists). Then, we traverse the mapping graph in a depth-first search manner to mine all frequent trajectory patterns in the database. By using the mapping graph and TI-lists, the GBM algorithm can localize support counting and pattern extension in a small number of TI-lists. Moreover, it utilizes the adjacency property to reduce the search space. Therefore, our proposed method can efficiently mine the frequent trajectory patterns in the database. The experimental results show that it outperforms the Apriori-based and PrefixSpan-based methods by more than one order of magnitude.

Introduction

With advances in tracking technologies and the rapid improvement in location-based services, a large amount of spatial–temporal data has been collected in databases [8]. As a result, mining implicit and useful patterns in such databases has attracted increasing attention recently. The entities in a database that change their locations over time are defined as “moving objects”, where the movement (i.e., the trajectory) of an object can be described as a sequence of spatial coordinates, each of which is associated with a time stamp.

Finding the frequently repeated trajectory patterns can help us analyze and predict the movements of objects. Temporal information in the patterns makes many applications that are only concerned with the spatial information more realistic and practical. For example, using temporal information to understand customers’ movement (trajectory) patterns can help us make an appropriate recommendation or a mobile advertisement for them. In ecology, analyzing animals’ movement routes can help us better understand their behavior or detect a strange phenomenon or disaster, especially if some animals’ movement patterns change abruptly.

Therefore, in this paper, we propose an efficient algorithm that takes into account both spatial and temporal attributes to mine the frequent trajectory patterns. The proposed method comprises two phases. First, we scan the database once to generate a mapping graph and trajectory information lists (TI-lists). Then, we apply the proposed graph-based mining (GBM) algorithm to traverse the mapping graph and mine the frequent trajectory patterns in a depth-first search manner. Since our proposed algorithm uses the adjacency property to reduce the search space, we only need to extend a trajectory to the adjacent neighborhoods of the last location of the trajectory pattern. By using the mapping graph and TI-lists, the GBM algorithm can localize support counting and pattern extension to a small number of TI-lists. Thus, it is more efficient than the Apriori-based and PrefixSpan-based algorithms.

The main contributions of this study are summarized as follows: (1) We propose a spatial–temporal pattern to describe a trajectory, where both the spatial and temporal attributes are simultaneously considered. (2) By using the mapping graph, the GBM algorithm can localize support counting and pattern extension to a small number of TI-lists. (3) The GBM algorithm exploits the adjacency property to mine frequent trajectory patterns and reduce the search space. (4) We propose an efficient mining algorithm for discovering frequent trajectory patterns without candidate generation.

The remainder of this paper is organized as follows: Section 2 discusses the related work. Section 3 describes the preliminary concepts and the problem definitions used throughout the paper. Section 4 presents the proposed algorithm in detail. Section 5 illustrates the performance of our approach. Finally, Section 6 describes concluding remarks and future work.

Section snippets

Related work

Mining frequent itemsets is one of the fundamental problems in the area of data mining. An itemset is considered frequent if its support is not less than a user-specified minimum support threshold, where the support of an itemset is defined as the percentage of transactions in the database that contain the itemset. Many itemset mining algorithms [1], [15], [17], [24], [36], [38], [39] have been proposed for mining the frequent itemsets in a database. Agrawal et al. [1] proposed an Apriori

Preliminaries and problem definitions

A reference space of size e × e is a 2-dimensional space, where the reference space is a square, the length and width of the reference space are equal to e, e is an integer, and e  1. The x- and y-coordinates of a point gi in the reference space are denoted by gi · x and gi · y, respectively, where gi · x and gi · y are integers, 1  gi · x  e, and 1  gi · y  e. Consider a spatial–temporal database D = {T1, T2,  , Tu}, where Ti is the trajectory of a moving object, Ti = 〈(x1, y1, t1), (x2, y2, t2),  , (xm, ym, tm)〉, (xj, yj) is

The proposed method

In this section, we discuss the proposed GBM (graph-based mining) algorithm for mining the frequent patterns. As mentioned earlier, the proposed algorithm comprises two phases. First, we transform all trajectories in the database into a mapping graph. For each vertex in the mapping graph, we use a data structure, called a Trajectory Information list (TI-list), to record all trajectories that pass through the vertex. Then, we apply the proposed mining algorithm to find all frequent patterns. We

Performance evaluation

We conducted experiments to compare the proposed method with the modified MP (Moving Pattern mining) algorithm [9] (hereafter MP) and the modified PrefixSpan algorithm (hereafter PrefixSpan).

The original PrefixSpan algorithm is designed to mine sequential patterns over sequence databases and not trajectory patterns over trajectory databases, where time is not explicit as it is in a trajectory. Thus, the modified PrefixSpan algorithm needs to consider how to embed the time constraint into the

Conclusions and future work

In this paper, we have proposed an efficient graph-based mining (GBM) algorithm for mining frequent trajectory patterns. The algorithm comprises two phases. In the first phase, we transform all trajectories in the database into a mapping graph. Then, we create a TI-list for each vertex to record all trajectories that pass through the vertex. In the second phase, using the information recorded in TI-lists, the proposed algorithm traverses the mapping graph in a depth-first search manner to mine

Acknowledgements

The authors are grateful to the anonymous referees for their helpful comments and suggestions. This research was supported in part by the National Science Council of Republic of China under Grant No. NSC 97-2410-H-002-117.

References (41)

  • U. Yun

    A new framework for detecting weighted sequential patterns in large sequence databases

    Knowledge-Based Systems

    (2008)
  • R. Agrawal, T. Imielinski, A. Swami, Mining association rules between sets of items in large databases, in: Proceedings...
  • J, Ayres, J.E. Gehrke, T. Yiu, J. Flannick, Sequential pattern mining using a bitmap representation, in: Proceedings of...
  • A. Brilingaite, C.S. Jensen, N. Zokaite, Enabling routes as context in mobile services, in: Proceedings of the 12th...
  • H. Cao, N. Mamoulis, D.W. Cheung, Mining frequent spatio-temporal sequential patterns, in: Proceedings of the 5th IEEE...
  • H. Cao, N. Mamoulis, D.W. Cheung, Discovery of collocation episodes in spatiotemporal data, in: Proceedings of the...
  • H. Cao et al.

    Discovery of periodic patterns in spatiotemporal sequence

    IEEE Transactions on Knowledge and Data Engineering

    (2007)
  • J.D. Chung, O.H. Paek, J.W. Lee, K.H. Ryu, Temporal moving pattern mining for location-based service, in: Proceedings...
  • M. Garofalakis et al.

    Mining sequential patterns with regular expression constraints

    IEEE Transactions on Knowledge and Data Engineering

    (2002)
  • F. Giannotti, M. Nanni, D. Pedreschi, F. Pinelli, Mining sequences with temporal annotations, in: Proceedings of the...
  • Cited by (103)

    • An indoor trajectory frequent pattern mining algorithm based on vague grid sequence

      2019, Expert Systems with Applications
      Citation Excerpt :

      In the data preprocessing phase, the trajectory sequence is transformed into a sequence of regions of interest, and regions of interest are divided into two categories according to the spatial discretization: default regions of interest and hot regions of interest. Lee, Chen, and Ip (2009) proposed a trajectory frequent pattern mining algorithm GBM based on graph, which firstly scans the database once to generate a mapping graph and trajectory information lists(TI-lists), and then traverses the mapping graph by depth-first walk to mine the trajectory frequent patterns. Luo, Tan, Chen, and Ni (2013) investigated the most frequent trajectory query based on time cycle to analyze the most frequent road choice of most pedestrians.

    View all citing articles on Scopus
    View full text