• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Zhang Jingwei, Liu Shaojian, Yang Qing, Zhou Ya. DMFUCP: A Distributed Mining Framework for Universal Companion Patterns on Large-Scale Trajectory Data[J]. Journal of Computer Research and Development, 2022, 59(3): 647-660. DOI: 10.7544/issn1000-1239.20200648
Citation: Zhang Jingwei, Liu Shaojian, Yang Qing, Zhou Ya. DMFUCP: A Distributed Mining Framework for Universal Companion Patterns on Large-Scale Trajectory Data[J]. Journal of Computer Research and Development, 2022, 59(3): 647-660. DOI: 10.7544/issn1000-1239.20200648

DMFUCP: A Distributed Mining Framework for Universal Companion Patterns on Large-Scale Trajectory Data

Funds: This work was supported by the National Natural Science Foundation of China (61862013, 61662015, U1811264, U1711263), the Natural Science Foundation of Guangxi Aotonomous Region of China (2020GXNSFAA159117, 2018GXNSFAA281199, 2017GXNSFAA198035), the Key Project of Guangxi Key Laboratory of Trusted Software (KX202052), and the Foundation of Guangxi Key Laboratory of Automatic Detection Technology and Instrument (YQ19109).
More Information
  • Published Date: February 28, 2022
  • The popularity of mobile positioning terminals makes users’ locations be easily accessible, which contributes huge amount of trajectory data. Universal companion pattern mining aims at discovering those highly overlapping behavior paths between moving objects in spatio-temporal dimensions, and it is very valuable and challenging to provide effective and efficient pattern mining methods on large-scale trajectories. Obviously, the mining strategy on a centralized environment is incompetent for the consideration of scalability caused by huge and growing trajectory data. Existing distributed mining frameworks are weak in both providing effective input for efficient pattern mining and the processing ability on a large number of loose connections in massive trajectories, which should be covered to improve mining performance. In this study, we propose a distributed two-stage mining framework, DMFUCP, which embeds optimization on data preprocessing and loose connection analysis to provide more efficient and effective universal companion pattern mining. In the data preprocessing stage of DMFUCP, we design both a density clustering algorithm DBSCANCD and a clustering balance algorithm TCB to input high-quality trajectory data with less noisy for mining tasks. In the mining stage of DMFUCP, we propose both a G pruning repartition algorithm GSPR and a segmented enumeration algorithm SAE. GSPR introduces a parameter G to segment long trajectories and then repartitions all segments to improve the processing effectiveness on loose connections. SAE guarantees the mining performance through multithreading and forward closure. Compared with those existing companion pattern mining frameworks on real datasets, DMFUCP reduces the time required to mine each set of universal companion pattern by 20% to 40% while providing better universal companion pattern discovery capabilities.
  • Related Articles

    [1]Wu Jiansheng, Feng Qiaoyu, Yuan Jingzhou, Hu Haifeng, Zhou Jiate, Gao Hao. Predicting Biological Functions of G Protein-Coupled Receptors Based on Fast Multi-Instance Multi-Label Learning[J]. Journal of Computer Research and Development, 2018, 55(8): 1674-1682. DOI: 10.7544/issn1000-1239.2018.20180361
    [2]Ouyang Dantong, Zhi Huayun, Liu Bowen, Zhang Liming, Zhang Yonggang. A Method of Computing Minimal Diagnoses Based on Pseudo-Failure-Degree to Create New Enumeration Tree[J]. Journal of Computer Research and Development, 2018, 55(4): 782-790. DOI: 10.7544/issn1000-1239.2018.20170016
    [3]Long Jianwu, Shen Xuanjing, and Chen Haipeng. Interactive Document Images Thresholding Segmentation Algorithm Based on Image Regions[J]. Journal of Computer Research and Development, 2012, 49(7): 1420-1431.
    [4]Tang Yang, Pan Zhigeng, Tang Min, Pheng Ann Heng, Xia Deshen. Image Segmentation with Hierarchical Mean Shift[J]. Journal of Computer Research and Development, 2009, 46(9): 1424-1431.
    [5]Wang Wenhui, Feng Qianjin, Chen Wufan. Segmentation of Brain MR Images Based on the Measurement of Difference of Mutual Information and Gauss-Markov Random Field Model[J]. Journal of Computer Research and Development, 2009, 46(3): 521-527.
    [6]Hao Zhongxiao, Wang Yudong, He Yunbin. Line Segment Nearest Neighbor Query of Spatial Database[J]. Journal of Computer Research and Development, 2008, 45(9): 1539-1545.
    [7]Yang Yuedong, Wang Lili, and Hao Aimin. Motion String: A Motion Capture Data Representation for Behavior Segmentation[J]. Journal of Computer Research and Development, 2008, 45(3): 527-534.
    [8]Luo Zhiyong, Song Rou. Disambiguation in a Modern Chinese General-Purpose Word Segmentation System[J]. Journal of Computer Research and Development, 2006, 43(6): 1122-1128.
    [9]Ren Jinping and Lü Shuwang. Enumerations and Counting of Orthomorphic Permutations[J]. Journal of Computer Research and Development, 2006, 43(6): 1071-1075.
    [10]Xiao Hui and Hi Yunfa. Data Mining Based on Segmented Time Warping Distance in Time Series Database[J]. Journal of Computer Research and Development, 2005, 42(1): 72-78.
  • Cited by

    Periodical cited type(4)

    1. 余舒鹏,吴春雨,赵斌,吉根林. 面向时空轨迹流的共同运动模式分布式挖掘算法. 数据采集与处理. 2024(05): 1163-1181 .
    2. 陈佐虎,彭振国,何海燕,窦蕴韬. 大数据环境下基于图嵌入的电力用户行为模式挖掘与预测. 信息技术与信息化. 2024(12): 129-132 .
    3. 李建华,王泽鼎. 考虑路径耗时的城市汽车分布式充电桩选点规划. 吉林大学学报(工学版). 2023(08): 2298-2303 .
    4. 张康威,张敬伟,杨青,胡晓丽,单美静. DCPFS:分布式轨迹流伴随模式挖掘框架. 计算机科学. 2022(S2): 270-279 .

    Other cited types(2)

Catalog

    Article views (284) PDF downloads (122) Cited by(6)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return