Yang Bei, Huang Houkuan. Mining Top-K Significant Itemsets in Landmark Windows over Data Streams[J]. Journal of Computer Research and Development, 2010, 47(3): 463-473.

Mining Top-K Significant Itemsets in Landmark Windows over Data Streams

  • Published Date: March 14, 2010
  • Frequent itemset mining over data streams has recently become a hot topic in data mining and knowledge discovery and has been applied in many areas. However, setting a minimum support threshold requires domain knowledge, and a poorly chosen threshold places a heavy burden on users. It is therefore attractive to let users ask for the top-K significant itemsets over a data stream instead. This paper presents TOPSIL-Miner, a dynamic, incremental, approximate algorithm for mining top-K significant itemsets in landmark windows. A new data structure, TOPSIL-Tree, stores the potentially significant itemsets, and four auxiliary structures (the maximum support list, the ordered item list, TOPSET, and the minimum support list) maintain information about the mining results. Three optimization strategies reduce the time and space cost of the algorithm: 1) pruning trivial nodes in the current data stream; 2) raising the mining support threshold heuristically and adaptively during mining; and 3) raising the pruning threshold dynamically. The accuracy of the algorithm is also analyzed. Extensive experiments show that the algorithm is effective, efficient, and precise.
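
To make the problem setting concrete, the following is a minimal sketch of top-K itemset mining over a landmark window. It is not TOPSIL-Miner: it counts every itemset exactly instead of approximating, and it uses no TOPSIL-Tree or pruning. It assumes that significance is measured by support count, which the abstract suggests but does not define. The class name `NaiveLandmarkTopK` and the `max_len` cap on itemset size are hypothetical and exist only for this illustration.

```python
from collections import Counter
from itertools import combinations
import heapq


class NaiveLandmarkTopK:
    """Exact top-K itemset counter over a landmark window (illustrative only)."""

    def __init__(self, k, max_len=2):
        self.k = k                # number of itemsets to report
        self.max_len = max_len    # hypothetical cap on itemset size, to bound memory
        self.counts = Counter()   # support count of each itemset since the landmark
        self.n = 0                # number of transactions since the landmark

    def add(self, transaction):
        """Consume one transaction and update support counts incrementally."""
        items = sorted(set(transaction))
        self.n += 1
        for size in range(1, min(self.max_len, len(items)) + 1):
            for itemset in combinations(items, size):
                self.counts[itemset] += 1

    def top_k(self):
        """Return the K itemsets with the highest support; ties break by itemset order."""
        return heapq.nsmallest(
            self.k, self.counts.items(), key=lambda kv: (-kv[1], kv[0])
        )

    def implied_threshold(self):
        """Support count of the K-th itemset, or 0 if fewer than K itemsets were seen."""
        top = self.top_k()
        return top[-1][1] if len(top) == self.k else 0


# Usage: the landmark is the start of the stream, so every transaction counts.
miner = NaiveLandmarkTopK(k=3)
for t in [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b"], ["a", "b", "c"]]:
    miner.add(t)

print(miner.top_k())              # [(('a',), 4), (('b',), 4), (('a', 'b'), 3)]
print(miner.implied_threshold())  # 3
```

The value returned by `implied_threshold` is the support threshold a user would otherwise have to guess in advance. Because this sketch keeps a counter for every itemset it has ever seen, its memory grows without bound on a real stream, which is the cost that the pruning and threshold-raising strategies described in the abstract are meant to reduce.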
