一种基于树的频繁模式挖掘算法

刘君强;潘云鹤

系统工程理论与实践 ›› 2003, Vol. 23 ›› Issue (5) : 108-112.

PDF(234 KB)
PDF(234 KB)
系统工程理论与实践 ›› 2003, Vol. 23 ›› Issue (5) : 108-112. DOI: 10.12011/1000-6788(2003)5-108
论文

一种基于树的频繁模式挖掘算法

    刘君强(1),潘云鹤(2)
作者信息 +

A Tree-Based Frequent Pattern Mining Algorithm

    Jun Qiang LIU(1), Yun he PAN(2)
Author information +
文章历史 +

摘要

提出了一种基于树的频繁模式挖掘算法TBA-FP。它以树表示法压缩数据库所含模式信息,将挖掘问题转化为按深度优先策略构造频繁模式树,并引入了虚拟裁剪等优化技术。实验表明,TBA-FP挖掘“长”模式的时间效率与空间可伸缩性远远优于经典算法Apriori.

Abstract

A tree-based algorithm for mining of frequent patterns, TBA-FP, is proposed in this paper. The term, tree-based, has two meanings. First, a tree-based representation is used to compress the pattern information in the database. Second, the mining process is regarded as a process of constructing a frequent pattern tree in depth first order. Optimizing techniques, such as pseudo clipping, is discussed. Experiments show that TBA-FP is much more efficient and scalable than the classical algorithm, Apriori.

关键词

频繁模式 / 数据挖掘 / 海量数据库

Key words

frequent patterns / data mining / very large databases

引用本文

导出引用
刘君强 , 潘云鹤. 一种基于树的频繁模式挖掘算法. 系统工程理论与实践, 2003, 23(5): 108-112 https://doi.org/10.12011/1000-6788(2003)5-108
Jun Qiang LIU , Yun he PAN. A Tree-Based Frequent Pattern Mining Algorithm. Systems Engineering - Theory & Practice, 2003, 23(5): 108-112 https://doi.org/10.12011/1000-6788(2003)5-108
PDF(234 KB)

285

Accesses

0

Citation

Detail

段落导航
相关文章

/