myBUD中多媒体数据索引CFTree*的研究和实现
Research and Implementation of CFTree* Index on MultiMedia Data in myBUD
-
摘要: 图片、音频、视频、网页等非结构化数据的高速增长使得如何高效管理它们成为一大挑战.提出的多媒体数据索引CFTree*是非结构化数据管理系统平台myBUD中对多媒体数据进行管理的具体研究和实现.CFTree*是基于簇特征树的层次树索引结构,可用于基于内容的近似kNN查询.实验表明,基于CFTree*索引结构的近似kNN查询性能比基于顺序扫描的kNN查询有60%左右的提高.与精确kNN相比,基于CFTree*索引的近似kNN查询结果与查询对象的平均相似度略低于精确kNN结果,但结果的多样性则优于精确kNN结果.Abstract: The ever-growing unstructure data, such as image, video, audio, web page, etc., bring the challenge of effective unstructured data management(USDM). In this work, CFTree* is proposed to index and manage the multimedia data in our USDM platform—myBUD. The CFTree* index is a hierarchical indexing structure built on the top of cluster feature tree. CFTree* can be leveraged in the approximate kNN query processing. The experimental result shows that the query performance of approximate kNN query based on CFTree* gains about 60% improvement over thaton sequence scan. The approximate kNN query result has lower average precision than exact kNN query result, but it has more diversity.
-
Keywords:
- unstructured data management(USDM) /
- clustering /
- CFTree* /
- multimedia index /
- approximate kNN query
-
计量
- 文章访问数: 777
- HTML全文浏览量: 0
- PDF下载量: 360