• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Tan Wentang, Wang Zhenwen, Yin Fengjing, Ge Bin, and Xiao Weidong. A Partial Comparative Cross Collections LDA Model[J]. Journal of Computer Research and Development, 2013, 50(9): 1943-1953.
Citation: Tan Wentang, Wang Zhenwen, Yin Fengjing, Ge Bin, and Xiao Weidong. A Partial Comparative Cross Collections LDA Model[J]. Journal of Computer Research and Development, 2013, 50(9): 1943-1953.

A Partial Comparative Cross Collections LDA Model

More Information
  • Published Date: September 14, 2013
  • Comparative text mining like spatiotemporal and cross-cultural text mining is concerned with extracting common and unique themes from a set of comparable text collections. State-of-the-art cross collections topic models suffer from the important flaw that they can only analyze the common topics among document collections. We introduce a generative topic model PCCLDA(partial comparative cross collections LDA) for multi-collections CTM to detect both common topics and collection-special topics,and model text more exactly based on hierarchical dirichlet processes. We present a Gibbs sampling for model inference, and evaluate the model by a variety of qualitative and quantitative evaluations including model perplexity and log-likelihood measurements. PCCLDA discovers both common topics among collections and collection special topics, and also shows improvements on model perplexity and Held-Out likehood compared with two main CTM topic models.
  • Related Articles

    [1]Wang Guohua, David Hung-Chang Du, Wu Fenggang, Liu Shiyong. Survey on High Density Magnetic Recording Technology[J]. Journal of Computer Research and Development, 2018, 55(9): 2016-2028. DOI: 10.7544/issn1000-1239.2018.20180264
    [2]He Wenbin, Liu Qunfeng, Xiong Jinzhi. The Error Theory of Polynomial Smoothing Functions for Support Vector Machines[J]. Journal of Computer Research and Development, 2016, 53(7): 1576-1585. DOI: 10.7544/issn1000-1239.2016.20148462
    [3]Bi Anqi, Dong Aimei, Wang Shitong. A Dynamic Data Stream Clustering Algorithm Based on Probability and Exemplar[J]. Journal of Computer Research and Development, 2016, 53(5): 1029-1042. DOI: 10.7544/issn1000-1239.2016.20148428
    [4]Wang Lijin, Zhong Yiwen, Yin Yilong. Orthogonal Crossover Cuckoo Search Algorithm with External Archive[J]. Journal of Computer Research and Development, 2015, 52(11): 2496-2507. DOI: 10.7544/issn1000-1239.2015.20148042
    [5]Xu Min, Deng Zhaohong, Wang Shitong, Shi Yingzhong. MMCKDE: m-Mixed Clustering Kernel Density Estimation over Data Streams[J]. Journal of Computer Research and Development, 2014, 51(10): 2277-2294. DOI: 10.7544/issn1000-1239.2014.20130718
    [6]Shen Yue, Guo Longjiang, Li Jinbao. Density and Distance Based Probabilistic Broadcasting Algorithm in Mobile Sensor Networks[J]. Journal of Computer Research and Development, 2014, 51(1): 151-160.
    [7]Zong Dan, Li Chunpeng, Xia Shihong, Wang Zhaoqi. Key-Postures Based Automated Construction of Motion Graph[J]. Journal of Computer Research and Development, 2010, 47(8): 1321-1328.
    [8]Xiong Jinzhi, Yuan Huaqiang, Peng Hong. A General Formulation of Polynomial Smooth Support Vector Machines[J]. Journal of Computer Research and Development, 2008, 45(8): 1346-1353.
    [9]Song Yuqing, Xie Conghua, Zhu Yuquan, Li Cunhua, Chen Jianmei, Wang Lijun. Research on Medical Image Clustering Based on Approximate Density Function[J]. Journal of Computer Research and Development, 2006, 43(11): 1947-1952.
    [10]Chen Jun and Wang Guojin. Constructing Convexity-Preserving Interpolation Curves of Hyperbolic Polynomial B-Splines Using a Shape Parameter[J]. Journal of Computer Research and Development, 2006, 43(7): 1216-1224.

Catalog

    Article views (1028) PDF downloads (635) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return