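• China Excellent Science and Technology Journal
  • CCF-recommended Class A Chinese journal
  • T1-class high-quality science and technology journal in the computing field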
Liang Jiye, Qiao Jie, Cao Fuyuan, Liu Xiaolin. A Distributed Representation Model for Short Text Analysis[J]. Journal of Computer Research and Development, 2018, 55(8): 1631-1640. DOI: 10.7544/issn1000-1239.2018.20180233

A Distributed Representation Model for Short Text Analysis

More Information
  • Published Date: July 31, 2018
  • Learning distributed representations of short texts has become an important task in text mining. However, directly applying the traditional Paragraph Vector model is often unsuitable: it does not use corpus-level information during training, so it cannot compensate for the sparse contextual information in short texts. To address this, we propose a novel distributed representation model for short texts called BTPV (biterm topic paragraph vector), which incorporates the topic information of BTM (biterm topic model) into the Paragraph Vector model. The method not only exploits the global information of the corpus but also complements the implicit vectors of Paragraph Vector with the explicit topic information of BTM. For evaluation, we crawl popular news comments from the Internet as experimental data sets and use the K-Means clustering algorithm to compare the representation performance of the models. Experimental results show that BTPV achieves better clustering results than common distributed representation models such as word2vec and Paragraph Vector, which indicates the advantage of the proposed model for short text analysis.
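The evaluation protocol described in the abstract (cluster the document vectors with K-Means, then compare representations by clustering quality) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy vectors, the purity score, the deterministic farthest-point initialization, and above all the simple concatenation of a paragraph vector with a topic distribution. Concatenation is only a stand-in for "adding topic information"; it is not the BTPV training procedure, which the abstract does not specify.

```python
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=20):
    """Plain K-Means with deterministic farthest-point initialization."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        # Pick the point farthest from its nearest existing centroid.
        far = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(list(far))
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid for every point.
        labels = [min(range(k), key=lambda j: sq_dist(p, centroids[j]))
                  for p in points]
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def purity(labels, truth):
    """Fraction of documents that belong to their cluster's majority class."""
    total = 0
    for j in set(labels):
        classes = [t for l, t in zip(labels, truth) if l == j]
        total += Counter(classes).most_common(1)[0][1]
    return total / len(labels)


# Hypothetical toy data: six short texts, two true categories.
truth = [0, 0, 0, 1, 1, 1]
# Made-up "paragraph vectors" that barely separate the categories.
para_vecs = [[0.2, 0.1], [0.6, 0.5], [0.3, 0.3],
             [0.25, 0.15], [0.55, 0.45], [0.4, 0.35]]
# Made-up per-document topic distributions (BTM-like output, not real BTM).
topic_vecs = [[0.9, 0.1], [0.8, 0.2], [0.85, 0.15],
              [0.1, 0.9], [0.2, 0.8], [0.15, 0.85]]
# Stand-in for a topic-aware representation: simple concatenation.
combined = [p + t for p, t in zip(para_vecs, topic_vecs)]

purity_para = purity(kmeans(para_vecs, 2), truth)
purity_combined = purity(kmeans(combined, 2), truth)
print("paragraph vectors only:", purity_para)
print("with topic information:", purity_combined)
```

On this toy data the topic-augmented vectors cluster more cleanly than the paragraph vectors alone, which mirrors the kind of comparison the abstract reports; the numbers themselves carry no evidential weight because the inputs are invented.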
