Xi Xuefeng, Chu Xiaomin, Sun Qingying, Zhou Guodong. Corpus Construction for Chinese Discourse Topic via Micro-Topic Scheme[J]. Journal of Computer Research and Development, 2017, 54(8): 1833-1852. DOI: 10.7544/issn1000-1239.2017.20170348

Corpus Construction for Chinese Discourse Topic via Micro-Topic Scheme

  • Published Date: July 31, 2017
  • Discourse topic structure analysis is fundamental to natural language understanding. However, the lack of large, high-quality discourse corpora suitable for Chinese discourse analysis has seriously restricted research on computational models of discourse topics. To address this problem, we first study the theoretical representation system of Chinese discourse topic structure. Drawing on theme-rheme theory, Rhetorical Structure Theory (RST) for English, the Penn Discourse Treebank (PDTB) framework, and research on Chinese complex sentences and sentence groups, and taking the characteristics of Chinese into account, we propose a Chinese discourse micro-topic scheme based on theme-rheme theory and construct a representation model of Chinese discourse topic structure based on the topic chain. On this basis, we adopt a top-down, backward-search annotation strategy and a human-machine collaborative annotation method to construct the Chinese Discourse Topic Corpus (CDTC). We then carry out a detailed statistical analysis of the CDTC, which contains 500 documents. Compared with the OntoNotes corpus and the generalized topic structure theory, the micro-topic scheme has some theoretical advantages and is consistent with the characteristics of Chinese. Finally, a consistency test shows that the CDTC can fully reflect the difficulty of Chinese discourse topic analysis and can support related research.