• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Li Guochen, Wang Ruibo, Li Jihong. Automatic Labeling of Chinese Functional Chunks Based on Conditional Random Fields Model[J]. Journal of Computer Research and Development, 2010, 47(2): 336-343.
Citation: Li Guochen, Wang Ruibo, Li Jihong. Automatic Labeling of Chinese Functional Chunks Based on Conditional Random Fields Model[J]. Journal of Computer Research and Development, 2010, 47(2): 336-343.

Automatic Labeling of Chinese Functional Chunks Based on Conditional Random Fields Model

More Information
  • Published Date: February 14, 2010
  • In the schema of Chinese chunking, the words are firstly combined into base-chunks, and then the base-chunks are further combined into functional chunks, and finally formalized into a hierarchical syntactic structure. In this paper, the problem of automatic labeling of Chinese functional chunks is modeled as a sequential labeling task, and then words and base chunks are regarded as labeling units of the Chinese functional chunk labeling models. For each of the labeling models a series of new features on the level of base-chunks are constructed, and conditional random fields model is employed in the model. The data set in the experiments is selected from Tsinghua Chinese Treebank (TCT) corpus, and split into train set and test set according to the proportion of 8∶2. The experimental results show that in comparison with the model in which the feature set at word level is only used, the addition of some base-chunk features can significantly improve the performance of functional chunk labeling. The proposed functional chunk labeling method based on human-corrected base-chunks can achieve precision of 88.47%, recall of 89.93% and F-measure of 89.19%. When auto-parsed base-chunks are used, the labeling of Chinese functional chunks achieves precision of 84.27%, recall of 85.57% and F-measure of 84.92%.
  • Related Articles

    [1]Wei Zhenkai, Cheng Meng, Zhou Xiabing, Li Zhifeng, Zou Bowei, Hong Yu, Yao Jianmin. Convolutional Interactive Attention Mechanism for Aspect Extraction[J]. Journal of Computer Research and Development, 2020, 57(11): 2456-2466. DOI: 10.7544/issn1000-1239.2020.20190748
    [2]Du Shengdong, Li Tianrui, Yang Yan, Wang Hao, Xie Peng, Horng Shi-Jinn. A Sequence-to-Sequence Spatial-Temporal Attention Learning Model for Urban Traffic Flow Prediction[J]. Journal of Computer Research and Development, 2020, 57(8): 1715-1728. DOI: 10.7544/issn1000-1239.2020.20200169
    [3]Wang Lu, Du Yuyue, Qi Hongda. Process Model Repair Based on Firing Sequences[J]. Journal of Computer Research and Development, 2018, 55(3): 585-601. DOI: 10.7544/issn1000-1239.2018.20160838
    [4]Zhang Hongbin, Ji Donghong, Yin Lan, Ren Yafeng, Niu Zhengyu. Caption Generation from Product Image Based on Tag Refinement and Syntactic Tree[J]. Journal of Computer Research and Development, 2016, 53(11): 2542-2555. DOI: 10.7544/issn1000-1239.2016.20150906
    [5]Wang Jinshui, Weng Wei, Peng Xin. Recovering Traceability Links Using Syntactic Analysis[J]. Journal of Computer Research and Development, 2015, 52(3): 729-737. DOI: 10.7544/issn1000-1239.2015.20131308
    [6]Tian Feng, Shen Xukun. Image Annotation by Semantic Neighborhood Learning from Weakly Labeled Dataset[J]. Journal of Computer Research and Development, 2014, 51(8): 1821-1832. DOI: 10.7544/issn1000-1239.2014.20121087
    [7]Wan Changxuan, Jiang Tengjiao, Zhong Minjuan, and Bian Hairong. Sentiment Computing of Web Financial Information Based on the Part-of-Speech Tagging and Dependency Parsing[J]. Journal of Computer Research and Development, 2013, 50(12): 2554-2569.
    [8]Zhu Hegui, Zhang Xiangde, Yang Lianping, and Tang Qingsong. Fingerprint-Based Random Sequence Generator[J]. Journal of Computer Research and Development, 2009, 46(11): 1862-1867.
    [9]Zhang Pengcheng, Zhou Yu, Li Bixin, and Xu Baowen. Property Sequence Chart: Formal Syntax and Semantic[J]. Journal of Computer Research and Development, 2008, 45(2): 318-328.
    [10]Chen Dangyang, Jia Suling, Wang Huiwen, and Luo Chang. Trend Sequences Analysis of Temporal Data and a Subsequence Matching Algorithm[J]. Journal of Computer Research and Development, 2007, 44(3).

Catalog

    Article views (808) PDF downloads (716) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return