• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Xun Endong and Li Cheng. Applying Terminology Definition Pattern and Multiple Features to Identify Technical New Term and Its Definition[J]. Journal of Computer Research and Development, 2009, 46(1): 62-69.
Citation: Xun Endong and Li Cheng. Applying Terminology Definition Pattern and Multiple Features to Identify Technical New Term and Its Definition[J]. Journal of Computer Research and Development, 2009, 46(1): 62-69.

Applying Terminology Definition Pattern and Multiple Features to Identify Technical New Term and Its Definition

More Information
  • Published Date: January 14, 2009
  • identification of technical new term and its definition is an important research topic in information extraction. It is still a great challenge to provide a scalable solution for large-scale terms extraction, because most previous approaches fail to explicitly define the linguistic constituent of terms and the function of their definition patterns. The authors’ research shows that the occurrences of technical new terms in most cases are accompanied with their definition descriptions in the real corpus. Based on this intuition, the linguistic constituent of technical terms and the numerical function of their definitions are defined explicitly. Also presented is a novel statistical approach based on linguistic pattern of terminology definition (LPTD) to extract Chinese technical new terms and their definitions. LPTD in this paper is first proposed to delimit the boundary of technical terms. In the identification phase, both statistical information of terms and LPTD features obtained from the previous filtering process are taken into account in the SVM classifier. They are integrated into one unified framework. The idea in this paper can also be used for reference in collocation extraction (CE) and be easily extended to other different languages. Compared with the previously reported outcomes, this approach achieves a competitive result in real large-scale corpora at 90.5% in precision and 78.1% in recall.
  • Related Articles

    [1]Wang Jieting, Qian Yuhua, Li Feijiang, Liu Guoqing. Support Vector Machine with Eliminating the Random Consistency[J]. Journal of Computer Research and Development, 2020, 57(8): 1581-1593. DOI: 10.7544/issn1000-1239.2020.20200127
    [2]Zhang Xiangwen, Lu Ziyao, Yang Jing, Lin Qian, Lu Yu, Wang Hongji, Su Jinsong. Weighted Lattice Based Recurrent Neural Networks for Sentence Semantic Representation Modeling[J]. Journal of Computer Research and Development, 2019, 56(4): 854-865. DOI: 10.7544/issn1000-1239.2019.20170917
    [3]Chen Zhiming, Li Maoxi, Wang Mingwen. Sentence-Level Machine Translation Quality Estimation Based on Neural Network Features[J]. Journal of Computer Research and Development, 2017, 54(8): 1804-1812. DOI: 10.7544/issn1000-1239.2017.20170182
    [4]Li Ru, Wang Zhiqiang, Li Shuanghong, Liang Jiye, Collin Baker. Chinese Sentence Similarity Computing Based on Frame Semantic Parsing[J]. Journal of Computer Research and Development, 2013, 50(8): 1728-1736.
    [5]Hou Wei, Dong Hongbin, Yin Guisheng. A Membership Degree Refinement-Based Evolutionary Clustering Algorithm[J]. Journal of Computer Research and Development, 2013, 50(3): 548-558.
    [6]Wang Suge, Li Deyu, Wei Yingjie. A Method of Text Sentiment Classification Based on Weighted Rough Membership[J]. Journal of Computer Research and Development, 2011, 48(5): 855-861.
    [7]Chen Jianmei, Lu Hu, Song Yuqing, Song Shunlin, Xu Jing, Xie Conghua, Ni Weiwei. A Possibility Fuzzy Clustering Algorithm Based on the Uncertainty Membership[J]. Journal of Computer Research and Development, 2008, 45(9): 1486-1492.
    [8]Wu Youzheng, Zhao Jun and Xu Bo. Sentence Retrieval with a Topic-Based Language Model[J]. Journal of Computer Research and Development, 2007, 44(2): 288-295.
    [9]Qin Bing, Liu Ting, Chen Shanglin, and Li Sheng. Sentences Optimum Selection for Multi-Document Summarization[J]. Journal of Computer Research and Development, 2006, 43(6): 1129-1134.
    [10]Wang Jian, Lin Fuzong. Digital Audio Watermarking Based on Support Vector Machine (SVM)[J]. Journal of Computer Research and Development, 2005, 42(9): 1605-1611.

Catalog

    Article views (828) PDF downloads (868) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return