Dong Yongquan, Li Qingzhong, Ding Yanhui, Peng Zhaohui. Constrained Conditional Random Fields for Semantic Annotation of Web Data[J]. Journal of Computer Research and Development, 2012, 49(2): 361-371.

Constrained Conditional Random Fields for Semantic Annotation of Web Data

  • Published Date: February 14, 2012
  • Semantic annotation of Web data is a key step in Web information extraction. Its goal is to assign meaningful semantic labels to the data elements of an extracted Web object, and it has attracted increasing research attention in recent years. Conditional random fields (CRFs) are a state-of-the-art approach that exploits sequence characteristics to improve labeling. However, traditional CRFs cannot simultaneously use existing Web databases and the logical relationships among Web data elements, which leads to low precision in semantic annotation of Web data. To address this problem, this paper presents a constrained conditional random fields (CCRF) model for annotating Web data. The model incorporates confidence constraints and logical constraints to make efficient use of existing Web databases and of the logical relationships among Web data elements. Because the Viterbi inference procedure of the traditional CRF model cannot handle both kinds of constraints at once, the model adopts a novel inference procedure based on integer linear programming, extending CRFs to support both kinds of constraints naturally and efficiently. Experimental results on a large amount of real-world data collected from diverse domains show that the proposed approach significantly improves the accuracy of semantic annotation of Web data and lays a solid foundation for Web information extraction.
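The contrast the abstract draws between Viterbi inference and constrained inference can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the label set, the scores, the reading of a confidence constraint as "a position is pinned to a label by a database match", and the reading of a logical constraint as "a label occurs at most once". The paper solves the constrained problem with integer linear programming; this sketch swaps in exhaustive search, which maximizes the same sequence score over the same feasible set but is only practical for very short sequences.

```python
from itertools import product

# Hypothetical label set and scores, chosen only for illustration.
LABELS = ["title", "author", "price"]

# EMISSION[i][y]: score of giving label y to the data element at position i.
EMISSION = [
    {"title": 2.0, "author": 0.5, "price": 0.1},
    {"title": 1.5, "author": 1.0, "price": 0.2},
    {"title": 0.1, "author": 0.3, "price": 2.0},
]

# TRANSITION[(a, b)]: score of label a being followed directly by label b.
TRANSITION = {(a, b): 0.0 for a in LABELS for b in LABELS}
TRANSITION[("title", "author")] = 0.2
TRANSITION[("author", "price")] = 0.2


def sequence_score(seq, emission, transition):
    """Sum of per-position scores plus adjacent-label transition scores."""
    score = sum(emission[i][y] for i, y in enumerate(seq))
    score += sum(transition[(a, b)] for a, b in zip(seq, seq[1:]))
    return score


def viterbi(emission, transition):
    """Unconstrained Viterbi decoding over a linear chain."""
    best = {y: (emission[0][y], [y]) for y in LABELS}
    for i in range(1, len(emission)):
        nxt = {}
        for y in LABELS:
            prev = max(LABELS, key=lambda p: best[p][0] + transition[(p, y)])
            prev_score, prev_path = best[prev]
            nxt[y] = (prev_score + transition[(prev, y)] + emission[i][y],
                      prev_path + [y])
        best = nxt
    return max(best.values(), key=lambda sp: sp[0])[1]


def constrained_decode(emission, transition, fixed, at_most_once):
    """Best-scoring sequence that satisfies both kinds of constraints.

    fixed:        {position: label}, an assumed form of confidence constraint.
    at_most_once: labels that may appear at most once, an assumed form of
                  logical constraint.
    Exhaustive search stands in for the ILP solver used in the paper.
    """
    best_seq, best_score = None, float("-inf")
    for seq in product(LABELS, repeat=len(emission)):
        if any(seq[i] != y for i, y in fixed.items()):
            continue  # violates a confidence constraint
        if any(seq.count(y) > 1 for y in at_most_once):
            continue  # violates a logical constraint
        score = sequence_score(seq, emission, transition)
        if score > best_score:
            best_seq, best_score = list(seq), score
    return best_seq


unconstrained = viterbi(EMISSION, TRANSITION)
constrained = constrained_decode(
    EMISSION, TRANSITION, fixed={1: "author"}, at_most_once={"title"}
)
print(unconstrained)
print(constrained)
```

In this toy instance, plain Viterbi labels two elements as `title`, while the constrained search returns a sequence with a single `title` and the pinned `author` at position 1. The "at most once" condition depends on the whole sequence rather than on adjacent labels, so it cannot be expressed as a transition score in a first-order chain, which is the kind of limitation the abstract attributes to Viterbi inference.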
