• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Deng Xubin, Zhu Yangyong. ReDE: A Regular Expression-Based Method for Extracting Biological Data[J]. Journal of Computer Research and Development, 2005, 42(12): 2184-2191.
Citation: Deng Xubin, Zhu Yangyong. ReDE: A Regular Expression-Based Method for Extracting Biological Data[J]. Journal of Computer Research and Development, 2005, 42(12): 2184-2191.

ReDE: A Regular Expression-Based Method for Extracting Biological Data

More Information
  • Published Date: December 14, 2005
  • Extracting data from heterogeneous biological data sources to build a query and analysis platform for biological scientists is currently a hot research topic. In general, data extraction process concerns many interdependent metadata. Making full use of dependencies among metadata to generate one metadata from another can reduce metadata maintenance overhead. However, many data extraction methods overlook these dependencies and require much effort to construct and maintain many metadata. In this paper, a regular expression (RE) based method named as ReDE is proposed to avoid this drawback: by building a parse tree for RE groups, an RE-based algorithm for generating relational database scheme and a general data extraction and assembling algorithm are designed. The novelty is that the RE is the only necessary metadata whose management and maintenance are relatively easy. This method can serve as the basis for building a biological database design-aiding tool and a high automatic tool for data extraction, and has been applied to extract data for the first online integrated biological data warehouse of China.
  • Related Articles

    [1]Wu Wenlong, Yin Hailian, Wang Ning, Xu Mengfei, Zhao Xinzhe, Yin Zhanzuo, Liu Yuanrui, Wang Haofen, Ding Yan, Li Bohan. A Synergetic LLM-KG Framework for Cross-Domain Heterogeneous Data Query[J]. Journal of Computer Research and Development, 2025, 62(3): 605-619. DOI: 10.7544/issn1000-1239.202440634
    [2]Wang Mengru, Yao Yunzhi, Xi Zekun, Zhang Jintian, Wang Peng, Xu Ziwen, Zhang Ningyu. Safety Analysis of Large Model Content Generation Based on Knowledge Editing[J]. Journal of Computer Research and Development, 2024, 61(5): 1143-1155. DOI: 10.7544/issn1000-1239.202330965
    [3]Guo Jiang, Wang Miao, Zhang Yujun. Content Type Based Jumping Probability Caching Mechanism in NDN[J]. Journal of Computer Research and Development, 2021, 58(5): 1118-1128. DOI: 10.7544/issn1000-1239.2021.20190871
    [4]Li Li, Liu Huanyu, Lu Laifeng. Probabilistic Caching Content Placement Method Based on Content-Centrality[J]. Journal of Computer Research and Development, 2020, 57(12): 2648-2661. DOI: 10.7544/issn1000-1239.2020.20190704
    [5]Wang Yishu, Yuan Ye, Liu Meng, Wang Guoren. Survey of Query Processing and Mining Techniques over Large Temporal Graph Database[J]. Journal of Computer Research and Development, 2018, 55(9): 1889-1902. DOI: 10.7544/issn1000-1239.2018.20180132
    [6]Huang Sheng, Teng Mingnian, Wu Zhen, Xu Jianghua, Ji Ruijun. A Data Caching Scheme Based on Node Classification in Named Data Networking[J]. Journal of Computer Research and Development, 2016, 53(6): 1281-1291. DOI: 10.7544/issn1000-1239.2016.20148097
    [7]Li Ruimin, Lin Hongfei, Yan Jun. Mining Latent Semantic on User-Tag-Item for Personalized Music Recommendation[J]. Journal of Computer Research and Development, 2014, 51(10): 2270-2276. DOI: 10.7544/issn1000-1239.2014.20130342
    [8]Wang Zhurong, Li Wei, Zhu Bilei, Li Xiaoqiang. Audio Authentication Based on Music Content Analysis[J]. Journal of Computer Research and Development, 2012, 49(1): 158-166.
    [9]Huang Zhenhua and Wang Wei. An Algebra for Skyline Query Processing Data Cube[J]. Journal of Computer Research and Development, 2007, 44(6): 990-999.
    [10]Zheng Guibin, Han Jiqing. Automatic Music Transcription Based on Harmonic Structure Information[J]. Journal of Computer Research and Development, 2006, 43(12): 2187-2192.

Catalog

    Article views (836) PDF downloads (691) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return