高级检索

    基于大语言模型的知识库查询风格自适应转换

    Adapative Query Style Transfer for Large Language Models as Knowledge Bases

    • 摘要: 大语言模型在知识存储方面不断增强的能力展示了其作为知识库的潜在效用. 然而,任何给定的提示只能提供大语言模型所涵盖知识的下限估计. 在语言模型即知识库(language models as knowledge bases,LMs-as-KBs)的场景中,先前的提示学习方法忽略了查询风格对模型表现的影响. 揭示了大语言模型确实具有与查询风格相关的可学习偏好,并且利用大语言模型的这种特性引入了查询风格自适应转换(adaptive query style transfer,ARES)方法,通过适应大语言模型的偏好来增强其知识查询的表现. ARES方法从构造查询候选集开始,通过改写实现多种表达风格的纳入. 随后,ARES训练一个评估器来学习并识别大语言模型对查询风格的偏好,评估查询候选集并选择潜在最优查询. 在多个数据集上进行的实验表明了该方法在提高大语言模型即知识库服务上查询准确率的有效性,增量对比原始模型与3个基线方法分别实现了平均2.26%,1.68%,1.19%,1.17%的提升,这表明ARES可以与其他方法有效地结合使用,从而实现多角度的查询表现增强.

       

      Abstract: The increasing capabilities of large language models (LLMs) in knowledge storage have underscored their potential utility as knowledge bases. However, it’s important to note that any given prompt can merely offer a lower-bound estimate of the knowledge encompassed by the language model. Prior prompt learning methods in the context of Language Models as Knowledge Bases (LMs-as-KBs) have overlooked the influence of query style. We have unveiled a significant revelation - there are indeed learnable preference within LLMs pertaining to query style. Leveraging this distinctive model characteristic, we introduce the Adaptive query style transfer (ARES) method to improve the performance of LMs-as-KBs by adapting LLM’s preference. ARES initiates by presenting a candidate set of queries, achieved through paraphrasing to incorporate various expression styles. Subsequently, an evaluator is trained to learn and discern LLM’s preferences for query styles, ultimately evaluating the candidate set and selecting the potentially optimal query. Experiments conducted across multiple datasets have convincingly demonstrated the efficacy of our approach in enhancing question answering accuracy on LMs-as-KBs scenarios. Furthermore, Incremental comparisons with the original model and three baseline methods show an average improvement of 2.26%, 1.68%, 1.19%, and 1.17%, respectively, indicating ARES can be effectively utilized in conjunction with other approaches, leading to enhanced performance and optimization across different dimensions.

       

    /

    返回文章
    返回