高级检索

智能问答模型构建以及在公共卫生领域应用:基于检索增强生成技术

Development of an intelligent question-answering model and its application in public health: based on RAG technology

  • 摘要:
    目的 通过构建基于检索增强生成(RAG)技术的智能问答模型,辅助公共卫生领域的技术人员从规范、指南、手册、文献等技术文档中快速、准确识别并提取关键信息,提升知识获取效率与规范应用能力。
    方法 基于RAG技术,集成自动化文本预处理机制,采用LangChain框架,将带有阈值管理的信息检索模块与历史对话记录支持的回答生成模块整合,构建基于大语言模型的检索增强智能问答模型(CURA-LLM)。
    结果 在操作规范、检测流程与法律规范3个公共卫生领域典型数据集上CURA-LLM模型表现优异,F1评分分别为0.941、0.891和0.947,余弦相似度分别为0.968、0.929和0.963。与3种现有领域问答模型相比,CURA-LLM模型在检索相关性和回答准确性方面更具有优势。
    结论 CURA-LLM模型为公共卫生技术文档的智能化解读与知识快速总结与提取提供了有效支撑。辅助专业人员在突发公共卫生事件响应、日常管理、培训与教学等场景中快速响应与规范执行,具有良好的应用前景。

     

    Abstract:
    Objective To develop an intelligent question-answering model based on retrieval-augmented generation (RAG) to assist public health professionals in efficiently and accurately extracting key information from technical documents such as regulations, guidelines, manuals, and literature, thereby improving knowledge acquisition efficiency and standardized application capabilities.
    Methods On the basis of RAG, this study integrates an automated text preprocessing mechanism and employs the LangChain framework to construct a context-aware unified retrieval-augmented question-answering model powered by large language models (CURA-LLM). This model combines a threshold-managed information retrieval module with a response generation module supported by historical dialogue records.
    Results CURA-LLM demonstrated outstanding performance on three representative public health datasets related to operational guidelines, testing procedures, and legal regulations, with the F1 scores of 0.941, 0.891, and 0.947, and the cosine similarities of 0.968, 0.929, and 0.963, respectively. Moreover, CURA-LLM outperformed three existing domain-specific question-answering models in terms of retrieval relevance and answer accuracy.
    Conclusions CURA-LLM provides effective support for the intelligent interpretation and rapid extraction of knowledge from public health technical documents. It can assist personnel in quickly responding and standardizing execution in scenarios such as emergency responses, routine management, and professional training, offering strong potential for practical application.

     

/

返回文章
返回