[关键词]
[摘要]
大语言模型(large language model,LLM)通过处理和理解自然语言数据,实现高质量的信息检索、知识提取等功能,为中医药研究提供了新机遇。基于中医药大模型发展现状,梳理了LLM开发过程中的数据存储与处理方法,概述了检索增强生成、混合专家模型、人类反馈强化学习、知识蒸馏等人工智能方法,归纳了LLM训练微调与性能评价方法。针对中医药数据的特点,从高质量数据集构建、多领域专家系统融合、信息快速提取、训练与调优等方面入手,提出了中医药LLM的构建策略,并分析了LLM在中医药领域的具体应用场景,为中医药领域LLM的构建和应用提供参考,推动中医药现代化和智能化发展。
[Key word]
[Abstract]
By processing and understanding natural language data, large language models (LLM) enable the high-quality information retrieval, knowledge extraction, etc., and provide new opportunities for traditional Chinese medicine (TCM) research. Based on recent developments of LLM in TCM, the present work summarizes the data storage and processing algorithms, as well as artificial intelligence methods, such as retrieval-augmented generation, mixture of experts, reinforcement learning from human feedback, and knowledge distillation for developing LLM. It also summarizes methods for training fine-tuning and performance evaluation of LLM. In response to the characteristics of TCM data, strategies for developing LLM for TCM are proposed, which focuses on developing high-quality datasets, integrating mixture of experts, rapid information extraction, and model training and optimization. Additionally, it outlines specific application scenarios of LLM in TCM. The aim of this work is to provide insights for the development and application of LLM in TCM, promoting the modernization and intelligent development of TCM.
[中图分类号]
R28;TP18
[基金项目]
成都中医药大学引进人才项目(030041225)