通过递进知识更新和自一致性增强大语言模型推理能力

打印
收藏

收藏成功

微博 QQ空间微信

打开文本图片集

关键词：大语言模型；复杂多步推理；思维链；推理增强；动态知识更新中图分类号：TP183 文献标志码：A 文章编号：1001-3695（2025）12-023-3707-09 doi：10. 19734/j.issn.1001-3695.2025.05.0126

Enhancing reasoning ability of large language models through progressive knowledge updates and self-consistency

ChangXiaonan，ZhangLong†，MaFuxiao （SchoolofComputer&InformationEngineering，TianjinNormal University，Tianjin3Oo387，China）

Abstract：This paperaddressed thelimitations oflarge language models incomplex multi-stepreasoning tasks.Existing methods，such aschainof thought（CoT），enhancereasoning capabilitiesby guiding the model to generatereasoning steps；however，theyoften encountererrors inthegeneratedintermediatesteps andinformationomissions.Whenanypartofthereasonng processfails，itfrequentlyleadstoincorrctfinalanswers.Toovercomethesechallnges，thispaperproposedanovelreasoning method called progressive consistent reasoning （PCR）. PCR constructed a dynamic knowledge repository -a structured listof informationthatwascontinuallupdatedduringthereasoningprocess—byextracting explicitkeyinformation fromthe original problem toestablishan initialknowledgebaseanddecomposingthe problem into multiple sub-problems.Aftersolving each sub-problem，PCR dynamicallyupdatedtheknowledgerepository by extracting new informationfromthesub-problemanswers.The modelthenreconsideredtheoriginalproblembasedontheupdatedknowledgebasetogeneratecompleteanswerattempts foreach stage.Finally，itemployedanaggregationstrategytointegratethecandidatesolutionsfromeach stage，resulting ina morerobustand accurate final answer.Compared toother methods，PCR improved performanceonvarious complex reasoning benchmarks，such as GSM8K and CSQA，by 11. 9% ，5. 73% ，and 3. 45% ，0. 95% over traditional chain of thoughtandself-consistencyreasoning methods，respectively.TheresultsdemonstratethatPCRefectivelyreduces theimpact of erorsinintermediatestepsandinformationomisionsonthefinaloutcomes，therebyenancingthestabilityandaccuracyof reasoning.

Key words：large language models；complex multi-step reasoning；chainofthought；reasoning enhancement;dynamic update

0 引言

随着深度学习技术的快速发展，大型语言模型在自然语言处理（naturallanguageprocessing，NLP）领域取得了革命性突破。（剩余17928字）

试读结束

购买全文6.00元下一篇考虑故障因素的云服务器的最优策略及性能分析

计算机应用研究

2025年12期

¥12.00/本