面向电信领域的大模型提示词工程测评

打开文本图片集
中图分类号:TP182 文献标识码:A 文章编号:2096-4706(2025)12-0123-06
Prompt Engineering Evaluation of Large Language Model for the TelecommunicationsDomain
FAN Wenbin1, WANG Yanyan1, WANG Yingying1, XU Yin1, SONG Qi² (1.KnowledgeComputing InteligenceLaboratory,GuoChuang CloudTechnologyCo.,Ltd.,Hefei3oo88,China; 2.SchoolofComputerScienceandTechnologyUniversityofScienceandTechnologyofChina,Hefei23o027,China)
Abstract: A prompt evaluation system of Large Language Model (LLM) forthe telecommunications domain is proposed toadresste isuesofincomplete evaluationofpromptparameters inpromptengineeringresearchand thelackofconsideration forthecomplexityinrealproductionsenariosofevaluationmethod.Tothisend,fivedatasets inthe telecommunications domainareconstructed,coveringthree majortasksofsntimenttextclasification,customersrvice intentrecogniionnd knowledge-basedquestionanswering.Subsequentlypromptparametersarecategorized intofourdimensionsofole,lngth, tone,andorder,andthe impactofthesediferentdimensionsontheperformanceofsixLLMsissystematicallyevaluated.The researchresults indicatethata well-esigned promptcansignificantlyimprovemodel performanceonthethreemajortasks inthe telecommunications domain.
Keywords: Large Language Model; prompt enginering; model performance optimization; telecommunication domain; Jatural Language Processing
0 引言
近年来,人工智能技术迅猛发展,其中大语言模型(LargeLanguageLodels,LLMs)作为自然语言处理领域的核心技术,受到了广泛关注。(剩余8179字)