奖励回溯DQN驱动的多QoS工业网络时隙调度方法

打印
收藏

收藏成功

微博 QQ空间微信

打开文本图片集

中图分类号：TP393 文献标志码：A 文章编号：1001-3695（2025）07-029-2141-06

doi： 10.19734/j. issn.1001-3695.2024.12.0491

Abstract：Existing researchonmulti-QoSscheduling problems，due toitsreliancesolelyonimmediatereward feedback mechanisms，faces isues ofpoor scalabilityand resource wastagewhen handlingdelay-sensitivedataand mediadata withcontinuous transmision requirements inresource-constrained scenarios.To addressthis problem，this paper proposed aRB-DQN algorithm.Thisalgorithmadjustedthecurrntstate’spolicyevaluationbybacktrackingfutureinteractions，effectivelyidentifyingandresolving packetlosscausedbysuboptimalschedulingstrategies.Additionaly，itdesignedaLTTmetric，whichcomprehensivelyconsideredtheservicerequirements ofbothdelay-sensitivedataandmedia-typedata，alowing forweightadjustmentstoemphasizediferentpriorities.Extensivesimulationresultsdemonstratethattheproposedalgorithmsignificantlyreducesthe delayand jiterofdelay-sensitivedata while ensuringthe smothnessandstabilityof media-type data，outperforming other scheduling strategies.

Keywords：time slot scheduling；deep reinforcement learning；multi-QoS；reward backtracking

0 引言

随着工业互联网的快速发展，制造业正经历深刻的变革。（剩余15719字）

试读结束

购买全文6.00元下一篇基于QUIC的拥塞控制算法动态切换机制

计算机应用研究

2025年07期

¥12.00/本