[关键词]
[摘要]
针对干扰机群掩护目标突防组网雷达场景下的干扰资源分配的问题,提出了一种基于深度强化学习的干扰资源分配方法。该文将干扰资源分配模型描述为一个马尔可夫决策过程,并提出了一种基于动作密钥编码的双延迟深度确定性策略梯度(AKE-TD3)网络训练算法,将混合整数优化问题转化为连续变量优化问题,解决了算法难以收敛的问题。仿真结果表明,文中所设计的干扰资源分配方法对组网雷达有更好的干扰效果,且稳定性更高,有效地提升了干扰机群的作战性能。
[Key word]
[Abstract]
A deep reinforcement learning-based jamming resource allocation method is proposed to address the problem of resource allocation of multi-jammer for jamming netted radar system to cover target penetration. The jamming resource allocation model is described as a Markov decision process, and an action key encoding based double-delayed deep deterministic policy gradient (AKE-TD3) network training algorithm is also proposed, which transforms the mixed-integer optimization problem into a continuous-variable optimization problem to improve the convergence of the algorithm. Simulation results show that the jamming resource allocation method proposed in the paper has better jamming effectiveness and stability for netted radar systems. This method significantly enhances the combat performance of multi-jammer.
[中图分类号]
TN972
[基金项目]
国家自然科学基金资助面上项目(61971109);国防科技创新特区支持项目(重点项目);中央高校其本科研业务费资助项目(ZYGX2018J009)