基于改进虚拟遗憾最小化的雷达抗干扰算法研究

    Research on Radar Anti-Jamming Algorithm Based on Improved Counterfactual Regret Minimization

    • 摘要: 针对现代电子战中干扰机的日益智能化问题,主要研究了基于博弈论的雷达抗干扰策略优化,提出了一种优先经验深度对决虚拟遗憾最小化算法。首先建立雷达与干扰机的扩展式博弈对抗模型,以博弈树的形式通过多轮交互寻找纳什均衡策略;其次将深度学习与虚拟遗憾最小化算法相结合,使用深度神经网络替代传统表格存储以解决雷达对抗中状态空间庞大的计算问题;随后对价值网络引入一个对决网络结构,将状态价值和动作优势解耦以提升雷达对关键频点决策的敏感性;最后采用优先经验回放机制优先回放高价值样本,加速雷达策略的收敛。实验结果表明,所提算法能有效提高雷达抗干扰性能。

       

      Abstract: Aiming at the confrontation problem between radar and intelligent jamming machine in modern electronic warfare, this paper mainly studies the optimization of radar anti-jamming strategy based on game theory, and proposes a Prioritized Experience Replay Deep Dueling Counterfactual Regret Minimization algorithm. Firstly, the game between radar and jammer is modeled as an Extensive-Form Game, using a game tree to search for Nash equilibrium strategies through multiple rounds of interaction; Secondly, deep learning is combined with the Counterfactual Regret Minimization algorithm. A deep neural network is used to replace the traditional table storage, so as to solve the computational problem caused by the huge state space in radar confrontation. Subsequently, dueling network structure is introduced into the value network, which decouples the state value and action advantage. This improvement enhances the sensitivity of radar in making decisions on key frequency points. Finally, a prioritized experience replay mechanism is adopted to replay high-value samples first, accelerating the convergence of the radar strategy. The experimental results show that the proposed algorithm can effectively improve anti-interference performance compared to traditional algorithms.

       

    /

    返回文章
    返回