有源干扰下基于压缩感知的频率捷变雷达强化学习-2022年

ID:37335

大小:3.85 MB

页数:20页

时间:2023-03-03

金币:10

上传者:战必胜

 
Citation: Wang, S.; Liu, Z.; Xie, R.;
Ran, L. Reinforcement Learning for
Compressed-Sensing Based
Frequency Agile Radar in the
Presence of Active Interference.
Remote Sens. 2022, 14, 968. https://
doi.org/10.3390/rs14040968
Academic Editors: Yangquan Chen,
Subhas Mukhopadhyay,
Nunzio Cennamo, M. Jamal Deen,
Junseop Lee and Simone Morais
Received: 21 January 2022
Accepted: 14 February 2022
Published: 16 February 2022
Publishers Note: MDPI stays neutral
with regard to jurisdictional claims in
published maps and institutional affil-
iations.
Copyright: © 2022 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
4.0/).
remote sensing
Article
Reinforcement Learning for Compressed-Sensing Based
Frequency Agile Radar in the Presence of Active Interference
Shanshan Wang, Zheng Liu *, Rong Xie and Lei Ran
National Laboratory of Radar Signal Processing, Xidian University, Xi’an 710071, China;
ssw@stu.xidian.edu.cn (S.W.); rxie@mail.xidian.edu.cn (R.X.); rl@xidian.edu.cn (L.R.)
* Correspondence: lz@xidian.edu.cn
Abstract:
Compressed sensing (CS)-based frequency agile radar (FAR) is attractive due to its superior
data rate and target measurement performance. However, traditional frequency strategies for CS-
based FAR are not cognitive enough to adapt well to the increasingly severe active interference
environment. In this paper, we propose a cognitive frequency design method for CS-based FAR using
reinforcement learning (RL). Specifically, we formulate the frequency design of CS-based FAR as a
model-free partially observable Markov decision process (POMDP) to cope with the non-cooperation
of the active interference environment. Then, a recognizer-based belief state computing method is
proposed to relieve the storage and computation burdens in solving the model-free POMDP. This
method is independent of the environmental knowledge and robust to the sensing scenario. Finally,
the double deep Q network-based method using the exploration strategy integrating the CS-based
recovery metric into the
e
-greedy strategy (DDQN-CSR-
e
-greedy) is proposed to solve the model-free
POMDP. This can achieve better target measurement performance while avoiding active interference
compared to the existing techniques. A number of examples are presented to demonstrate the
effectiveness and advantage of the proposed design.
Keywords:
compressed-sensing-based frequency agile radar; cognitive design; anti-interference;
target measurement
1. Introduction
In electronic warfare scenarios, hostile jammers emit active interference by intercepting
and imitating radar signals [1,2], having a significant negative effect on radar functioning.
Hence, it is necessary to equip radar systems with anti-jamming techniques. In addition,
since an ever-growing number of electromagnetic systems require access to the limited
frequency resource, especially after the wide deployment of the fifth generation (5G),
minimizing the active co-frequency interference between different radiators becomes an
attractive consideration. Frequency agile radar (FAR), which transmits pulses with dif-
ferent carrier frequencies in a coherent processing interval (CPI), possesses anti-jamming
capabilities and has the potential to realize spectrum compatibility [
3
,
4
]. Random fre-
quency is a common strategy for FAR with the thumbtack-type ambiguity function, but
it cannot avoid active interference flexibly due to the lack of utilization of environmental
information. Sense-and-avoid (SAA) techniques can be employed to select unoccupied
frequency bands automatically [
5
,
6
], but the selection is based on the active interference
knowledge sensed in the previous time. This cannot handle the anti-interference design
in a dynamically changing environment. Therefore, it is of great significance to learn the
interference dynamics and design a more cognitive frequency strategy for FAR.
Reinforcement learning (RL) is a branch of machine learning that aims at making the
agent learn a control strategy through interaction with the environment [
7
9
]. It has been
widely studied in the cognitive communication field to learn spectrum sense and access
strategies [
10
,
11
]. Inspired by these investigations, some researchers have attempted to
Remote Sens. 2022, 14, 968. https://doi.org/10.3390/rs14040968 https://www.mdpi.com/journal/remotesensing
资源描述:

当前文档最多预览五页,下载文档查看全文

此文档下载收益归作者所有

当前文档最多预览五页,下载文档查看全文
温馨提示:
1. 部分包含数学公式或PPT动画的文件,查看预览时可能会显示错乱或异常,文件下载后无此问题,请放心下载。
2. 本文档由用户上传,版权归属用户,天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容,确认文档内容符合您的需求后进行下载,若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误,付费完成后未能成功下载的用户请联系客服处理。
关闭