Citation: HUANG Zhongling, YAO Xiwen, and HAN Junwei. Progress and perspective on physically explainable deep learning for synthetic aperture radar image interpretation[J]. Journal of Radars, 2022, 11(1): 107–125. doi: 10.12000/JR21165

Progress and Perspective on Physically Explainable Deep Learning for Synthetic Aperture Radar Image Interpretation

DOI: 10.12000/JR21165 CSTR: 32380.14.JR21165
Funds:  The National Natural Science Foundation of China (62101459), China Postdoctoral Science Foundation (BX2021248), Fundamental Research Funds for the Central Universities (G2021KY05104)
  • Corresponding author: HUANG Zhongling, huangzhongling@nwpu.edu.cn
  • Received Date: 2021-11-04
  • Accepted Date: 2021-12-09
  • Rev Recd Date: 2021-12-08
  • Publish Date: 2021-12-31
  • Abstract: Deep learning technologies have been developed rapidly in Synthetic Aperture Radar (SAR) image interpretation. The current data-driven methods neglect the latent physical characteristics of SAR; thus, the predictions are highly dependent on training data and even violate physical laws. Deep integration of the theory-driven and data-driven approaches for SAR image interpretation is of vital importance. Additionally, the data-driven methods specialize in automatically discovering patterns from a large amount of data that serve as effective complements for physical processes, whereas the integrated interpretable physical models improve the explainability of deep learning algorithms and address the data-hungry problem. This study aimed to develop physically explainable deep learning for SAR image interpretation in signals, scattering mechanisms, semantics, and applications. Strategies for blending the theory-driven and data-driven methods in SAR interpretation are proposed based on physics-based machine learning to develop novel learnable and explainable paradigms for SAR image interpretation. Further, recent studies on hybrid methods are reviewed, including SAR signal processing, physical characteristics, and semantic image interpretation. Challenges and future perspectives are also discussed on the basis of the research status and related studies in other fields, which can serve as inspiration.

     

  • As an active microwave sensor, Synthetic Aperture Radar (SAR) can perform all-day, all-weather imaging without being influenced by light and climate, and thus has important application value in military and civilian Earth observation. Unlike optical remote sensing techniques such as visible-light and infrared imaging, SAR actively emits electromagnetic waves that are modulated by ground objects, receives the backscattering in the form of echo signals, and generates two-dimensional SAR images through imaging algorithms[1]. SAR images therefore reflect the microwave characteristics of ground objects; the imaging results are influenced by various factors, such as wavelength, incident angle, and polarization mode, and are also closely related to the structure, arrangement, and material characteristics of the target. They are distinct from optical images, with which the human visual system is more familiar, as shown in Fig. 1[2].

    Figure  1.  The SAR images obtained by Sentinel-1 under different imaging conditions[2]

    SAR image interpretation faces many challenges. Experts usually need to understand SAR imaging mechanisms, microwave scattering characteristics, and other background knowledge to accurately interpret ground objects in SAR images[3]. Traditional SAR image interpretation methods are mostly designed based on rich expert knowledge and theoretical models[4-6] and have strong interpretability. However, handcrafted features tend to capture only certain aspects of SAR image characteristics, require domain knowledge, and are time-consuming to design. With the vigorous development of artificial intelligence technology, Deep Learning (DL)-based methods have gradually become the mainstream in this field in recent years. DL can build an end-to-end system that extracts multi-level features and learns target tasks automatically and simultaneously, thus overcoming the limits imposed by the manual design of features and classifiers and achieving significant performance improvement[7].

    Currently, most mainstream DL methods for SAR image interpretation are developed from the field of computer vision and are primarily focused on the visual information of SAR amplitude images. A Convolutional Neural Network (CNN) is used to perform automatic feature learning of SAR amplitude images, and the loss function is optimized for specific tasks[7-10]. The advantage of the data-driven approach is that it can automatically learn potential patterns and rules from massive amounts of data. However, SAR image interpretation typically encounters the following challenges in its practical application:

    (1) Learnable but difficult to explain: Unquestionably, Deep Neural Networks (DNNs) are capable of learning, but the model is complex and difficult to explain. The model predictions must be highly reliable and credible in specific SAR application fields, such as battlefield monitoring. Given the “black box” nature of the current DL model, decision-makers have difficulty understanding the results, and the technology’s practical application is restricted.

    (2) Big data but small annotations: Even though SAR satellites in orbit can provide vast amounts of data, large-scale SAR data labeling is expensive because of the difficulties of visual interpretation. Training DNNs with a limited amount of labeled data is a major problem that needs to be solved urgently. Even when many high-quality annotated samples are available, the trained model is prone to poor generalization on other SAR images acquired with different imaging parameters and other factors[8-10].

    (3) Limitation of visual interpretation: The texture information of SAR images is an important basis for SAR interpretation. Because of the unique microwave imaging mechanism, however, visually discriminating some targets with complicated scattering is challenging. DL based solely on amplitude information cannot properly understand SAR images[11].

    We believe that the development of physically explainable DL methods, which differ from those in computer vision, plays an important role in resolving the aforementioned issues. Such development is of great significance in mining the physical intelligence of microwave vision[12]. The Physically Explainable DL (PXDL) proposed in this paper aims to establish a hybrid modeling paradigm that combines the physical models or interpretation expertise of SAR with DL to improve the explainability of the model itself. The data-driven strategy has a high data utilization rate, while the theoretical model is highly interpretable. Through hybrid modeling, the two benefits complement one another, contributing to more transparent algorithms, enhancing interpretability, and reducing the dependence on labeled samples. It also promotes the development of the third generation of artificial intelligence[13]. Explainable Artificial Intelligence (XAI) is one of the leading research directions in the field of AI, and much work has been conducted on explainable DL (XDL)[14-18]. One category is post-hoc interpretation, which employs interpretable analysis methods to describe the model after it has been constructed; relevant work in the SAR field can be found in Refs. [19-21]. The proposed PXDL belongs more to the category of self-explanatory models in XDL and can be regarded as an application technology for SAR image interpretation. It constructs an interpretable AI system for SAR by means of approaches such as physics-based machine learning.

    In contrast to traditional DL concepts based on computer vision, which emphasize image interpretation, PXDL methods should construct a new learnable and interpretable paradigm in multiple dimensions. Inspired by the cognitive process of experts, we propose to interpret SAR images from signals and characteristics to image semantics and applications, as shown in Fig. 2. Besides extracting SAR image features automatically with DL to achieve end-to-end prediction, other important tasks are to learn the SAR imaging mechanism, clarify the impact of imaging conditions on results, and understand the physical scattering characteristics of targets. Most of these processes are currently completed based on interpretable physical models, but some of them rely on assumptions and approximations, making it challenging to describe complicated circumstances accurately[4]. If the physical model is insufficient or difficult to derive, then data-driven methods can be used for simulation or replacement, as well as for automatic physical parameter estimation. Some existing knowledge can be used as a constraint during optimization to prevent the model from learning results contrary to the physical model. The SAR physical model or expert knowledge can also promote traditional data-driven algorithms: they can be reasonably used to guide the DNN to conduct autonomous learning, thus maximizing the role of a large number of unlabeled samples, obtaining a model with stronger generalization and physical perception abilities, and ensuring the physical consistency of interpretation results.

    Figure  2.  The PXDL for SAR image interpretation is supposed to be carried out from multiple aspects, deeply integrating the data-driven and knowledge-driven models to develop a novel learnable and explainable intelligent paradigm

    In this paper, the new research direction of physics-based machine learning is briefly introduced in Section 2. Then, the basic ideas of how to develop physically explainable DL are summarized in Section 3. The advances in physics-data hybrid modeling for different SAR applications in the last two or three years are reviewed in Sections 4 and 5. Finally, the outlook of PXDL technology for SAR and the conclusion are presented in Sections 6 and 7, respectively.

    Physics-based Machine Learning (Physics-ML) is a newly proposed type of machine learning that aims to embed physical knowledge into machine learning models (primarily DNNs) to solve ill-conditioned or inverse problems, improve model performance, accelerate the solution, and enhance generalizability. In numerous domains, including fluid mechanics and aerodynamics, its applications have yielded excellent results[22]. Physics-informed machine learning is widely utilized for physical processes governed by nonlinear Partial Differential Equations (PDEs). Thuerey et al.[23] introduced this in depth; the primary technique is to translate a PDE-based physical process into a neural network and use the automatic differentiation mechanism to optimize the physical process embedded in the DNN. The Physics-Informed Neural Network (PINN) framework proposed by Raissi et al.[24] has become one of the most prevalent techniques of this kind, followed by several improvements and related applications[25,26].
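    To make this idea concrete, the following is a minimal PINN sketch in PyTorch in the spirit of Raissi et al.[24]: a small network approximating $u(x, t)$ is trained so that the PDE residual, obtained by automatic differentiation, vanishes at sampled collocation points. The choice of the 1D heat equation, the network size, and the sampling scheme are illustrative assumptions, not tied to any specific SAR model; boundary and initial-condition loss terms are omitted for brevity.

```python
# Minimal PINN sketch: fit u(x, t) so that u_t - kappa * u_xx = 0
# holds on random collocation points, using automatic differentiation.
import torch

torch.manual_seed(0)
kappa = 0.1  # assumed diffusivity

net = torch.nn.Sequential(
    torch.nn.Linear(2, 32), torch.nn.Tanh(),
    torch.nn.Linear(32, 32), torch.nn.Tanh(),
    torch.nn.Linear(32, 1),
)

def pde_residual(x, t):
    """Residual u_t - kappa * u_xx, computed with autograd."""
    x.requires_grad_(True)
    t.requires_grad_(True)
    u = net(torch.cat([x, t], dim=1))
    u_t = torch.autograd.grad(u.sum(), t, create_graph=True)[0]
    u_x = torch.autograd.grad(u.sum(), x, create_graph=True)[0]
    u_xx = torch.autograd.grad(u_x.sum(), x, create_graph=True)[0]
    return u_t - kappa * u_xx

opt = torch.optim.Adam(net.parameters(), lr=1e-3)
for step in range(2000):
    x = torch.rand(256, 1)   # collocation points in the unit domain
    t = torch.rand(256, 1)
    loss = pde_residual(x, t).pow(2).mean()  # physics term of the loss
    opt.zero_grad(); loss.backward(); opt.step()
```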

    In the field of earth science and remote sensing, Refs. [27-30] reviewed methodologies and applications that integrate machine learning/DL with physical models. The mainstream ideas include improving the objective function and developing a hybrid model, with applications in hydrology models, radiative transfer processes, and climate change. For example, the Physics-Guided Neural Network (PGNN) learning framework proposed by Refs. [31,32] inputs the prediction results of the physical model together with the observation data into the neural network and uses them to constrain the neural network training. Similarly, Ref. [33] applied it to atmospheric convection prediction. Some works related to the topic discussed in this paper include applications in optical remote sensing imagery and seismic wave interpretation given in Refs. [34,35], as well as applications in beamforming[36,37]. In contrast, the physical models of SAR are more intricate, and research on physics-based machine learning for SAR is still in its infancy; thus, it was not included in the aforementioned surveys[27-30]. In addition, a recent review article[38] about DL in SAR lacks an overview and discussion of the research state in this field.

    Fig. 3 depicts the SAR image interpretation process indicated in Fig. 2 in detail, where yellow represents the physical models, green represents the input and output of each module, red represents modules that are typically implemented by DNNs, and blue represents the parameter set. The potential PXDL implementation concepts for SAR image interpretation tasks are indicated by numbers in Fig. 3. ①, ②, and ③ represent substituting or simulating a physical process and solving the parameters of the physical model using data-driven methods, as described in Sections 3.1, 3.2, and 3.3.1. ③, ④, and ⑤ illustrate the integration of physical guidance or physical constraints in DNN training or providing prior physical information for data-driven methods, as elaborated in Sections 3.3.2, 3.4, and 3.5. In this section, we briefly outline the basic ideas of how to develop PXDL algorithms for SAR based on Fig. 3. The specific implementation cases are given in Sections 4 and 5.

    Figure  3.  The SAR image interpretation guideline. ① ② ③ ④ ⑤ are the potential modules to develop PXDL

    A moving platform is equipped with radar sensors to emit electromagnetic wave signals that interact with targets to generate echo signals that are received by the sensor. Then, two-dimensional complex-valued SAR image data are obtained by the imaging system’s processing. The imaging results are closely related to the physical properties of the scene/target and the working parameters of the sensor and platform. In application scenarios such as target recognition, the electromagnetic scattering model is typically used to simulate targets for data augmentation or to aid in target recognition[39,40]. Both electromagnetic simulation and parametric modeling of electromagnetic scattering require a number of crucial parameters to be determined, which is typically a difficult process. In similar circumstances, mapping rules can be learned automatically by DNNs from large volumes of observational data and embedded into the physical model to optimize parameter selection.

    Many complicated nonlinear physical processes have high computational complexity and model errors. With the development of Graphics Processing Unit (GPU) hardware acceleration and parallel computing, multi-layer stacked DNNs have an efficient forward inference speed and a robust capacity to fit nonlinear models. Therefore, some nonlinear physical processes in SAR, such as SAR image formation and SAR electromagnetic simulation shown in Fig. 3, can be directly simulated by DNNs. It will also contribute to the realization of SAR imaging and interpretation integration[41]. Notably, the constraint of theoretical knowledge must be taken into account during the learning process to prevent the network from producing outcomes that violate physical laws.

    In the field of image processing, Chan et al.[42] proposed the PCANet architecture for image texture feature extraction and classification, constructing filters based on cascaded principal component analysis to replace the usual convolution layers and simplify DNN parameter learning. This architecture has also been applied to SAR image interpretation[43,44]. We believe that it can motivate the development of PXDL approaches for SAR. On the basis of physical models with a sufficient theoretical basis, such as the polarization decomposition model for fully polarized SAR images[45,46], the sub-aperture decomposition model based on the Fourier transform[47], and the attribute scattering center model that describes targets[48], the physical scattering characteristics of the two-dimensional complex-valued images output by the SAR imaging system can be analyzed and interpreted. These physical models themselves can provide interpretable feature representations and replace a portion of the neural network layers, thus providing meaningful priors and reducing the number of network parameters that must be learned.

    When the physical model is insufficient or no comprehensive theoretical foundation exists, the microwave scattering characteristics cannot be fully defined and explained. For example, the polarization decomposition of dual-polarization or single-polarization SAR images is less effective in describing the polarimetric characteristics of targets[49]. DNN can be used to proactively discover the potential association between targets and polarimetric properties by learning from massive amounts of data. The outcomes of data-driven learning can compensate for the inadequacy of human cognition. However, this method is susceptible to data bias and has poor generalization potential. Also worth considering is whether the learning results can be understood by humans and whether they can effectively support interpretation tasks.

    Traditional Deep CNN (DCNN)-based methods are mostly applied to SAR amplitude images, where the hierarchical feature representation is obtained by stacking convolution layers whose high-level features have a specific semantic meaning. In addition, interpreters can infer the types of ground objects based on the physical scattering characteristics; that is, the physical scattering characteristics also contain semantic information. Fig. 4 shows the H/α plane of quad-pol SAR and the distribution of some selected land-use and land-cover samples in the H/α plane[50]. On the basis of this potential semantic relationship, we can design a PGNN that builds an unsupervised learning loop using massive SAR images and their scattering characteristics to enhance model generalization. This PGNN guides the model to learn high-level semantic feature representations with physical perception ability. The theoretical knowledge offered by the physical model can also inspire the design or initialization of the DNN model so that the network parameters themselves have physical meanings.

    Figure  4.  The H/α plane for fully polarized SAR data and the distribution of the selected land-use and land-cover samples in Ref. [50]
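    For concreteness, the following is a sketch of how the entropy H and mean alpha angle underlying Fig. 4 can be computed from the eigenvalue-based polarimetric decomposition; the per-pixel (N, 3, 3) coherency-matrix layout is an assumption for illustration.

```python
# Sketch of the eigenvalue-based H/alpha decomposition: the eigenvalues of
# each 3x3 polarimetric coherency matrix T give the scattering entropy H,
# and the eigenvectors give the mean alpha angle.
import numpy as np

def h_alpha(T):
    """T: (N, 3, 3) Hermitian coherency matrices -> (H, mean_alpha) per pixel."""
    w, v = np.linalg.eigh(T)                  # ascending eigenvalues, w: (N, 3)
    w = np.clip(w, 1e-12, None)               # guard against numerical negatives
    p = w / w.sum(axis=1, keepdims=True)      # pseudo-probabilities
    H = -(p * np.log(p) / np.log(3)).sum(axis=1)          # entropy in [0, 1]
    alpha_i = np.degrees(np.arccos(np.abs(v[:, 0, :])))   # alpha of each eigenvector
    mean_alpha = (p * alpha_i).sum(axis=1)                # in [0, 90] degrees
    return H, mean_alpha
```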

    The lack of annotated samples is a common challenge in many typical applications, such as SAR target recognition. End-to-end CNN training can directly predict the semantic labels of SAR images. Nevertheless, scarce annotations make finding a well-generalizing solution during DNN optimization challenging. With the physical scattering properties of SAR used as additional information or as a constraint during training under the condition of limited samples, the learning cost can be greatly reduced, and the model’s generalizability can be enhanced.

    Because the aforementioned procedures primarily entail the processing of complex-valued data if the core of SAR images is to be fully exploited, constructing complex-valued counterparts of the prevalent DNN architectures is important; a minimal sketch is given below. Notably, the five aforementioned types of PXDL schemes for SAR take only the five modules in Fig. 3 as examples to elaborate; in reality, different PXDL schemes can be adopted for the same task, and different schemes can also be integrated into one algorithm implementation. The next two sections will review the current research status in terms of signals and characteristics, and semantics and applications, respectively.
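    As a minimal illustration of such a complex-valued counterpart, the sketch below implements a complex convolution layer with four real convolutions (PyTorch). The channel sizes and the bias-free choice are assumptions.

```python
# Complex convolution sketch: (x_r + j*x_i) * (w_r + j*w_i) is realized
# with four real convolutions; biases are omitted to keep the algebra exact.
import torch
import torch.nn as nn

class ComplexConv2d(nn.Module):
    def __init__(self, in_ch, out_ch, k=3, padding=1):
        super().__init__()
        self.conv_r = nn.Conv2d(in_ch, out_ch, k, padding=padding, bias=False)
        self.conv_i = nn.Conv2d(in_ch, out_ch, k, padding=padding, bias=False)

    def forward(self, x_r, x_i):
        # real part: x_r*w_r - x_i*w_i; imaginary part: x_r*w_i + x_i*w_r
        y_r = self.conv_r(x_r) - self.conv_i(x_i)
        y_i = self.conv_i(x_r) + self.conv_r(x_i)
        return y_r, y_i

# usage on a single-look complex SAR patch split into real/imaginary parts
layer = ComplexConv2d(1, 16)
y_r, y_i = layer(torch.randn(4, 1, 64, 64), torch.randn(4, 1, 64, 64))
```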

    In the understanding stage of SAR signal and physical properties, empirical or theoretical models for processing or analysis have been developed based on the physical nature of SAR. With the rapid development of DL technology, a growing number of scholars have recently begun to focus on how to employ data-driven advantages to compensate for the deficiencies of present theoretical and empirical methodologies.

    The aim of SAR image simulation is to simulate scenes and targets under different imaging conditions, and the simulated samples can contribute to subsequent interpretation tasks. Physical model-based electromagnetic simulation of SAR targets has been a significant challenge for decades. The selection of simulation parameters is a vital step that directly affects the resemblance of simulation results to real-world data, with a successful simulation facilitating subsequent interpretation tasks. As described in Section 3.1, DL can be embedded into a physical model to improve the parameterization. Several typical examples are introduced here. Niu et al.[51] proposed to use different DNNs to learn simulation parameters from real SAR images, thereby enabling autonomous parameter setup and enhancing the similarity between simulation results and real samples. In addition, Niu et al.[52] incorporated a DNN into the electromagnetic simulation system to learn the electromagnetic reflection model from the real SAR image while keeping the imaging model unchanged; that is, the DNN was used to improve the calculation of the electromagnetic reflection coefficient, thereby significantly improving the quality of the simulations.

    Currently, a more common approach is to carry out SAR image simulation entirely with a DNN, as described in Section 3.2, typically implemented using a Generative Adversarial Network (GAN)-based technique. In the initial stages, GAN-based SAR image simulation was still based on relevant technologies in the field of computer vision[53]. Subsequent research began to consider the physical rules that must be observed when generating SAR images. For instance, using a conditional GAN to incorporate physical parameters such as category, azimuth angle, and depression angle, Oh et al.[54] proposed PeaceGAN based on multi-task learning to generate SAR images and complete target pose estimation and classification, taking into account the azimuth angle that influences the imaging results of SAR targets. Similar studies can be found in Refs. [55-57]. On the basis of the Conditional Variational Auto-Encoder (CVAE) and GAN, Hu et al.[58] constructed an SAR target generation model with an explainable feature space. The GAN-based generation model can output an SAR target with a given category and observation angle. The CVAE-based feature space provides a continuous feature representation over changing azimuth angles that is available for target recognition.

    In general, the studies[51,52] retain the physical process of electromagnetic simulation, and the DNN is used to improve submodules of the physical model, such as parameter selection. Strong generalization performance is achieved by ensuring that the simulation results have physical consistency, hence enhancing the quality of the simulated image. This field has a great deal of room for improvement in the future. GAN-based SAR image simulation methods[53-58] have obvious advantages in computational complexity, operability, and other aspects. However, an urgent challenge in this area is how to guarantee that generated SAR images do not contradict the laws of electromagnetic scattering and can be interpreted with physical knowledge. Some research in other fields, such as fluid simulation[59,60], includes physical parameters in GANs to regulate the generated results, where physical equations are embedded in adversarial learning to bring the generated results closer to reality, thus providing references and inspiration for the future development of this area.
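    As an illustration of conditioning the generation on physical imaging parameters, the hedged sketch below shows a generator whose input includes the target class and azimuth angle, in the spirit of Refs. [54-57]. The architecture, the sin/cos angle encoding, and the 64×64 amplitude output are assumptions rather than any published design.

```python
# Sketch of a GAN generator conditioned on physical imaging parameters
# (target category and azimuth angle); the discriminator is omitted.
import torch
import torch.nn as nn

class PhysicsConditionedGenerator(nn.Module):
    def __init__(self, n_classes=10, z_dim=100):
        super().__init__()
        self.embed = nn.Embedding(n_classes, 16)  # target category embedding
        self.fc = nn.Sequential(
            nn.Linear(z_dim + 16 + 2, 128 * 8 * 8), nn.ReLU(),
        )
        self.deconv = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1), nn.Sigmoid(),
        )

    def forward(self, z, cls, azimuth_deg):
        # encode azimuth as (sin, cos) so that 0 deg and 360 deg coincide
        az = torch.deg2rad(azimuth_deg)
        cond = torch.cat([z, self.embed(cls), torch.sin(az), torch.cos(az)], dim=1)
        h = self.fc(cond).view(-1, 128, 8, 8)
        return self.deconv(h)  # (N, 1, 64, 64) amplitude image

g = PhysicsConditionedGenerator()
img = g(torch.randn(2, 100), torch.tensor([0, 3]), torch.tensor([[30.0], [75.0]]))
```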

    Recent research has also focused on incorporating DL technology into SAR imaging systems to enhance imaging quality and computing efficiency. The improvement of imaging quality is conducive to subsequent SAR image interpretation. Meanwhile, Luo et al.[41] proposed the idea of establishing the integration of imaging and interpretation through DL, where the target parameters are learned from echo data to serve SAR image interpretation.

    Currently, one line of research aims to integrate DL into existing imaging algorithms to improve parameter selection, as summarized in Section 3.1. In ISAR imaging, for instance, Qian et al.[61] noted that the traditional Range Instantaneous Doppler (RID) methods utilized for maneuvering target imaging suffer from low resolution and poor noise suppression capability. Thus, they proposed a super-resolution ISAR imaging method in which DL assists Time-Frequency Analysis (TFA). A DNN is utilized to learn the mapping function between the low-resolution spectrum input and its high-resolution reference signal, which is then integrated into the RID imaging system to achieve super-resolution with clear focusing. Traditional imaging techniques based on compressed sensing require appropriate parameters to be predefined manually and a significant amount of time for reconstruction using iterative steps. Liang et al.[62] proposed that a CNN be combined with the traditional Iterative Shrinkage-Thresholding Algorithm (ISTA) to automatically learn the best parameters in the imaging process while ensuring that the algorithm remains physically interpretable.

    Other studies apply DNNs to simulate imaging algorithms. The essence is to reformulate a signal processing optimization algorithm as a neural network and train the algorithm parameters with DNN techniques to enhance imaging performance. Many iterative sparse reconstruction algorithms can be unfolded as learnable neural networks, such as the Learnable ISTA (LISTA)[63], Analytical LISTA (ALISTA)[64], and neuro-enhanced ALISTA[65], all developed from ISTA, as well as ADMM-NET[66] and ADMM-CSNET[67] based on the Alternating Direction Method of Multipliers (ADMM). In the field of sparse microwave imaging, Mason et al.[68] first proposed to model the SAR imaging process using a DNN based on the ISTA algorithm and demonstrated that DL has faster convergence and lower reconstruction error than the traditional ISTA algorithm. Many researchers then continued the investigation, such as Refs. [69-72], and Refs. [73-75] studied SAR learning-based imaging with ADMM. A recent work[76] presented a brief overview of the current research and pointed out that the interpretability and universality of DL algorithms in SAR imaging applications should be strengthened further in future studies. In general, DL is anticipated to lead to imaging algorithms with greater speed, precision, and resolution, as well as the construction of an intelligent, integrated system for image interpretation, which has enormous potential for future growth.
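    The unrolling idea can be stated compactly: each ISTA iteration $x^{(k+1)} = \mathrm{soft}(W y + S x^{(k)}, \theta^{(k)})$ becomes one network layer whose matrices $W$, $S$ and thresholds $\theta^{(k)}$ are learned from data instead of being fixed by the measurement operator. A minimal real-valued LISTA sketch in the spirit of Ref. [63] follows; in practice the SAR imaging operator is complex-valued, and the initialization from a given matrix A is the classical ISTA step, otherwise an illustrative assumption.

```python
# Minimal LISTA sketch: K unrolled ISTA iterations for
# min ||y - A x||^2 + lam * ||x||_1, with learnable W, S, thresholds.
import torch
import torch.nn as nn

class LISTA(nn.Module):
    def __init__(self, A, K=10, lam=0.1):
        super().__init__()
        m, n = A.shape
        L = float(torch.linalg.norm(A, 2)) ** 2     # Lipschitz constant of A^T A
        self.W = nn.Parameter(A.t() / L)            # ISTA initialization: A^T / L
        self.S = nn.Parameter(torch.eye(n) - A.t() @ A / L)
        self.theta = nn.Parameter(torch.full((K,), lam / L))
        self.K = K

    @staticmethod
    def soft(x, t):
        # soft-thresholding, the proximal operator of the l1 norm
        return torch.sign(x) * torch.clamp(x.abs() - t, min=0.0)

    def forward(self, y):
        x = self.soft(y @ self.W.t(), self.theta[0])
        for k in range(1, self.K):
            x = self.soft(y @ self.W.t() + x @ self.S.t(), self.theta[k])
        return x
```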

    Understanding the physical microwave features of SAR images is crucial for the interpretation of SAR targets and scenes. For example, the polarization decomposition model, sub-aperture decomposition model, and interferometric phase diagram have been widely used in the fields of land classification[77], moving target detection[78], and terrain subsidence monitoring[79]. In some cases, the current physical property analysis model is difficult to use directly. The polarization decomposition of dual-polarization and single-polarization SAR data is less efficient[49], and interferometry has phase noise because of the atmosphere and topography[80]. Currently, some DL technologies have substituted the physical model in the aforementioned fields, employing a data-driven approach to learn the physical properties of SAR, as described in Section 3.3.2.

    Polarization information inversion is a typical task that seeks to infer the entire polarization features of the target from partially polarized SAR data. Refs. [81,82] proposed to use DNNs to learn complete polarization information from single-polarized SAR images. Song et al.[81] proposed to employ CNN to extract texture features of single-channel SAR amplitude images and then map them into the polarization feature space through a feature conversion network to retrieve the essential elements in the polarization covariance matrix. A similar study is Ref. [83]. Zhao et al.[82] proposed a complex-valued CNN to learn volume and single scattering transfer to analyze the physical scattering characteristics of single/dual-polarized SAR. All the aforementioned DL methods employ polarization physical models, such as the polarization covariance matrix[81,83] and the Cloude-Pottier decomposition[82], to generate the ground truth for supervision, thereby allowing the target polarization scattering characteristics of single-channel SAR images to be described.

    SAR image colorization is one of the main applications in this direction. Notably, a major research branch of SAR image colorization is analogous to image colorization for grayscale pictures[84] or style transfer[85] in the field of computer vision. It focuses primarily on how to assign colors to a single-channel grayscale SAR image or convert it to the RGB optical remote sensing image format to aid visual interpretation by humans[86]. However, the physical consistency of SAR is not guaranteed. Refs. [81-83] focus more on the acquisition of polarization information through DL, where SAR image colorization is achieved via Pauli decomposition so that the colored results have physical meaning.

    The sub-aperture decomposition of SAR images along azimuth has been widely applied in research on moving target detection and coherent target detection[87,88]. Spigai et al.[89] proposed a two-dimensional TFA for complex-valued SAR images to model four canonical targets. Ferro-Famil et al.[90] modeled non-stationary targets by examining azimuth-dependent polarization properties. Such empirical models cannot adequately generalize to all complicated SAR targets in wide-area scenarios. To address this issue, Huang et al.[91] proposed an unsupervised learning method to automatically mine the target backscattering variation patterns and extended it to polarized SAR data[92] (as shown in Fig. 5). In this way, the empirical knowledge in Refs. [89,90] was verified and improved through a data-driven approach.

    Figure  5.  The unsupervised learning results of different polarized SAR images based on TFA and pol-extended TFA models[92]
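    For reference, a minimal sketch of the azimuth sub-aperture decomposition itself is given below: the azimuth spectrum of a single-look complex image is split into disjoint sub-bands, each transformed back to yield "looks" at different squint angles. The number of sub-apertures and the azimuth axis are assumptions.

```python
# Azimuth sub-aperture decomposition sketch for a single-look complex image:
# split the azimuth spectrum into disjoint sub-bands and invert each one.
import numpy as np

def subaperture_decompose(slc, n_sub=4, axis=0):
    """slc: complex (H, W) image -> (n_sub, H, W) sub-aperture images."""
    spec = np.fft.fftshift(np.fft.fft(slc, axis=axis), axes=axis)
    n = slc.shape[axis]
    width = n // n_sub
    subs = []
    for i in range(n_sub):
        mask = np.zeros(n)
        mask[i * width:(i + 1) * width] = 1.0     # select one azimuth sub-band
        shape = [1, 1]; shape[axis] = n
        sub_spec = spec * mask.reshape(shape)
        subs.append(np.fft.ifft(np.fft.ifftshift(sub_spec, axes=axis), axis=axis))
    # stationary scatterers look alike in every sub-image; non-stationary ones vary
    return np.stack(subs)
```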

    The physical characteristics learned by a DNN should fit the physical nature of SAR and be physically interpretable. According to Song et al.[81], the covariance matrix predicted from single-polarization SAR images should satisfy the positive semi-definite constraint. The target scattering types learned with unsupervised learning[91] should cover the four canonical targets proposed in Ref. [89]. De et al.[93] attempted to interpret the results of DNNs and relate their outputs to the physical properties of SAR. A sketch of enforcing such a constraint during training is given below.
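    One hedged way to encode such a requirement is a soft penalty that pushes negative eigenvalues of the predicted covariance matrices toward zero during training; the (N, 3, 3) Hermitian layout and the penalty weight are assumptions, not the formulation used in Ref. [81].

```python
# Soft positive semi-definiteness penalty for predicted covariance matrices:
# the penalty is zero iff every matrix in the batch is PSD.
import torch

def psd_penalty(C):
    """C: (N, 3, 3) predicted Hermitian covariance matrices."""
    C = 0.5 * (C + C.conj().transpose(-1, -2))      # symmetrize the prediction
    eigvals = torch.linalg.eigvalsh(C)              # real eigenvalues, ascending
    return torch.relu(-eigvals).sum(dim=-1).mean()  # penalize negative eigenvalues

# total_loss = task_loss + beta * psd_penalty(pred_cov)   # beta: assumed weight
```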

    Research on traditional DL algorithms for semantic understanding and SAR applications started earlier than that on understanding SAR signals and characteristics. In the past few years, such algorithms have established a solid research foundation in automatic target recognition, scene classification, and change detection. Recently, an increasing number of researchers have been focusing on how to integrate the benefits of physical models with DL techniques.

    This section introduces a novel DL paradigm for SAR image semantic understanding tasks based on the method proposed in our recent work[94].

    We denote the complex-valued SAR image as $x$ and the semantic label as $y$. The traditional data-driven learning paradigm constructs an end-to-end DNN mapping $f$, which takes $x$ (or mostly the amplitude information $x_I$) as the input and outputs the semantic label $y$, denoted as $f: x \rightarrow y$. The form of the input data determines whether the neural network parameters are real- or complex-valued. Training the mapping $f$ requires a large number of labeled samples $(x, y)$. When dealing with limited annotation, a common practice is to employ transfer learning and other optimization techniques, which will not be elaborated upon here. This paper investigates the use of physics knowledge to lessen the dependence of DNNs on labeled data. The Physics-Guided and Injected Learning (PGIL) introduced here relies on the concepts presented in Sections 3.4 and 3.5, as shown in Fig. 6.

    Figure  6.  Physics guided and injected learning

    The physical model $f_{\mathrm{phy}}$ is assumed to take SAR image data $x$ as input, and its output is written as $y_{\mathrm{phy}}$. For the SAR image, $y_{\mathrm{phy}}$ can represent the attribute scattering center[95], the polarization scattering characteristic, or the sub-aperture decomposition result[11]. The PGNN proposed in Ref. [32] combines $y_{\mathrm{phy}}$ and observations to form hybrid physical data as network input for prediction, that is, it learns the mapping

    $$f: \{x, y_{\mathrm{phy}}\} \rightarrow y \tag{1}$$

    This method can be simply summarized as multimodal fusion learning. Previous studies realized fusion at the data[96], feature[95], and decision levels[97] in SAR image classification or target recognition.

    The presented PGIL paradigm differs slightly from the fusion strategy described previously. It consists of two phases: unsupervised Physics-Guided Learning (PGL) and supervised Physics-Injected Learning (PIL):

    $$f_{\mathrm{PGL}}: \{x_I, y_{\mathrm{phy}}\} \rightarrow F_{\mathrm{PA}} \tag{2}$$
    $$f_{\mathrm{PIL}}: \{x, F_{\mathrm{PA}}\} \rightarrow y \tag{3}$$

    PGL employs the knowledge provided by the physical model to drive DNN training without annotations to acquire the semantically discriminating feature representation $F_{\mathrm{PA}}$ with physical perceptive capacity. Unlike the case where $y_{\mathrm{phy}}$ is used directly as ready-made fusion information, the $F_{\mathrm{PA}}$ obtained by PGL under the guidance of $y_{\mathrm{phy}}$ is in the form of a feature map, which is more adaptive and closer to the high-level semantics of SAR images and can assist target tasks efficiently. The robust generalizability of $F_{\mathrm{PA}}$ is ensured by the unsupervised training mode’s capacity to make full use of large-scale training samples. In PIL, $F_{\mathrm{PA}}$ is injected into the traditional data-driven network through a feature transformation layer, and supervised learning is performed with a few labeled samples. The physical knowledge in PGL can serve as a constraint term of the objective function via the integrated network to restrict network training. Multimodal fusion learning described in Eq. (1) can be considered a special case of Eq. (3). Readers can also refer to the relevant work on multimodal feature fusion[98,99] to design the PIL feature injection method.
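    A schematic sketch of the two-phase procedure of Eqs. (2) and (3) is given below; `physics_model`, `encoder`, `injector`, `classifier`, and the losses are hypothetical placeholders standing in for the concrete components of Refs. [92,94,100], not the authors' exact implementation.

```python
# Schematic two-phase PGIL training skeleton (placeholders throughout).
import torch

def train_pgil(unlabeled_loader, labeled_loader, physics_model,
               encoder, injector, classifier, pgl_loss, ce, opt1, opt2):
    # Phase 1: unsupervised physics-guided learning (Eq. 2) on unlabeled data
    for x in unlabeled_loader:                  # x: complex-valued SAR images
        y_phy = physics_model(x)                # scattering knowledge, no labels
        f_pa = encoder(x.abs())                 # physics-aware features F_PA
        loss = pgl_loss(f_pa, y_phy)            # ties features to physics
        opt1.zero_grad(); loss.backward(); opt1.step()

    # Phase 2: supervised physics-injected learning (Eq. 3) on few labels
    for x, y in labeled_loader:
        with torch.no_grad():
            f_pa = encoder(x.abs())             # frozen physics-aware features
        logits = classifier(injector(x, f_pa))  # inject F_PA into the network
        loss = ce(logits, y)
        opt2.zero_grad(); loss.backward(); opt2.step()
```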

    We have conducted substantial research on the SAR image classification problem based on the aforementioned PGIL learning paradigm, which is reviewed in this section. Deep SAR-Net (DSN)[11] extends the sub-aperture decomposition of complex SAR images to the continuous two-dimensional frequency domain to obtain a high-dimensional time-frequency “hyper-image,” and spatial-frequency domain feature fusion was proposed for SAR image classification, as shown in Fig. 7. Unlike the Complex-Valued CNN (CV-CNN), which acts directly on the complex image to learn the mapping from the complex domain to semantic labels, the image decomposition based on TFA theory is equivalent to replacing a part of the neural network layers of the CV-CNN to obtain an interpretable feature expression, as described in Section 3.3.1. NN-2 in Fig. 7 was initialized by unsupervised pre-training to obtain the spatially constrained frequency features, that is, to achieve Eq. (2); Eq. (3) is realized by the subsequent feature fusion and label prediction. The experimental results verified that DSN is superior to traditional CNNs, especially when only a few annotated samples are available: the overall accuracy improves by 8.58%, and for man-made targets by 14.06%. The performance is also greatly improved compared with that of the data-driven CV-CNN under the small-sample condition.

    Figure  7.  The SAR image classification framework Deep SAR-Net (DSN)[11]

    Huang et al.[92,100] recently proposed an unsupervised PGL method for SAR scene image classification and polar sea ice type recognition to learn the discriminant semantic feature representation with physical perception ability. This method aims to improve the transparency of the model and enhance the understanding of the physical meaning of SAR images by DNNs. Physical models used for guidance include the entropy-based H/α-Wishart algorithm[101], Kennaugh matrix, geodesic distance-based GD-Wishart algorithm[102], the continuous two-dimensional subband decomposition of single-pol complex SAR image, and the extended version on polarized data[89,92]. The objective function is proposed according to a basic assumption that a correlation exists between SAR physical scattering characteristics and SAR image semantics, as shown in Fig. 4, where the scattering class is highly related to semantics. On the basis of the semantic relationship, a differentiable objective function is designed to guide the neural network to learn features that possess physical perception ability and contain high-level semantic information, completing the process described in Eq. (2). The unsupervised training strategy can make full use of the unlabeled SAR image samples to ensure the generalization performance of the features on the test set. Ref. [94] quantifies the physical perception ability of features, demonstrating that the features learned by PGL have physical constraints that conventional CNN features do not. Fig. 8 shows the feature distribution of the unsupervised physical guided learning method on the training set and the test set[100] compared with features of CNN supervised learning. The features obtained from the PGL can ensure better semantic discrimination on the test dataset.

    Figure  8.  The feature visualization of the unsupervised physics guided learning and supervised CNN classification on training and test set[100]

    Ref. [92] proposed to combine the physics-guided network with a small-sample learning algorithm for classification in the decision stage, as mentioned in Section 3.5. Ref. [94] designed a multi-scale feature transformation operator to realize Eq. (3), and constraints from physics-guided learning were added to the decision-learning process to ensure the physical consistency of the classification semantic features. In addition, the authors explained $y_{\mathrm{phy}}$ as the guide signal that drives PGL training[94], evaluated the importance of the physical model in the algorithm, and determined how the final prediction result can be constrained to be physically consistent. The existing defects of the algorithm and directions for future improvement were discussed by interpreting $y_{\mathrm{phy}}$.

    SAR Automatic Target Recognition (SAR-ATR) has always received great attention. The classical definition of SAR-ATR includes three steps: target detection, discrimination, and recognition. On the basis of end-to-end DNNs, SAR-ATR is generally divided into two parts: target detection and target recognition.

    The Attribute Scattering Center (ASC) of SAR targets is widely used in traditional SAR-ATR. On the basis of geometric diffraction theory and physical optics theory, ASC employs a set of parameters to characterize the electromagnetic and geometric characteristics of the target, which can accurately depict the physical properties of the SAR target[48]. Some recent advanced research integrated the ASC model with DL for SAR target recognition.

    Some research follows Eq. (1) for feature fusion. For example, Zhang et al.[95] proposed a two-way FEC learning framework, which transformed a parameterized representation of attribute scattering center into bag-of-words features fused with a CNN feature map. Li et al.[103] transformed ASC into several component feature maps with definite physical significance and fused them with the global features of CNN to efficiently capture the local electromagnetic characteristics of the target. Ref. [104], similar to Ref. [103], conducted partial convolution learning for ASC components with a bidirectional recurrent neural network. Liu et al.[105] regarded the amplitude and phase of SAR target images as multiple modalities and proposed multimodal manifold feature learning and fusion to achieve target recognition. Under the conditions of limited samples, the performance was enhanced, but the implications of amplitude and phase on the recognition process have yet to be clarified. Refs. [95,103,104] used the geometric information of the scattering center provided by the ASC model to strengthen the DL model’s understanding of SAR targets, which is more helpful in improving the interpretability of the neural network.

    As indicated in Section 3.4, another line of research is the heuristic design of DNNs from physical models. The PGNN design embeds physical principles into neural networks, making them physically interpretable. Liu et al.[106] proposed to transfer the domain knowledge of the ASC model to the first-layer convolution kernels of a CV-CNN, which endows the initialization of neurons with physical significance. Fig. 9 visualizes the amplitude information of the first-layer complex convolution kernels based on the ASC model[106]; the horizontal axis denotes azimuth angles ranging from 0° to 90° at 10° intervals, and the vertical axis represents different lengths of the scattering center. Compared with randomly initialized neurons, this method not only speeds up network optimization significantly but also provides deep network interpretability, yielding hidden-layer feature representations with physical meaning. Similar work includes the polarimetric rotation kernel proposed by Cui et al.[107], which adaptively learns the polarimetric rotation angle in CNNs. With regard to how physical knowledge can be used in neural network design, readers can consult relevant studies from other disciplines[108,109].

    Figure  9.  The amplitude images of convolution kernels in the first layer of CV-CNN based on ASC model initialization[106]
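    As a heavily simplified illustration of such physically initialized kernels, the sketch below builds first-layer complex kernels from a distributed-scatterer frequency response that varies exactly the two parameters shown in Fig. 9, length and orientation. The frequency band, aspect span, the use of NumPy's normalized sinc, and the omission of the dispersion and other ASC terms are all assumptions; this is not the construction of Ref. [106].

```python
# Simplified ASC-inspired kernel bank: each kernel is the 2D inverse FFT of a
# distributed-scatterer response with a given length L and orientation phi_bar.
import numpy as np

def asc_kernel(L, phi_bar_deg, size=16, fc=1e10, bw=5e8, c=3e8):
    f = fc + np.linspace(-bw / 2, bw / 2, size)      # frequency samples (Hz)
    phi = np.radians(np.linspace(-5, 5, size))       # aspect samples (rad)
    F, PHI = np.meshgrid(f, phi, indexing="ij")
    phi_bar = np.radians(phi_bar_deg)
    # sinc-shaped aspect response of a distributed scatterer of length L
    resp = np.sinc(2 * F * L / c * np.sin(PHI - phi_bar))
    kernel = np.fft.fftshift(np.fft.ifft2(resp))     # complex spatial kernel
    return kernel / np.abs(kernel).max()

# one kernel per (length, azimuth) pair, as varied in Fig. 9
bank = np.stack([asc_kernel(L, a) for L in (0.5, 1.0, 2.0)
                 for a in range(0, 91, 10)])
```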

    Common research concepts for SAR target recognition also include transfer learning and domain adaptation. Some past studies proposed transfer learning and domain adaptation methods from SAR scene images[9], natural images, and optical remote sensing images[8,10] to SAR target recognition. In 2017, Malmgren-Hansen et al.[39] first proposed to use simulated SAR targets as source data for learning real SAR target classes. However, SAR targets are sensitive to imaging parameters, such as azimuth angle and wavelength, which greatly affect the morphological structure in SAR images. To guarantee that a pre-trained model can properly recognize SAR objects under real imaging settings, physical perception must be provided. Recently, He et al.[110] applied domain adaptation to narrow the difference of high-level features among simulated SAR targets under different imaging conditions to ensure that the pre-trained model can identify various SAR targets. In Ref. [111], simulated SAR targets are used as the source data in few-shot learning for SAR target recognition, and data augmentation is conducted with SAR domain knowledge related to the azimuth, amplitude, and phase data of vehicles. Similarly, Agarwal et al.[112] proposed to augment SAR data by azimuth interpolation based on a physical model to support the training of DL algorithms.

    Common public datasets for target recognition, such as the Moving and Stationary Target Acquisition and Recognition (MSTAR)[113] and OpenSARShip[114], provide original complex data, which are helpful for conducting research on PXDL and integrating physical models such as attribute scattering centers into data-driven methods. However, the public datasets oriented to SAR target detection basically provide only amplitude images, such as AIR-SARShip[115]. DL-based target detection mostly comes from computer vision and is performed in the image domain (for details, please refer to the review article[116]). This issue is another significant obstacle that restricts the development of physically explainable DL methods for object detection. Lei et al.[96] proposed a feature-enhanced DL method based on a complex SAR ship target detection dataset with rotated bounding boxes, which feeds the scattering results of sub-aperture decomposition together with amplitude information into the network for learning. There is still a great deal of room for further development of advanced SAR target detection datasets with more comprehensive information and of PXDL detection methods.

    SAR image semantic segmentation aims to assign a semantic label to each pixel in an SAR image. Because of the complex background, speckle interference, discontinuity of target morphology, and other phenomena, producing satisfactory results on SAR images with DL segmentation methods from computer vision is challenging. In the early stages, pixel-level SAR image classification was often performed on Polarimetric SAR (PolSAR) images because the polarimetric features assisted in distinguishing between target classes with differing scattering characteristics. Later DL algorithms often took the polarimetric feature components as input to improve the understanding of target scattering characteristics[117-120]. The aforementioned studies can be classified under the concept described in Section 3.3.1.

    Building segmentation is an important branch of SAR image semantic segmentation and is widely used in the fields of global urbanization monitoring and three-dimensional reconstruction of buildings. We mainly discuss the research progress in this area. Recently, multiple Building Area (BA) segmentation or building segmentation datasets have been proposed[121-125]. BA segmentation aims to annotate large urban BAs in large-area medium-resolution SAR images, for instance, the GF-3 FSII SAR data used in Ref. [122]. In contrast, building segmentation mainly focuses on building instances in higher-resolution SAR images[121,123-125]. Because of the difficulties of visual interpretation of SAR images, most building segmentation datasets are annotated based on optical images or auxiliary data such as street maps (e.g., OpenStreetMap, OSM). Fig. 10(a) shows two annotation cases in the building segmentation dataset proposed by Xia et al.[121] and the prediction results of the DL model. Notably, skyscrapers can be seen clearly in optical images, but they cannot be recognized in SAR images because of complex scattering. The ground truth provided by OSM reflects the building footprint projected on the ground; thus, the DL model has difficulty learning the semantic mapping directly from the SAR image to the ground truth. In another case, the optical image shows that no building is present in the upper-right corner. Because of the geometric feature of layover in the SAR image, however, the DNN can identify the backscattering of buildings even though the real building is not in the top-right corner. Strictly speaking, what is displayed in SAR images is not the building itself but the backscattering of electromagnetic waves acting on the target and its surrounding environment. The DL model needs physical knowledge to bridge the gap from the backscattering representation to the target semantics.

    Figure  10.  The different SAR image building segmentation datasets and algorithms[121,123]

    The InSAR image building segmentation dataset proposed by Chen et al.[123] provides annotations of the scattering characteristics of buildings, with either four categories (shadow, layover, single scattering, and secondary scattering) or two categories (layover and shadow), as shown in Fig. 10(b). CVCMFF-Net, a complex-valued convolutional image segmentation network proposed in Ref. [123], uses the master and slave images of InSAR as input to establish a mapping from complex SAR images to the basic scattering characteristics of buildings. The segmentation results have explainable physical meaning, which can help in analyzing the semantics, location, height, and other information of buildings. Qiu et al.[125] recently proposed an SAR microwave vision 3D imaging dataset based on single-view complex image data, which provides detailed semantic annotation for building instances and retains the layover count information. However, the paper also shows that semantic segmentation models (such as Mask R-CNN) based only on visual information achieve low accuracy[125], and building segmentation based on this dataset is difficult. More research on PXDL techniques based on this dataset is anticipated in the foreseeable future.

    In general, the PXDL for SAR image interpretation is still in its infancy. Most advanced research focuses on the SAR automatic target recognition field due to the early start of DL development in this direction and the support of MSTAR and other public datasets that contain complex data, imaging parameters, and other multidimensional information. In view of some problems and challenges that remain in the current field, future research can be conducted in the following directions:

    (1) SAR image interpretation dataset

    To promote the further development of PXDL methods in this field that deeply combines with the physical characteristics of SAR, constructing SAR image interpretation datasets with richer information beyond amplitude images is of great significance. Such datasets include SAR target detection and recognition datasets with complex data and imaging information, SAR image classification or segmentation datasets with physical scattering characteristics, and SAR echo datasets.

    (2) Physical constraint in data-driven learning

    Learning, simulating, or replacing the physical processes via DNNs without considering the physical knowledge of SAR in applications such as SAR target simulation and super-resolution reconstruction is inappropriate. In the future, physical constraints in data-driven learning should be strengthened. The network training should be constrained by punishing results that violate physical laws so that physically consistent prediction results can be obtained. DNN can also be used to model the errors of the physical model to improve the existing physics and theory[29].

    (3) Physics-guided learning for SAR

    At present, many studies have embedded physics knowledge into neural network models from the perspective of feature fusion, thus realizing the initial attempt at hybrid modeling. To break through the application bottleneck of a small number of labeled samples, a PGNN that promotes unsupervised learning needs to be developed. PGL aims to make full use of a physical model and domain knowledge, as well as a large amount of unlabeled SAR image data, where the objective function or model structure design can be motivated by physical laws. It can automatically mine feature representation with strong generalization ability and physical perception ability. Furthermore, PGL should be combined with optimization methods, such as few-shot learning, zero-shot learning, and meta-learning, to form an SAR-specific small sample DL system.

    (4) Interdisciplinary with XAI

    Not only should a DL-based SAR image interpretation method achieve faster speed, higher accuracy, and stronger performance across a variety of tasks, but it must also meet practical application requirements such as a more transparent algorithm, more trustworthy results, and greater stability against disturbance. Most of the current studies discussed in this paper concentrate on how to integrate DL with the SAR physics model, mainly focusing on the improvement of model performance after the incorporation of physics knowledge, with few discussions on interpretability. The information and priors provided by the theoretical physical model are anticipated to establish a balance between the algorithm’s transparency and degree of intelligence, thus enabling human–computer interaction. A crucial step is to conduct interdisciplinary research that combines XAI-related theories and technology to advance the PXDL approaches for SAR interpretation.

    (5) Combined with uncertainty quantification

    Some practical application scenarios of SAR interpretation require reliable prediction results. Although DL methods have achieved high accuracy in some SAR image interpretation tasks, users still cannot fully trust the predictions of deep models. For example, the physical characteristics of SAR targets are sensitive to imaging parameters, and small perturbations may lead to dramatic changes in the results. Where training samples are lacking, users need to perceive the uncertainty of the model’s predictions, either taking it as a confidence reference or discarding overly suspicious results. Combining uncertainty quantification with PXDL is conducive to reducing the uncertainty of predicted results caused by data-driven learning bias by taking advantage of the physical model’s robustness.

    In the past few years, SAR image interpretation technology based on DL has continually raised the evaluation metrics for a variety of tasks and produced remarkable results in comparison with model-based methods. The era of post-DL, which combines knowledge-driven and data-driven approaches, has arrived. The interpretable SAR physical mechanism and the learnable DNN have extensive development opportunities and are complementary. This paper provides an overview of the fundamental concepts of physics-based machine learning and summarizes the challenges and feasibility of developing PXDL methods in the SAR image interpretation field. We review recent cutting-edge research that combines DL and physical models in the understanding of SAR signals and physical characteristics, as well as semantic understanding and applications, and look forward to future developments. Research in this field is not yet mature, and it is anticipated that more professionals and academics from diverse domains will participate, learn from one another, and conduct more in-depth studies on PXDL algorithms for SAR in the future.

  • [1]
    CUMMING I G and WONG F H. Digital Processing of Synthetic Aperture Radar Data: Algorithms and Implementation[M]. HONG Wen, HU Donghui, HAN Bing, et al. trans. Beijing: Publishing House of Electronics Industry, 2019: 93–100.
    [2]
    HUANG Zhongling. A study on synthetic aperture radar image classification with deep learning[D]. [Ph. D. dissertation], University of Chinese Academy of Sciences, 2020: 59.
    [3]
    GU Xiuchang, FU Kun, and QIU Xiaolan. Fundamentals of SAR Image Interpretation[M]. Beijing: Science Press, 2017.
    [4]
    OLIVER C and QUEGAN S. Understanding Synthetic Aperture Radar Images[M]. London: SciTech Publishing, 2004.
    [5]
    GAO Gui, OUYANG Kewei, LUO Yongbo, et al. Scheme of parameter estimation for generalized gamma distribution and its application to ship detection in SAR images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(3): 1812–1832. doi: 10.1109/TGRS.2016.2634862
    [6]
    LENG Xiangguang, JI Kefeng, ZHOU Shilin, et al. Ship detection based on complex signal kurtosis in single-channel SAR imagery[J]. IEEE Transactions on Geoscience and Remote Sensing, 2019, 57(9): 6447–6461. doi: 10.1109/TGRS.2019.2906054
    [7]
    CHEN Sizhe, WANG Haipeng, XU Feng, et al. Target classification using the deep convolutional networks for SAR images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2016, 54(8): 4806–4817. doi: 10.1109/TGRS.2016.2551720
    [8]
    HUANG Zhongling, DUMITRU C O, PAN Zongxu, et al. Classification of large-scale high-resolution SAR images with deep transfer learning[J]. IEEE Geoscience and Remote Sensing Letters, 2021, 18(1): 107–111. doi: 10.1109/LGRS.2020.2965558
    [9]
    HUANG Zhongling, PAN Zongxu, and LEI Bin. Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data[J]. Remote Sensing, 2017, 9(9): 907. doi: 10.3390/rs9090907
    [10]
    HUANG Zhongling, PAN Zongxu, and LEI Bin. What, where, and how to transfer in SAR target recognition based on deep CNNs[J]. IEEE Transactions on Geoscience and Remote Sensing, 2020, 58(4): 2324–2336. doi: 10.1109/TGRS.2019.2947634
    [11]
    HUANG Zhongling, DATCU M, PAN Zongxu, et al. Deep SAR-Net: Learning objects from signals[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2020, 161: 179–193. doi: 10.1016/j.isprsjprs.2020.01.016
    [12]
JIN Yaqiu. Multimode remote sensing intelligent information and target recognition: Physical intelligence of microwave vision[J]. Journal of Radars, 2019, 8(6): 710–716. doi: 10.12000/JR19083
    [13]
ZHANG Bo, ZHU Jun, and SU Hang. Toward the third generation of artificial intelligence[J]. SCIENTIA SINICA Informationis, 2020, 50(9): 1281–1302. doi: 10.1360/SSI-2020-0204
    [14]
    DAS A and RAD P. Opportunities and challenges in explainable artificial intelligence (XAI): A survey[OL]. arXiv: 2006.11371, 2020.
    [15]
    BAI Xiao, WANG Xiang, LIU Xianglong, et al. Explainable deep learning for efficient and robust pattern recognition: A survey of recent developments[J]. Pattern Recognition, 2021, 120: 108102. doi: 10.1016/j.patcog.2021.108102
    [16]
    ANGELOV P and SOARES E. Towards explainable deep neural networks (xDNN)[J]. Neural Networks, 2020, 130: 185–194. doi: 10.1016/j.neunet.2020.07.010
    [17]
    MOLNAR C. Interpretable machine learning: A guide for making black box models explainable[EB/OL]. https://christophm.github.io/interpretable-ml-book/, 2021.
    [18]
    CAMBURU O M. Explaining deep neural networks[D]. [Ph. D. dissertation], Oxford University, 2020.
    [19]
LI Weijie, YANG Wei, LIU Yongxiang, et al. Research and exploration on interpretability of deep learning model in radar image[J]. SCIENTIA SINICA Informationis, in press. doi: 10.1360/SSI-2021-0102
    [20]
    BELLONI C, BALLERI A, AOUF N, et al. Explainability of deep SAR ATR through feature analysis[J]. IEEE Transactions on Aerospace and Electronic Systems, 2021, 57(1): 659–673. doi: 10.1109/TAES.2020.3031435
    [21]
GUO Weiwei, ZHANG Zenghui, YU Wenxian, et al. Perspective on explainable SAR target recognition[J]. Journal of Radars, 2020, 9(3): 462–476. doi: 10.12000/JR20059
    [22]
    KARNIADAKIS G E, KEVREKIDIS I G, LU Lu, et al. Physics-informed machine learning[J]. Nature Reviews Physics, 2021, 3(6): 422–440. doi: 10.1038/s42254-021-00314-5
    [23]
    THUEREY N, HOLL P, MUELLER M, et al. Physics-based deep learning[OL]. arXiv: 2109.05237, 2021.
    [24]
    RAISSI M, PERDIKARIS P, and KARNIADAKIS G E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations[J]. Journal of Computational Physics, 2019, 378: 686–707. doi: 10.1016/j.jcp.2018.10.045
    [25]
    MENG Xuhui, LI Zhen, ZHANG Dongkun, et al. PPINN: Parareal physics-informed neural network for time-dependent PDEs[J]. Computer Methods in Applied Mechanics and Engineering, 2020, 370: 113250. doi: 10.1016/j.cma.2020.113250
    [26]
    GOSWAMI S, ANITESCU C, CHAKRABORTY S, et al. Transfer learning enhanced physics informed neural network for phase-field modeling of fracture[J]. Theoretical and Applied Fracture Mechanics, 2020, 106: 102447. doi: 10.1016/j.tafmec.2019.102447
    [27]
    KARPATNE A, EBERT-UPHOFF I, RAVELA S, et al. Machine learning for the geosciences: Challenges and opportunities[J]. IEEE Transactions on Knowledge and Data Engineering, 2019, 31(8): 1544–1554. doi: 10.1109/TKDE.2018.2861006
    [28]
    CAMPS-VALLS G, REICHSTEIN M, ZHU Xiaoxiang, et al. Advancing deep learning for earth sciences: From hybrid modeling to interpretability[C]. IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, USA, 2020: 3979–3982. doi: 10.1109/IGARSS39084.2020.9323558.
    [29]
    REICHSTEIN M, CAMPS-VALLS G, STEVENS B, et al. Deep learning and process understanding for data-driven Earth system science[J]. Nature, 2019, 566(7743): 195–204. doi: 10.1038/s41586-019-0912-1
    [30]
    CAMPS-VALLS G, SVENDSEN D H, CORTÉS-ANDRÉS J, et al. Physics-aware machine learning for geosciences and remote sensing[C]. IEEE International Geoscience and Remote Sensing Symposium, Brussels, Belgium, 2021: 2086–2089. doi: 10.1109/IGARSS47720.2021.9554521.
    [31]
    JIA Xiaowei, WILLARD J, KARPATNE A, et al. Physics guided RNNs for modeling dynamical systems: A case study in simulating lake temperature profiles[C]. The 2019 SIAM International Conference on Data Mining, Calgary, Canada, 2019: 558–566. doi: 10.1137/1.9781611975673.63.
    [32]
DAW A, KARPATNE A, WATKINS W, et al. Physics-guided neural networks (PGNN): An application in lake temperature modeling[OL]. arXiv: 1710.11431, 2021.
    [33]
    BEUCLER T, PRITCHARD M, GENTINE P, et al. Towards physically-consistent, data-driven models of convection[C]. IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, USA, 2020: 3987–3990. doi: 10.1109/IGARSS39084.2020.9324569.
    [34]
    SHEN Huanfeng, JIANG Menghui, LI Jie, et al. Coupling model-driven and data-driven methods for remote sensing image restoration and fusion[OL]. arXiv: 2108.06073, 2021.
    [35]
    WANG Yuqing, WANG Qi, LU Wenkai, et al. Physics-constrained seismic impedance inversion based on deep learning[J]. IEEE Geoscience and Remote Sensing Letters, 2021: 1–5. doi: 10.1109/LGRS.2021.3072132
    [36]
    XIA Wenchao, ZHENG Gan, WONG K K, et al. Model-driven beamforming neural networks[J]. IEEE Wireless Communications, 2020, 27(1): 68–75. doi: 10.1109/MWC.001.1900239
    [37]
    ZHANG Juping, XIA Wenchao, YOU Minglei, et al. Deep learning enabled optimization of downlink beamforming under per-antenna power constraints: Algorithms and experimental demonstration[J]. IEEE Transactions on Wireless Communications, 2020, 19(6): 3738–3752. doi: 10.1109/TWC.2020.2977340
    [38]
    ZHU Xiaoxiang, MONTAZERI S, ALI M, et al. Deep learning meets SAR: Concepts, models, pitfalls, and perspectives[J]. IEEE Geoscience and Remote Sensing Magazine, in press. doi: 10.1109/MGRS.2020.3046356.
    [39]
    MALMGREN-HANSEN D, KUSK A, DALL J, et al. Improving SAR automatic target recognition models with transfer learning from simulated data[J]. IEEE Geoscience and Remote Sensing Letters, 2017, 14(9): 1484–1488. doi: 10.1109/LGRS.2017.2717486
    [40]
WEN Gongjian, ZHU Guoqiang, YIN Hongcheng, et al. SAR ATR based on 3D parametric electromagnetic scattering model[J]. Journal of Radars, 2017, 6(2): 115–135. doi: 10.12000/JR17034
    [41]
LUO Ying, NI Jiacheng, and ZHANG Qun. Synthetic aperture radar learning-imaging method based on data-driven technique and artificial intelligence[J]. Journal of Radars, 2020, 9(1): 107–122. doi: 10.12000/JR19103
    [42]
    CHAN T H, JIA Kui, GAO Shenghua, et al. PCANet: A simple deep learning baseline for image classification?[J]. IEEE Transactions on Image Processing, 2015, 24(12): 5017–5032. doi: 10.1109/TIP.2015.2475625
    [43]
    LI Mengke, LI Ming, ZHANG Peng, et al. SAR image change detection using PCANet guided by saliency detection[J]. IEEE Geoscience and Remote Sensing Letters, 2019, 16(3): 402–406. doi: 10.1109/LGRS.2018.2876616
    [44]
    WANG Rongfang, ZHANG Jie, CHEN Jiawei, et al. Imbalanced learning-based automatic SAR images change detection by morphologically supervised PCA-net[J]. IEEE Geoscience and Remote Sensing Letters, 2019, 16(4): 554–558. doi: 10.1109/LGRS.2018.2878420
    [45]
    CLOUDE S and POTTIER E. An entropy based classification scheme for land applications of polarimetric SAR[J]. IEEE Transactions on Geoscience and Remote Sensing, 1997, 35(1): 68–78. doi: 10.1109/36.551935
    [46]
    YAMAGUCHI Y, YAJIMA Y, and YAMADA H. A four-component decomposition of POLSAR images based on the coherency matrix[J]. IEEE Geoscience and Remote Sensing Letters, 2006, 3(3): 292–296. doi: 10.1109/LGRS.2006.869986
    [47]
    FERRO-FAMIL L, REIGBER A, and POTTIER E. Scene characterization using sub-aperture polarimetric interferometric SAR data[C]. IGARSS 2003-2003 IEEE International Geoscience and Remote Sensing Symposium, Toulouse, France, 2003: 702–704. doi: 10.1109/IGARSS.2003.1293889.
    [48]
    POTTER L C and MOSES R L. Attributed scattering centers for SAR ATR[J]. IEEE Transactions on Image Processing, 1997, 6(1): 79–91. doi: 10.1109/83.552098
    [49]
    JI Kefeng and WU Yonghui. Scattering mechanism extraction by a modified cloude-pottier decomposition for dual polarization SAR[J]. Remote Sensing, 2015, 7(6): 7447–7470. doi: 10.3390/rs70607447
    [50]
    YONEZAWA C, WATANABE M, and SAITO G. Polarimetric decomposition analysis of ALOS PALSAR observation data before and after a landslide event[J]. Remote Sensing, 2012, 4(8): 2314–2328. doi: 10.3390/rs4082314
    [51]
    NIU Shengren, QIU Xiaolan, LEI Bin, et al. Parameter extraction based on deep neural network for SAR target simulation[J]. IEEE Transactions on Geoscience and Remote Sensing, 2020, 58(7): 4901–4914. doi: 10.1109/TGRS.2020.2968493
    [52]
    NIU Shengren, QIU Xiaolan, LEI Bin, et al. A SAR target image simulation method with DNN embedded to calculate electromagnetic reflection[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14: 2593–2610. doi: 10.1109/JSTARS.2021.3056920
    [53]
    GUO Jiayi, LEI Bin, DING Chibiao, et al. Synthetic aperture radar image synthesis by using generative adversarial nets[J]. IEEE Geoscience and Remote Sensing Letters, 2017, 14(7): 1111–1115. doi: 10.1109/LGRS.2017.2699196
    [54]
    OH J and KIM M. PeaceGAN: A GAN-based multi-task learning method for SAR target image generation with a pose estimator and an auxiliary classifier[J]. Remote Sensing, 2021, 13(19): 3939. doi: 10.3390/rs13193939
    [55]
    CUI Zongyong, ZHANG Mingrui, CAO Zongjie, et al. Image data augmentation for SAR sensor via generative adversarial nets[J]. IEEE Access, 2019, 7: 42255–42268. doi: 10.1109/ACCESS.2019.2907728
    [56]
    SONG Qian, XU Feng, and JIN Yaqiu. SAR image representation learning with adversarial autoencoder networks[C]. IGARSS 2019-2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan, 2019: 9498–9501. doi: 10.1109/IGARSS.2019.8898922.
    [57]
    WANG Ke, ZHANG Gong, LENG Yang, et al. Synthetic aperture radar image generation with deep generative models[J]. IEEE Geoscience and Remote Sensing Letters, 2019, 16(6): 912–916. doi: 10.1109/LGRS.2018.2884898
    [58]
    HU Xiaowei, FENG Weike, GUO Yiduo, et al. Feature learning for SAR target recognition with unknown classes by using CVAE-GAN[J]. Remote Sensing, 2021, 13(18): 3554. doi: 10.3390/rs13183554
    [59]
    XIE You, FRANZ E, CHU Mengyu, et al. TempoGAN: A temporally coherent, volumetric GAN for super-resolution fluid flow[J]. ACM Transactions on Graphics, 2018, 37(4): 95.
    [60]
    CHU Mengyu, THUEREY N, SEIDEL H P, et al. Learning meaningful controls for fluids[J]. ACM Transactions on Graphics, 2021, 40(4): 100. doi: 10.1145/3450626.3459845
    [61]
    QIAN Jiang, HUANG Shaoyin, WANG Lu, et al. Super-resolution ISAR imaging for maneuvering target based on deep-learning-assisted time-frequency analysis[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 60: 5201514. doi: 10.1109/TGRS.2021.3050189
    [62]
    LIANG Jiadian, WEI Shunjun, WANG Mou, et al. ISAR compressive sensing imaging using convolution neural network with interpretable optimization[C]. IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, USA, 2020: 2483–2486. doi: 10.1109/IGARSS39084.2020.9323601.
    [63]
    GREGOR K and LECUN Y. Learning fast approximations of sparse coding[C]. 27th International Conference on Machine Learning, Haifa, Israel, 2010: 399–406.
    [64]
LIU Jialin, CHEN Xiaohan, WANG Zhangyang, et al. ALISTA: Analytic weights are as good as learned weights in LISTA[C]. The 7th International Conference on Learning Representations, New Orleans, USA, 2019: 1–33.
    [65]
    BEHRENS F, SAUDER J, and JUNG P. Neurally augmented ALISTA[C]. The 9th International Conference on Learning Representations, Virtual Event, Austria, 2021: 1–10.
    [66]
    YANG Yan, SUN Jian, LI Huibin, et al. Deep ADMM-Net for compressive sensing MRI[C]. The 30th International Conference on Neural Information Processing Systems, Barcelona, Spain, 2016: 10–18. doi: 10.5555/3157096.3157098.
    [67]
    YANG Yan, SUN Jian, LI Huibin, et al. ADMM-CSNet: A deep learning approach for image compressive sensing[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(3): 521–538. doi: 10.1109/TPAMI.2018.2883941
    [68]
    MASON E, YONEL B, and YAZICI B. Deep learning for SAR image formation[C]. SPIE 10201, Algorithms for Synthetic Aperture Radar Imagery XXIV, Anaheim, USA, 2017: 1020104. doi: 10.1117/12.2267831.
    [69]
GAO Jingkun, DENG Bin, QIN Yuliang, et al. Enhanced radar imaging using a complex-valued convolutional neural network[J]. IEEE Geoscience and Remote Sensing Letters, 2019, 16(1): 35–39. doi: 10.1109/LGRS.2018.2866567
    [70]
    HU Changyu, WANG Ling, LI Ze, et al. Inverse synthetic aperture radar imaging using a fully convolutional neural network[J]. IEEE Geoscience and Remote Sensing Letters, 2020, 17(7): 1203–1207. doi: 10.1109/LGRS.2019.2943069
    [71]
    ALVER M B, SALEEM A, and ÇETIN M. Plug-and-play synthetic aperture radar image formation using deep priors[J]. IEEE Transactions on Computational Imaging, 2021, 7: 43–57. doi: 10.1109/TCI.2020.3047473
    [72]
    WANG Mou, WEI Shunjun, LIANG Jiadian, et al. TPSSI-Net: Fast and enhanced two-path iterative network for 3D SAR sparse imaging[J]. IEEE Transactions on Image Processing, 2021, 30: 7317–7332. doi: 10.1109/TIP.2021.3104168
    [73]
    HU Changyu, LI Ze, WANG Ling, et al. Inverse synthetic aperture radar imaging using a deep ADMM network[C]. 20th International Radar Symposium (IRS), Ulm, Germany, 2019: 1–9. doi: 10.23919/IRS.2019.8768138.
    [74]
LI Xiaoyong, BAI Xueru, and ZHOU Feng. High-resolution ISAR imaging and autofocusing via 2D-ADMM-Net[J]. Remote Sensing, 2021, 13(12): 2326. doi: 10.3390/rs13122326
    [75]
    LI Ruize, ZHANG Shuanghui, ZHANG Chi, et al. Deep learning approach for sparse aperture ISAR imaging and autofocusing based on complex-valued ADMM-net[J]. IEEE Sensors Journal, 2021, 21(3): 3437–3451. doi: 10.1109/JSEN.2020.3025053
    [76]
    HU Xiaowei, XU Feng, GUO Yiduo, et al. MDLI-Net: Model-driven learning imaging network for high-resolution microwave imaging with large rotating angle and sparse sampling[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021: 1–17. doi: 10.1109/TGRS.2021.3110579
    [77]
    RATHA D, GAMBA P, BHATTACHARYA A, et al. Novel techniques for built-up area extraction from polarimetric SAR images[J]. IEEE Geoscience and Remote Sensing Letters, 2020, 17(1): 177–181. doi: 10.1109/LGRS.2019.2914913
    [78]
    AO Dongyang, DATCU M, SCHWARZ G, et al. Moving ship velocity estimation using TanDEM-X data based on subaperture decomposition[J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15(10): 1560–1564. doi: 10.1109/LGRS.2018.2846399
    [79]
LIAO Mingsheng, WANG Ru, YANG Mengshi, et al. Techniques and applications of spaceborne time-series InSAR in urban dynamic monitoring[J]. Journal of Radars, 2020, 9(3): 409–424. doi: 10.12000/JR20022
    [80]
    SICA F, GOBBI G, RIZZOLI P, et al. Φ-Net: Deep residual learning for InSAR parameters estimation[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 59(5): 3917–3941. doi: 10.1109/TGRS.2020.3020427
    [81]
    SONG Qian, XU Feng, and JIN Yaqiu. Radar image colorization: Converting single-polarization to fully polarimetric using deep neural networks[J]. IEEE Access, 2018, 6: 1647–1661. doi: 10.1109/ACCESS.2017.2779875
    [82]
ZHAO Juanping, DATCU M, ZHANG Zenghui, et al. Contrastive-regulated CNN in the complex domain: A method to learn physical scattering signatures from flexible PolSAR images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2019, 57(12): 10116–10135. doi: 10.1109/TGRS.2019.2931620
    [83]
    QU Junrong, QIU Xiaolan, and DING Chibiao. A study of recovering POLSAR information from single-polarized data using DNN[C]. IEEE International Geoscience and Remote Sensing Symposium, Brussels, Belgium, 2021: 812–815. doi: 10.1109/IGARSS47720.2021.9554304.
    [84]
    CHENG Zezhou, YANG Qingxiong, and SHENG Bin. Deep colorization[C]. The IEEE International Conference on Computer Vision, Santiago, Chile, 2015: 415–423. doi: 10.1109/ICCV.2015.55.
    [85]
    LUAN Fujun, PARIS S, SHECHTMAN E, et al. Deep photo style transfer[C]. The IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 6997–7005. doi: 10.1109/CVPR.2017.740.
    [86]
    JI Guang, WANG Zhaohui, ZHOU Lifan, et al. SAR image colorization using multidomain cycle-consistency generative adversarial network[J]. IEEE Geoscience and Remote Sensing Letters, 2021, 18(2): 296–300. doi: 10.1109/LGRS.2020.2969891
    [87]
    TUPIN F and TISON C. Sub-aperture decomposition for SAR urban area analysis[C]. European Conference on Synthetic Aperture Radar (EUSAR), Ulm, Germany, 2004: 431–434.
    [88]
    BOVENGA F, DERAUW D, RANA F M, et al. Multi-chromatic analysis of SAR images for coherent target detection[J]. Remote Sensing, 2014, 6(9): 8822–8843. doi: 10.3390/rs6098822
    [89]
    SPIGAI M, TISON C, and SOUYRIS J C. Time-frequency analysis in high-resolution SAR imagery[J]. IEEE Transactions on Geoscience and Remote Sensing, 2011, 49(7): 2699–2711. doi: 10.1109/TGRS.2011.2107914
    [90]
    FERRO-FAMIL L, REIGBER A, POTTIER E, et al. Scene characterization using subaperture polarimetric SAR data[J]. IEEE Transactions on Geoscience and Remote Sensing, 2003, 41(10): 2264–2276. doi: 10.1109/TGRS.2003.817188
    [91]
HUANG Zhongling, DATCU M, PAN Zongxu, et al. HDEC-TFA: An unsupervised learning approach for discovering physical scattering properties of single-polarized SAR image[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 59(4): 3054–3071. doi: 10.1109/TGRS.2020.3014335
    [92]
    HUANG Zhongling, DATCU M, PAN Zongxu, et al. A hybrid and explainable deep learning framework for SAR images[C]. IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, USA, 2020: 1727–1730. doi: 10.1109/IGARSS39084.2020.9323845.
    [93]
    DE S, CLANTON C, BICKERTON S, et al. Exploring the relationships between scattering physics and auto-encoder latent-space embedding[C]. IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, USA, 2020: 3501–3504. doi: 10.1109/IGARSS39084.2020.9323410.
    [94]
    HUANG Zhongling, YAO Xiwen, DUMITRU C O, et al. Physically explainable CNN for SAR image classification[OL]. arXiv: 2110.14144, 2021.
    [95]
    ZHANG Jinsong, XING Mengdao, and XIE Yiyuan. FEC: A feature fusion framework for SAR target recognition based on electromagnetic scattering features and deep CNN features[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 59(3): 2174–2187. doi: 10.1109/TGRS.2020.3003264
    [96]
    LEI Songlin, QIU Xiaolan, DING Chibiao, et al. A feature enhancement method based on the sub-aperture decomposition for rotating frame ship detection in SAR images[C]. IEEE International Geoscience and Remote Sensing Symposium, Brussels, Belgium, 2021: 3573–3576. doi: 10.1109/IGARSS47720.2021.9553635.
    [97]
    THEAGARAJAN R, BHANU B, ERPEK T, et al. Integrating deep learning-based data driven and model-based approaches for inverse synthetic aperture radar target recognition[J]. Optical Engineering, 2020, 59(5): 051407. doi: 10.1117/1.OE.59.5.051407
    [98]
    HORI C, HORI T, LEE T Y, et al. Attention-based multimodal fusion for video description[C]. The IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017: 4203–4212. doi: 10.1109/ICCV.2017.450.
    [99]
    PORIA S, CAMBRIA E, BAJPAI R, et al. A review of affective computing: From unimodal analysis to multimodal fusion[J]. Information Fusion, 2017, 37: 98–125. doi: 10.1016/j.inffus.2017.02.003
    [100]
    HUANG Zhongling, DUMITRU C O, and REN Jun. Physics-aware feature learning of SAR images with deep neural networks: A case study[C]. IEEE International Geoscience and Remote Sensing Symposium, Brussels, Belgium, 2021: 1264–1267. doi: 10.1109/IGARSS47720.2021.9554842.
    [101]
    LEE J S, GRUNES M R, AINSWORTH T L, et al. Unsupervised classification using polarimetric decomposition and the complex Wishart classifier[J]. IEEE Transactions on Geoscience and Remote Sensing, 1999, 37(5): 2249–2258. doi: 10.1109/36.789621
    [102]
    RATHA D, BHATTACHARYA A, and FRERY A C. Unsupervised classification of PolSAR data using a scattering similarity measure derived from a geodesic distance[J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15(1): 151–155. doi: 10.1109/LGRS.2017.2778749
    [103]
    LI Yi, DU Lan, and WEI Di. Multiscale CNN based on component analysis for SAR ATR[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021: 1–12. doi: 10.1109/TGRS.2021.3100137
    [104]
    FENG Sijia, JI Kefeng, ZHANG Linbin, et al. SAR target classification based on integration of ASC parts model and deep learning algorithm[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14: 10213–10225. doi: 10.1109/JSTARS.2021.3116979
    [105]
    LIU Qingshu and LANG Liang. MMFF: Multi-manifold feature fusion based neural networks for target recognition in complex-valued SAR imagery[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2021, 180: 151–162. doi: 10.1016/j.isprsjprs.2021.08.008
    [106]
    LIU Jiaming, XING Mengdao, YU Hanwen, et al. EFTL: Complex convolutional networks with electromagnetic feature transfer learning for SAR target recognition[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021: 1–11. doi: 10.1109/TGRS.2021.3083261
    [107]
    CUI Yuanhao, LIU Fang, JIAO Licheng, et al. Polarimetric multipath convolutional neural network for PolSAR image classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021: 1–18. doi: 10.1109/TGRS.2021.3071559
    [108]
    DAW A, THOMAS R Q, CAREY C C, et al. Physics-guided architecture (PGA) of neural networks for quantifying uncertainty in lake temperature modeling[C]. The 2020 SIAM International Conference on Data Mining (SDM), Cincinnati, USA, 2020: 532–540.
    [109]
    SUN Jian, NIU Zhan, INNANEN K A, et al. A theory-guided deep-learning formulation and optimization of seismic waveform inversion[J]. Geophysics, 2020, 85(2): R87–R99. doi: 10.1190/geo2019-0138.1
    [110]
    HE Qishan, ZHAO Lingjun, JI Kefeng, et al. SAR target recognition based on task-driven domain adaptation using simulated data[J]. IEEE Geoscience and Remote Sensing Letters, 2021: 1–5. doi: 10.1109/LGRS.2021.3116707
    [111]
    ZHANG Linbin, LENG Xiangguang, FENG Sijia, et al. Domain knowledge powered two-stream deep network for few-shot SAR vehicle recognition[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021: 1–15. doi: 10.1109/TGRS.2021.3116349
    [112]
    AGARWAL T, SUGAVANAM N, and ERTIN E. Sparse signal models for data augmentation in deep learning ATR[C]. IEEE Radar Conference, Florence, Italy, 2020: 1–6. doi: 10.1109/RadarConf2043947.2020.9266382.
    [113]
DIEMUNSCH J R and WISSINGER J. Moving and stationary target acquisition and recognition (MSTAR) model-based automatic target recognition: Search technology for a robust ATR[C]. Proceedings of SPIE 3370, Algorithms for Synthetic Aperture Radar Imagery V, Orlando, USA, 1998: 481–492. doi: 10.1117/12.321851.
    [114]
HUANG Lanqing, LIU Bin, LI Boying, et al. OpenSARShip: A dataset dedicated to Sentinel-1 ship interpretation[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2018, 11(1): 195–208. doi: 10.1109/JSTARS.2017.2755672
    [115]
SUN Xian, WANG Zhirui, SUN Yuanrui, et al. AIR-SARShip-1.0: High-resolution SAR ship detection dataset[J]. Journal of Radars, 2019, 8(6): 852–862. doi: 10.12000/JR19097
    [116]
DU Lan, WANG Zhaocheng, WANG Yan, et al. Survey of research progress on target detection and discrimination of single-channel SAR images for complex scenes[J]. Journal of Radars, 2020, 9(1): 34–54. doi: 10.12000/JR19104
    [117]
    CHEN Siwei and TAO Chensong. PolSAR image classification using polarimetric-feature-driven deep convolutional neural network[J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15(4): 627–631. doi: 10.1109/LGRS.2018.2799877
    [118]
LIU Xu, JIAO Licheng, TANG Xu, et al. Polarimetric convolutional network for PolSAR image classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2019, 57(5): 3040–3054. doi: 10.1109/TGRS.2018.2879984
    [119]
BI Haixia, SUN Jian, and XU Zongben. A graph-based semisupervised deep learning model for PolSAR image classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2019, 57(4): 2116–2132. doi: 10.1109/TGRS.2018.2871504
    [120]
    VINAYARAJ P, SUGIMOTO R, NAKAMURA R, et al. Transfer learning with CNNs for segmentation of PALSAR-2 power decomposition components[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2020, 13: 6352–6361. doi: 10.1109/JSTARS.2020.3031020
    [121]
    XIA Junshi, YOKOYA N, ADRIANO B, et al. A benchmark high-resolution GaoFen-3 SAR dataset for building semantic segmentation[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14: 5950–5963. doi: 10.1109/JSTARS.2021.3085122
    [122]
    WU Fan, WANG Chao, ZHANG Hong, et al. Built-up area mapping in China from GF-3 SAR imagery based on the framework of deep learning[J]. Remote Sensing of Environment, 2021, 262: 112515. doi: 10.1016/j.rse.2021.112515
    [123]
    CHEN Jiankun, QIU Xiaolan, DING Chibiao, et al. CVCMFF Net: Complex-valued convolutional and multifeature fusion network for building semantic segmentation of InSAR images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021: 1–14. doi: 10.1109/TGRS.2021.3068124
    [124]
    SHI Xianzheng, FU Shilei, CHEN Jin, et al. Object-level semantic segmentation on the high-resolution Gaofen-3 FUSAR-map dataset[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14: 3107–3119. doi: 10.1109/JSTARS.2021.3063797
    [125]
QIU Xiaolan, JIAO Zekun, PENG Lingxiao, et al. SARMV3D-1.0: Synthetic aperture radar microwave vision 3D imaging dataset[J]. Journal of Radars, 2021, 10(4): 485–498. doi: 10.12000/JR21112