Skip to main content

A convolutional neural network to optimize multi-mission satellite altimeter fusion for improving the marine gravity field

Abstract

Satellite altimetry is the main tool for constructing global or regional marine gravity fields. To improve the accuracy and spatial resolution, it is necessary to fuse multi-mission altimeters. How to determine the weights of multi-mission altimeters is a crucial issue, making the conventional calculation process very complex. In addition, traditional satellite inversion methods are often independent of shipborne gravity, which is used only as validation data, thus not take full advantages of high accuracy and resolution of shipborne gravity. In this study, we introduce a convolutional neural network (CNN) to merge the vertical deflections (DOVs) obtained from multi-altimeter missions to construct a marine gravity model in the South China Sea. High-accuracy shipborne gravity and a dataset comprising DOVs and geo-locations are employed as input data for neural network training. For the validation of CNN method, the gravity model is also computed by conventional Inverse Vening Meinesz (IVM) method. Independent shipborne gravity measurements and SIO V32.1, DTU17 models are used as validation data. The evaluation results show that the CNN-derived model achieves a higher level of accuracy, yielding a standard deviation (STD) of 3.21 mGal, with an improvement of 36.56% compared to IVM-derived model. More than 92% of the differences between the CNN-derived model and shipborne gravity are less than 5 mGal. In addition, spectral analysis results further show that the CNN-derived model has stronger energy at short wavelengths (less than 25 km) compared to other models. These findings reveal that CNN method is feasible for marine gravity recovery and the CNN-derived model can achieve higher accuracy. The CNN method can improve the accuracy and spectral characteristics of the constructed gravity model by taking advantage of the high accuracy and high resolution of shipborne gravity.

Graphical Abstract

1 Introduction

Marine gravity field serves as the essential data for ocean observations, reflecting the distribution, movement, and density changes of materials under the ocean. Accurate knowledge of marine gravity field is crucial for understanding the distribution of marine resources, submarine topography and tectonics, as well as underwater vehicle navigation, benefiting science and society. Marine gravity can be obtained mainly by satellite altimetry and shipborne gravity measurements. Satellite altimetry can provide rapid access to global sea surface heights (SSHs), and measure the shape of the Earth and the geoid, thereby to obtain the global marine gravity field. Over the past decades, thanks to the accumulation of multi-source altimeter data, satellite altimetry has developed into an essential tool for constructing global or regional marine gravity models (Andersen and Knudsen 1998; Haxby et al. 1983; Hwang 1998; Sandwell and Smith 2009). Shipborne gravity measurements offer advantages in accuracy and resolution, and are an important aid in obtaining marine gravity. At present, to further enhance the accuracy and resolution of marine gravity, it is necessary to maximize the integration of multi-source data, including altimeter data and shipborne gravity measurements, while taking advantage of the strengths of each data.

The spatial track density, range precision and trajectory diversity of altimeter are the primary factors affecting the spatial resolution and accuracy of altimetry-derived marine gravity. Combining multi-mission altimeter data, especially from geodetic missions (GMs), is an effective way to improve the spatial density, trajectory diversity and volume of SSHs (Andersen et al. 2010, 2021; Sandwell et al. 2021; Zhu et al. 2020). Different altimeters have different levels of accuracy. When combining multi-mission altimeter data, how to determine appropriate weights for each mission to achieve an optimal result is particularly important. It has been an indispensable component in the study of marine gravity recovery.

Sandwell and Smith (1997) applied an a priori estimate of along-track slope error, to weight the Geosat and ERS-1/GM data. Hwang (1998) weighted the altimeter data from Seasat, Geosat ERS-1 and TOPEX/POSEIDON (T/P) based on the standard deviations (STDs) of mean geoid gradients, and derived a global marine gravity model by empirical variances function and remove–compute–restore (RCR) technology. This method has been used to weight multi-source altimeter data, including new GM data from SARAL/AltiKa/Drifting Phase (AltiKa/DP), Cryosat-2, Jason-1/2 and HY-2A since 2010 (Hwang et al. 2002; Yu et al. 2021; Yu and Hwang 2022). Different observation modes and observation bands of the altimeter complicate the weighting strategy. Zhu et al. (2020) proposed a new iterative method to estimate the weight of the Ka-band AltiKa altimeter. Yu and Hwang (2022) employed the theory of minimum norm quadratic unbiased estimator to calibrate the error variances of geoid gradients from Cryosat-2 and Jason-1/2 missions and calculated the gravity field in the Gulf of Mexico. These studies indicate that the process of weights determination and combination of multi-mission altimetry data is very complicated. Moreover, the covariance function sometimes fails to fully reflect the error characteristics of each satellite, the resulting weights do not achieve the optimal results.

The main approaches for marine gravity recovery from multi-mission altimeters include the inverse Stokes integral, the inverse Vening Meinesz (IVM) formula, the Laplace equation, and the least squares collocation (LSC) method. The inverse Stokes integral is appliable for marine gravity recovery from geoid heights (Wang 1999). The IVM method and the Laplace equation depend on vertical deflections (DOVs) as data source (Hwang 1998; Sandwell and Smith 1997). The LSC method is based on statistical theory and can be used for geoid heights or DOVs. This method relies on a precise covariance matrix between the geoid heights or DOVs and the gravity anomalies, which is difficult to determine exactly. So, it has been used relatively infrequently (Andersen and Knudsen 1998; Olgiati et al. 1995). DOVs are the slope of the geoid in the north–south and east–west directions. Theoretically, due to differential computation, DOVs are less contaminated by long-wavelength radial orbit error, while containing more short-wavelength signals, resulting in that IVM and Laplace equation methods are widely used in gravity recovery (Hwang et al. 2002; Sandwell and Smith 2009; Zhang et al. 2021). However, these common traditional methods rely on strict functional relationship between geoid or DOVs and gravity anomalies, and are often independent of shipboard gravity measurements, which are used only as validation data. This simplifies the complex correlational relationship between geoids or DOVs and gravity anomalies and prevents these methods from taking full advantage of the high accuracy and resolution of shipborne gravity. In this study, we attempt to propose a new inversion method, based on neural networks, which using DOVs as data source.

Neural networks are applicable to simplify the process of gravity recovery from multi-source altimeters and shipborne gravity, due to their powerful massive data processing and nonlinear fitting capabilities. They have been successfully applied in seismic detection and location (Perol et al. 2018), noise suppression (Dong et al. 2020; Zhao et al. 2018), and bathymetry prediction (Annan and Wan 2022; Li et al. 2023; Sun et al. 2022). Neural networks can map various geophysical transfer functions and fuse data from multiple sources to accomplish regression tasks efficiently with high accuracy. Convolutional neural network (CNN) is one of the most widely used neural network model, performing well in extracting highly non-linear features from images, which is challenging for traditional physical algorithms. Therefore, it is possible to adopt CNN to recover the marine gravity by combining multi-mission altimeters without complex weight determination.

In this paper, we attempt to construct the marine gravity field in South China Sea (SCS) by CNN method and RCR technology. CNN has been utilized to recover gravity data in the western Pacific Ocean from satellite altimetry data (Zhu et al. 2023). Distinguished from other studies, we focus on the ability of the CNN method to simplify the fusion of multi-satellite datasets. We derive gravity in short waveband from CNN rather than absolute gravity, since altimeter data can improve the short-wavelength gravity field primarily. We also use high-resolution shipborne gravity measurements to assess the actual resolution of the CNN inversion results compared to other models. Specifically, we first calculate multi-source altimetry-derived DOVs from Altika/DP, CryoSat-2, Jason-2/GM, HY-2A/GM, and Jason-1/GM in SCS. Then, we design a CNN using DOVs and geo-locations as input data, and shipborne gravity measurements as training data. The shipborne gravity measurements require quality control and band separation. Finally, we construct a 1′ × 1′ marine gravity model by the designed CNN and evaluate its accuracy by comparing with conventional IVM method and other gravity data.

2 Study area and data

2.1 Study area

The SCS is located at the junction of the Eurasian plate, Indo-Australian plate, and Philippine Sea plate. As one of the marginal seas of western Pacific, its relatively complex seafloor topography records the formation and evolution of the whole marginal sea, which provides a natural geological laboratory for the study of continental rifting and seafloor spreading processes. SCS is rich in tectonic units, including islands, trenches, basins, and submarine plateaus, and so on (as shown in Fig. 1a). The marine gravity field of SCS is the basic data for crustal structure and tectonic unit research. Therefore, the region of SCS within 13°–18° N, 110°–120° E is selected as the study area to recover the regional marine gravity field using multi-altimeter data sets (Fig. 1a and the red rectangle in the upper right). Considering the edge effect, the region expands outward by 1° during the calculation process (black dashed rectangle in the upper right of Fig. 1a). The region has a large topographic relief, with water depths ranging from 0 to 5000 m. The landforms are characterized by their complexity and diversity, encompassing seamounts, sea basins, trenches, and other features. These characteristics provide a comprehensive validation of the neural network method’s efficacy in gravity recovery.

Fig. 1
figure 1

Bathymetry of the study area (a) and the distribution of shipborne gravity. (b) Red dots and blue dots in figure b) denote calculate points (which are used for training the neural network) and check points (which are excluded from training), respectively

2.2 Datasets used

2.2.1 Shipborne gravity data

As training sample, the collection of shipborne gravity data is the first priority. The shipborne gravity data used in this study are available from the National Centers for Environmental Information (NCEI) and SeaDataNet. The main types of features we use to train the recovery are longitude, latitude, and free-air gravity anomaly. The free-air anomalies have been adjusted for drift correction, lag time and Eötvös correction. The total number of shipborne gravity points is 80,899, and their distribution is shown in Fig. 1b.

For quality control, it is necessary to edit the shipborne gravity first to remove incorrect measurements (see Sect. 3.1). Then to train a neural network model, approximately 80% (64,720 data points illustrated as red dots in Fig. 1b) are selected evenly as calculate points to train the neural network. These data are processed to get the short-wavelength component. During the training process, 80% of the calculate points is randomly selected as the training set and 20% as the test set to iteratively optimize the model. The remaining data points (16,179 points depicted as blue dots in Fig. 1b) are independent of the calculation process and used as check points to evaluate the altimeter-derived gravity models.

2.2.2 Altimetry data

Up to now, there are several satellite altimeters providing numerous SSHs data of GMs, including Geosat/GM, ERS-1/GM, Jason-1/GM, CryoSat-2, HY-2A/GM, Altika/DP, and Jason-2/GM. Whether all geodetic missions still make great contributions to the marine gravity recovery is a concern for simplifying the gravity construction processes. Sandwell et al. (2021) assessed the contribution of each altimeter GM to the marine gravity recovery, and found that Geosat/GM and ERS-1/GM provide minor improvement. Therefore, SSHs data from Geosat/GM and ERS-1/GM are excluded from this study, and data from Altika/DP, HY-2A/GM, CryoSat-2, Jason-1/GM and Jason-2/GM are employed, and the information on the GMs used are shown in Table 1.

Table 1 Information of altimeter data used in this study

We mainly use the 1 Hz along-track non-time critical Level-2 Plus (L2P) products, which can be accessed through AVISO. The L2P products are results of further processing of Level 2 altimetry data (geophysical data records, GDRs). L2P products for all altimetry missions have a uniform format and content (CNES 2020). The reference datum used is WGS-84, which is similar with shipborne gravity data. For Jason-2/GM, the 1-Hz GDRs in Ku band are adopted due to the less measured SSHs from L2P. GDRs have a different reference ellipsoid with equatorial radius of 6378.1363 km and flattening coefficient of 1/298.257 from that of WGS 84, which should be transformed before calculating the DOVs (Jin et al. 2008; Rapp et al. 1994). In the end, we use the SSHs after data editing and various geophysical corrections.

2.2.3 Dynamic topography model and reference gravity field

Given the effect of sea surface topography, a dynamic topography model is considered to interpolate and remove from altimeter-measured SSHs before calculating DOVs. The mean dynamic topography (MDT) model MDT-CNES-CLS18 from AVISO is employed in this study. MDT-CNES-CLS18 model is an estimate of the mean sea surface height above geoid over the 1993–2012 period, with a global geographic coverage and resolution of 0.125°.

To derive gravity in a certain waveband, RCR technique is introduced to separate the gravity field into the long-wavelength reference field and the short-wavelength residual field (Forsberg 1984). In this study, the Earth Gravitational Model 2008 (EGM2008) is chosen as the reference gravity field, which is one of the widely used combined global gravity field models (Andersen and Knudsen 2019; Pavlis et al. 2012; Sandwell and Smith 2009; Zhang et al. 2017; Zhu et al. 2020). EGM2008 is a spherical harmonic model of the Earth’s gravitational potential, which is completed to degree and order 2159, and contains additional coefficients up to degree 2190 and order 2159 (Pavlis et al. 2012). By removing the MDT-CNES-CLS18 model and EGM2008-derived geoid from the SSHs, residual geoid heights can be obtained, which are the key data for DOVs calculation.

2.2.4 Gravity data for validation

To evaluate the accuracy of marine gravity models using external data is an important part of marine gravity recovery. The models of V32.1 from Scripps Institution of Oceanography (SIO), DTU17 from DTU space (DTU) are introduced to verify the recovered gravity model. They adopted different multi-satellite altimeter dataset and data processing method that ensure independence from recovered model. SIO V32.1 adopted new altimeter data from Envisat, Cryosat-2, Jason-1/2, Altika and Sentinel-3A/B. Advanced data processing methods such as retracking waveforms, slope correction, biharmonic spline interpolation, multiple filtering, and Laplace equation are employed for gravity recovery. As a result, it has a spatial resolution up to 1′ × 1′ and an accuracy up to 1–2 mGal in some regions (Sandwell et al. 2014, 2021). DTU17 employed the altimeter data from Geosat, ERS-1, Cryosat-2, Jason-1 and Altika, which have been processed with double waveform retracking method to improve the range accuracy, especially in coastal and polar regions. And it calculated the marine gravity by inverse stokes formula from altimeter measured geoid (Andersen and Knudsen 2019). It is widely recognized that these marine gravity models have high spatial resolution and accuracy, and are typical marine gravity models in the world, which are used as a comparative standard for accuracy assessment.

3 Methods

In this study, the altimeter-measured SSHs after geophysical correction obtained from satellite missions such as Altika/DP, CryoSat-2, Jason-2/GM, HY-2A/GM and Jason-1/GM are used. Two different data processing strategies are employed to compute the gravity anomalies in the study area. The performance of the CNN technique is evaluated by comparing the results, as illustrated in Fig. 2.

Fig. 2
figure 2

The data processing strategy in this study

In the first processing strategy, the residual DOV components of each satellite are calculated separately by along-track LSC method (Hwang and Parsons 1995), after subtracting the long-wavelength reference field and the effect of sea surface topography. Then the CNN technique is applied to each residual DOV components to determine the residual gravity in the study area. Finally, by restoring the long-wavelength gravity from EGM2008, we get the gravity anomaly model named GA_CNN.

The second processing strategy follows a common approach in gravity recovery from altimeter. Firstly, the along-track residual geoid gradients of each satellite are calculated by differentiation, after subtracting the long-wavelength reference field and the effect of sea surface topography. These residual geoid gradients are then fused using appropriate weights to compute the gridded residual DOV by LSC. The weights are determined by the precision of geoid gradients, which is derived from crossover discrepancies of SSHs after the crossover adjustment. The details followed the method proposed by Zhu et al. (2020). The residual DOV is then converted to residual gravity by IVM formula. Finally, by restoring the long-wavelength gravity from EGM2008, a gravity anomaly model named GA_IVM is obtained.

To assess the performance of CNN method, the results from both data processing approaches are compared with shipborne gravity data and other gravity models.

3.1 Data preprocessing

For altimeter data, we begin with the unifying of the reference ellipsoids of L2Ps and GDRs. Specifically, the Jason-2/GM SSHs are converted to those in WGS84 using the method described by Rapp et al. (1994) and Jin et al. (2008). To suppress the high-frequency noise caused by surface temporal variability, a Gaussian low-pass filter is applied to each altimeter SSH. The convolution window radius is identified as 7 km, considering that the distance between adjacent points of the 1 Hz along-track SSH is about 7 km. Subsequently, WGS84-based multi-satellite altimeter-measured SSHs are ready for next DOV gridding.

For shipborne gravity data, the systematic errors between each survey line should be eliminated primarily. Although the Eötvös and drift corrections have been applied to the released shipborne gravity, there may still be systematic errors between different cruises, due to the different measuring organization, instruments, and vessels and so on (Wessel and Watts 2012). To enhance the quality of shipborne gravity data, we employ the gravity anomalies from EGM2008 as a priori model for quality control. The reference ellipsoid for both the EGM2008 and shipborne gravity is WGS84. We first compute the differences between EGM2008 and the shipborne gravity data at measuring points. Then the mean and STD of these differences are counted. Gravity data with differences greater than three times STD are eliminated. If the mean value is far from 0, it indicates that there is a systematic error. And it is necessary to correct the systematic error by adding the average value to each measurement cruise. In the study area, due to the old age and poor accuracy of many shipborne gravity measurements, systematic errors of some cruises need to be eliminated segment by segment, among which the whole routes of RC2614 and KH7605 are not employed. In this study area, there are a total of 84,698 shipborne gravity observations. After quality control, about 4.49% of the observations are deleted as outliers and 80,899 points are retained for subsequent calculations.

In addition, according to the RCR technique, the gravitational field can be divided into long-wavelength reference field and short-wavelength residual field, as shown in Eq. 1.

$$\Delta g\left( {x,y} \right) = \Delta g_{{\text{res}}} \left( {x,y} \right) + \Delta g_{{\text{EGM}}2008} \left( {x,y} \right),$$
(1)

where the \(\Delta g_{{\text{res}}} \left( {x,y} \right)\) represents the short-wavelength residual gravity field, \(\Delta g_{{\text{EGM}}2008} \left( {x,y} \right)\) is the long-wavelength reference gravity obtained from EGM2008.

In CNN method, the short-wavelength component of shipborne gravity at training points is employed as the feature. Before training the neural network, EGM2008 gravity anomalies at training points are subtracted from the shipborne gravity data to obtain the residual gravity anomalies.

3.2 DOV gridding

The components of DOV are the gradients of the geoid resolved in north–south and east–west directions, which can be calculated by along-track differentiation and LSC method (Hwang and Parsons 1995). Computation of DOV through along-track differentiation can suppress long wavelength errors of SSHs and avoid the complex calculations of crossover analysis (Hwang and Parsons 1995; Zhu et al. 2020). Along-track DOV is the negative of the along-track geoid gradients, which can be calculated by differentiating the geoid of two adjacent points with respect to their spherical distance, as follows

$$e_{PQ} = - \frac{{\left( {{\text{SSH}}_Q - {\text{MDT}}_Q } \right) - \left( {{\text{SSH}}_P - {\text{MDT}}_P } \right)}}{{{\text{dist}}_{PQ} }},$$
(2)

where \(e_{PQ}\) is the along-track DOV between two adjacent points P and Q. \({\text{dist}}_{PQ}\) represents the spherical distance between points P and Q. SSHP, SSHQ, MDTP and MDTQ are the adjusted SSH at points P and Q, and the sea surface topography at points P and Q from MDT-CNES-CLS18 model, respectively.

With reference to the RCR technique summarized in Fig. 2, it is a key step to calculate the residual DOV components on regular grid and use them to calculate the residual gravity anomaly. The residual DOV can be obtained by residual along-track geoid gradients, after removing the reference geoid gradients of EGM2008 from along-track geoid gradients. Then the LSC method is applied to calculate gridded residual DOV components by Zhu et al. (2020)

$$\left( {\begin{array}{*{20}c} {\xi_{{\text{res}}} } \\ {\eta_{{\text{res}}} } \\ \end{array} } \right) = \left( {\begin{array}{*{20}c} {C_{\xi e} } \\ {C_{\eta e} } \\ \end{array} } \right)\left( {C_{ee} + C_{nn} } \right)^{ - 1} e_{{\text{res}}} ,$$
(3)

where \(e_{{\text{res}}}\) is the residual geoid gradient. \(\eta_{{\text{res}}}\) and \(\xi_{{\text{res}}}\) are residual east–west and north–south components of DOVs. The covariance matrices \(C_{ee}\), \(C_{\xi e}\) and \(C_{\eta e}\) are for \(e_{{\text{res}}} - e_{{\text{res}}}\), \(\xi_{{\text{res}}} - e_{{\text{res}}}\) and \(\eta_{{\text{res}}} - e_{{\text{res}}}\), respectively. The covariance matrices are obtained from Tscherning–Rapp degree variance Model 4 (Tscherning and Rapp 1974) and coefficients errors of EGM2008. \(C_{nn}\) is the matrix of noise variances of geoid gradients, which can be obtained by the method proposed by Hwang and Parsons (1995) and Zhu et al. (2020).

3.3 Inverse Vening Meinesz formula

Gravity anomalies can be recovered from DOVs by the IVM formula, which was presented by Hwang et al. (2002):

$$\Delta g_P = \frac{\gamma_0 }{{4\pi }}\iint\limits_{\sigma } {H^{\prime}\left( {\xi_Q \cos \alpha_{QP} + \eta_Q \sin \alpha_{QP} } \right){\text{d}}\sigma_{QP} ,}$$
(4)

where \(\Delta g_P\) represents the gravity anomaly at point P. \(\gamma_0\) is the normal gravity based on WGS-84. \(\eta_Q\) and \(\xi_Q\) are the east–west and north–south components of the DOV at point Q, respectively. \(\alpha_{QP}\) is the azimuth from Q to P. H′ is the kernel function defined as

$$H^{^{\prime}\left( {\psi_{PQ} } \right)} = - \frac{{\cos \frac{{\psi_{PQ} }}{2}}}{{2\sin^2 \frac{{\psi_{PQ} }}{2}}} + \frac{{\cos \frac{{\psi_{PQ} }}{2}\left( {3 + 2\sin \frac{{\psi_{PQ} }}{2}} \right)}}{{2\sin \frac{{\psi_{PQ} }}{2}\left( {1 + \sin \frac{{\psi_{PQ} }}{2}} \right)}}.$$
(5)

\(\psi_{PQ}\) is the spherical distance between Q and P. Considering that the residual components of DOVs are given in regular grid, \(\Delta g_P\) can be calculated by 1-D fast Fourier transform (1D FFT) (Hwang 1998). \(\psi_{PQ}\) cannot be 0, or it will cause singularity of kernel function. The innermost zone effect on gravity anomaly should be considered and are computed by \(\Delta g_i = \frac{s_0 \gamma_0 }{2}\left( {\xi_y + \eta_x } \right)\), where \(\xi_y = \partial \xi /\partial y\) and \(\eta_x = \partial \eta /\partial x\) are the north derivative of \(\xi\) and the east derivative of \(\eta\), respectively; \(s_0\) is the radius of the innermost zone, which can be approximated from the grid intervals as: \(s_0 = \sqrt {{\frac{\Delta x\Delta y}{\pi }}}\).

3.4 Convolutional neural network used for gravity recovery

In this study, a multilayer convolutional neural network (CNN) model is constructed to recover residual gravity using multi-altimeter DOVs and shipborne gravity. Its architecture is shown in Fig. 3. The input data is north–south and east–west components of DOVs from Altika/DP, Jason-2/GM, CryoSat-2, HY-2A/GM and Jason-1/GM, respectively, for a total of 10 components. Since the CNN input is an image, the smaller and more pixels of the image, the more it helps to extract the detailed features of the image and improve the performance of the neural network. So, each grid of DOVs is sliced into smaller grids using a 4 × 4 sliding window. That is the DOVs on 1′ × 1′ grid are resampled to 0.251′ × 0.25′. Therefore, each 4 × 4 grid is an equivalent of 1′ spatial resolution. Because the gravity anomaly is an integral calculation of the DOVs in the surrounding region, employing more surrounding points provide more information for gravity determination. Considering the integration radius and the performance of computers, 32 × 32 surrounding points are chosen. Since the positions of the calculation points also affect the gravity field, longitude and latitude are considered as input features. All these input features are 3D-concatenated to form a 32 × 32 × 12 input image.

Fig. 3
figure 3

CNN architecture used for gravity recovery

In addition to the input images and output result, the CNN architecture is mainly composed of seven blocks. The first three blocks are similar and consist of convolution, batch normalization, max pooling and rectified linear unit (ReLU) layers, respectively. The convolution layer conducts convolutional operations on the input images to extract features such as edges, textures and shapes. The batch normalization layer normalizes the extracted features of each layer to ensure a similar feature distribution and enhance stability. The max pooling layer reduces the dimensionality of the data and preserves the most important information. The ReLU layer applies nonlinear transformations to the output to help extract complex features. Compared to other activation functions, ReLU has stronger nonlinear fitting ability to improve the expressive power of neural network models. And it is the most commonly used nonlinear activation function (Glorot et al. 2011). The fourth block consists of convolution, batch normalization and ReLU layers, followed closely by a 25% dropout layer. The dropout layer is designed to mitigate overfitting and enhance the model's generalization capability. After flattening the extracted features, the last three blocks contain a fully connected layer with ReLU as activation function, respectively. The fully connected layers map the extracted features to the regression labels, learn the associations between the features, and implement the final regression calculation. Finally, the output of CNN is the residual gravity anomaly at the point on 1′ × 1′ grids.

Once the neural network architecture is constructed, the important task is to train the neural network using sample data. Shipborne residual gravity with high accuracy and DOVs at training points are employed as training data, and the rest of the data is applied for testing. The neural network is trained according to the following method.

First of all, the neural network model is initialized to randomly generate initial parameters such as weights and biases. Secondly, the input data are propagated forward through the neural network architecture to obtain the output value. Thirdly, the error between the output value and the target value (testing data) is calculated, and the error is transmitted back to the network when it is larger than the expected value. The weights and biases will update according to the proper algorithm, and so on iteratively until the error meets the expected value, then the model training is completed. In this study, a loss function based on the root-mean-square error between the computed gravity and shipborne residual gravity is used to train the network for 40 epochs at an initial learning rate of 0.001. Based on the learning rate and loss function, the weights and biases are updated using Adam optimization algorithm. Adam is an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. It can adaptively adjust the learning rate of each parameter according to the historical gradient and update of each parameter during the training process, thus speeding up the training of neural networks. Adam is widely used in large-scale neural network research (Kingma and Ba 2014). Ultimately, the derived residual gravity is obtained by feeding the DOVs obtained from altimetry to the well-trained neural network.

4 Results and evaluation

4.1 Gravity anomalies from CNN and IVM method

Based on the data process methods described in Sect. 3 and Fig. 2, we derive two 1′ × 1′ marine gravity models in the study area. The model calculated by CNN method is marked as GA_CNN in Fig. 4a, while the model calculated by IVM formula is marked as GA_IVM in Fig. 4b. Additionally, two gravity models used for validation, SIO V32.1 and DTU17, are also illustrated in Fig. 4c, d, under the same color bar. As demonstrated in Fig. 4, the spatial distribution of GA_CNN is highly in agreement with those of other gravity models, which clearly represents the geomorphic features such as seamounts, ridges, trenches, and submarine plains. The gravity anomalies within the entire study area range from − 170 to 220 mGal, with obvious fluctuations, which are closely related to the seafloor topography. Moreover, GA_CNN and GA_IVM contain more high-frequency details compared with DTU17 and SIO V32.1. The reason may be that the V32.1 model adopts low-pass filtering related to water depth in the calculation process, cutting off the short wavelength gravity signal due to upward continuation (Sandwell et al. 2021). The DTU17 model uses geoid as data source for recovering the gravity anomalies, which contains less high-frequency information compared with DOVs.

Fig. 4
figure 4

Gravity models in study area. a GA_CNN model. b GA_IVM model. c SIO V32.1 model. d DTU17 model

4.2 Comparison with shipborne gravity data

To evaluate the accuracy of the GA_CNN model and the performance of the CNN method, we compare the GA_CNN model with shipborne gravity data at check points excluded from neural network training. The linear regression and differences between the GA_CNN model and the shipborne gravity are calculated, and the scatter density plot and difference distributions are represented in Fig. 5. Two main evaluation indicators are chosen to measure the linear regression: the goodness of fit R2 and the root mean square error (RMSE). R2 values approaching 1 indicate that the linear regression model is valid, while an RMSE close to 0 is optimal (Sun et al. 2022). In Fig. 5a, the linear regression between GA_CNN and shipborne gravity anomalies at check points yields an R2 exceeding 0.99 and an RMSE of 3.22 mGal. All these indicate a strong correlation between the GA_CNN model and shipborne gravity. In addition, the histograms of the two gravity datasets are highly consistent, with most of the values distributed in the − 30 mGal to 30 mGal range. The density plot shows that the data density is greatest in the range of 0 mGal to 5 mGal.

Fig. 5
figure 5

The scatter density and distribution of difference between GA_CNN model and shipborne gravity at check points. a The scatter density between GA_CNN model and shipborne gravity, the black line is the linear regression fitting line. b The histogram of the differences between GA_CNN model and shipborne gravity. c The spatial distribution of the differences between GA_CNN model and shipborne gravity, the red dots represent points that have absolute deviations greater than 5 mGal, and the green dots show the points that have absolute deviations less than 5 mGal

Figure 5b, c show the histogram and spatial distribution of differences between the GA_CNN model and shipborne gravity. The histogram result illustrates that the discrepancy values are mostly between − 5 and 5 mGal, which account for 92.84% of the total, with only a few points having larger discrepancy values. The points with larger discrepancy values are mainly located in the central and eastern regions, where water depths are shallow and topography changes rapidly. This may be due to the fact that these areas are influenced by land and the accuracy of SSHs from multi-altimeter is relatively low, limiting the accuracy of gravity recovery. In addition, at the turning points of the ship surveying line, the discrepancy is generally large due to the effect of sharp directional changes on Eötvös correction, thus influencing the accuracy of shipborne gravity measurement. As one of the important means of marine gravity measurement, shipborne gravity measurement requires more reasonable track planning and more accurate data processing.

The deviations of GA_CNN, GA_IVM and other models from the shipborne gravity at check points are computed. The STDs and correlation coefficients are shown in Table 2 and the comparison of histogram distributions is displayed in Fig. 6. For the GA_CNN model, the STD of deviation is 3.21 mGal, with an improvement of 36.56% compared to GA_IVM, 34.89% compared to SIO V32.1, and 38.27% compared to DTU17. The correlation coefficient between the GA_CNN model and shipborne gravity reaches 99.63%, surpassing that of the other models. From Fig. 6, it is evident that 92.84% of the absolute deviations between the GA_CNN model and shipborne gravity are less than 5 mGal, which is significantly higher than GA_IVM model (74.62%), SIO V32.1 model (75.90%), and DTU17 model (73.34%). These results show that the GA_CNN model achieved higher accuracy and CNN method for marine gravity recovery is feasible.

Table 2 Comparisons between altimeter-derived gravity models and shipborne gravity data
Fig. 6
figure 6

The histogram distribution of the absolute differences between shipborne gravity and gravity models at check points

Furthermore, to analyze the actual spatial resolution of derived models, an uninterrupted marine gravity profile with higher resolution, measured by independent JMGrav marine gravimeter (Wu et al. 2023), is applied for cross spectral analysis between marine and derived altimetric gravity. The location of the marine gravity profile is presented in the top left of Fig. 7a. JMGrav marine gravimeter adopts a new gravity sensor based on electromagnetic damping, ensuring the accuracy of the gravity measurements better than 2 mGal, with approximate resolution of 500 m (Wu et al. 2023). Figure 7a displays the result of the marine and altimetric gravity profile comparisons. All four altimetric gravity models yield approximately the same profiles as the JMGrav gravity, verifying the veracity of the computed results. Table 3 provides the statistical results of difference between each altimetry-derived gravity model and JMGrav gravity. Compared to the JMGrav gravity, the GA_CNN model has an STD error of 3.17 mGal, while the other models GA_IVM, SIO V32.1 and DTU17 have STDs of 3.67, 3.54 and 3.89 mGal, respectively.

Fig. 7
figure 7

The JMGrav gravity profile in study area (a), location of the profile is inserted in the figure to the top left. And the cross-spectral analysis of shipborne gravity and GA_CNN, GA_IVM, SIO V32.1 and DTU17, respectively (be)

Table 3 Comparisons between altimeter-derived gravity models and JMGrav shipborne gravity data

Cross-spectral analysis can measure the correlation of two signals at different wavelengths. Shipborne gravity is assumed to have high resolution due to its high sampling rate of a few meters. By cross spectrum between the interpolated altimetry and shipborne gravity profile, the actual resolution of the altimetry-derived gravity field can be analyzed (Yale et al. 1995). The cross spectrum between JMGrav gravity and interpolated altimetry gravity profiles are calculated and shown in Fig. 7b–e. The findings suggest that at short wavelengths the shipborne gravity anomalies are more powerful than altimetry-derived gravity anomalies. It is noteworthy that the GA_CNN and GA_IVM models gradually attain power comparable to shipborne gravity at wavelengths longer than approximately 7 km. Whereas the SIO V32.1 model gradually reaches the same power as shipborne gravity at wavelengths beyond about 10 km, and the DTU17 model has the same power as shipborne gravity at wavelengths beyond about 20 km. The results further show that the altimetry-derived GA_CNN and GA_IVM models have relatively higher resolution compared to SIO V32.1 and DTU17 models.

4.3 Comparison with other gravity models

To further assess the GA_CNN model, we directly compare it with the GA_IVM, SIO V32.1, and DTU17 models. We calculate the discrepancy values between the GA_CNN model and the other models, and the statistical results are shown in Table 4. The STDs of differences between the GA_CNN model and other models are between 3 and 4 mGal, which is slightly larger than that of GA_IVM model. The reason may be that GA_CNN model is quite different from the traditional inversion models over a large area in the western of the study area, as shown in Fig. 4. In these regions, the training data are sparsely distributed, whereas the CNN method relies on the amount and distribution density of the training data. This is probably making the CNN method perform worse than traditional methods. The exact conclusions need to be further verified by more independent high-precision shipborne gravity.

Table 4 Comparisons between altimeter-derived gravity and other marine gravity models

The power spectral density (PSD) can be used as a measure for evaluating the energy of signals at varying wavelength. Many researchers have used PSD for marine geophysical model assessment because a higher PSD indicates more details at the same wavelength (Annan and Wan 2022; Li et al. 2023; Sun et al. 2022). The PSD is computed as 10log10 (P), where P is the relevant power. It is expressed in dB. The PSDs of the four gridded marine gravity models are shown in Fig. 8.

Fig. 8
figure 8

PSD of gravity models. The black line represents the PSD of the CNN-derived model, the red line represents the IVM-derived model, the blue line represents the V32.1 model, and the green line represents the DTU17 model. The red broken rectangle indicates the wavelengths less than 25 km

At medium to long wavelengths (≥ 25 km) as shown in Fig. 8, the energies of the four gravity models are comparable and relatively high. At short wavelengths less than 25 km (in red broken rectangle in Fig. 8), the energies are low. Nevertheless, the GA_CNN model generates a relatively high energy, which indicates that the GA_CNN model carries more short-wavelength gravity details than the other models. The reason may be that the GA_CNN model is derived from CNN method, which uses shipborne gravity to train the neural network model. The spatial resolution and accuracy of shipborne gravity are both relatively high, so GA_CNN contains more high-frequency signals of gravity. In addition, SIO V32.1 model adopts low-pass filtering related to water depth in the calculation process, which filters out some high-frequency information. DTU17 uses the geoid as the data source to recover gravity anomaly, and the geoid contains less high-frequency information than DOVs. It further attests that CNN method can introduce shipborne gravity as prior information reasonably to improve the accuracy and spectral characteristics of constructed gravity model.

5 Conclusions

In this study, a convolutional neural network (CNN) and remove-compute-restore (RCR) technology are applied to derive the gravity model in the SCS by combining multi-mission DOVs from Altika/DP, HY-2A/GM, CryoSat-2, Jason-2/GM and Jason-1/GM. The network’s inputs are 32 × 32 × 12 × n images consisting of geo-information about the calculated point and residual DOVs on surrounding points. The output of CNN is the residual gravity anomaly at the calculated point. The final gravity anomaly is then recovered by restoring the long-wavelength reference gravity field. In addition to CNN method, gravity anomaly also be calculated by IVM formula. To assess the accuracy of the recovered gravity model and the performance of CNN method, we compare GA_CNN, GA_IVM models to independent shipborne gravity, JMGrav gravity profile and gravity anomaly models SIO V32.1 and DTU17.

The results show that the CNN method could calculate marine gravity model effectively via simplified fusion of multi-source DOVs, and the derived model reached a high accuracy level. There is a strong correlation between the GA_CNN model and shipborne gravity, with R2 of the linear regression exceeding 0.99. The STD of the differences between the GA_CNN model and shipborne gravity is 3.21 mGal, with significantly improvement than GA_IVM model of 5.06 mGal, the SIO V32.1 model of 4.93 mGal, and the DTU17 model of 5.20 mGal. More than 92% of the absolute differences between the GA_CNN model and shipborne gravity are less than 5 mGal. The STD between the GA_CNN model and JMGrav gravity profile is 3.17 mGal, which is superior to all other models. The correlation coefficients between the GA_CNN model and the other gravity models exceed 99%, indicating that the GA_CNN model is largely consistent with other gravity models. Furthermore, the PSDs of four gravity models are also computed and compared. The results show that the GA_CNN model has stronger energy at short wavelength (less than 25 km) than other models, which suggests that CNN method can introduce shipborne gravity as prior information reasonably to improve the accuracy and spectral characteristics of constructed gravity model.

Availability of data and materials

The L2P products for missions Jason-2, Jason-1, SARAL/AltiKa, Crysoat-2, HaiYang-2A and the GDR products of Jason-3 are available at AVISO+ (https://www.aviso.altimetry.fr/). The shipborne gravity data are provided by the National Centers for Environmental Information (NCEI, https://www.ncei.noaa.gov/) and SeaDataNet Pan-European infrastructure for ocean and marine data management (https://www.seadatanet.org). The altimeter-derived gravity anomaly models V32.1 and DTU17 used in this study are obtained from University of California at San Diego (ftp://topex.ucsd.edu/) and DTU Space, f: https://www.space.dtu.dk/.

Abbreviations

CNN:

Convolutional neural network

DOVs:

Vertical deflections

IVM:

Inverse Vening Meinesz

STD:

Standard deviation

SSHs:

Sea surface heights

LSC:

Least squares collocation

GMs:

Geodetic missions

T/P:

TOPEX/POSEIDON

RCR:

Remove–compute–restore

AltiKa/DP:

SARAL/AltiKa/Drifting Phase

SCS:

South China Sea

L2P:

Level-2 Plus

GDRs:

Geophysical data records

MDT:

Mean dynamic topography

EGM2008:

Earth Gravitational Model 2008

NCEI:

National Centers for Environmental Information

SIO:

Scripps Institution of Oceanography

DTU:

DTU space

1D FFT:

1-D fast Fourier transform

ReLU:

Rectified linear unit

RMSE:

Root mean square error

PSD:

Radially power spectral density

References

Download references

Acknowledgements

We are grateful to the anonymous reviewers for constructive reviews that improved the manuscript greatly. This research is funded by the National Natural Science Foundation of China (Grant Nos. 41931076, 42192535, 42174102) and the Natural Science Foundation of Hubei Province (2022CFB816).

Funding

This research is funded by the National Natural Science Foundation of China (Grant Nos. 41931076, 42192535, 42174102) and the Natural Science Foundation of Hubei Province (2022CFB816).

Author information

Authors and Affiliations

Authors

Contributions

LB, QL and HS conceived this study and designed the experiments. QL and ZZ carried out the experiments and wrote the initial draft of the manuscript. LB, YW, LW, GM and HS analyzed the results and revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lifeng Bao.

Ethics declarations

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Q., Zhai, Z., Bao, L. et al. A convolutional neural network to optimize multi-mission satellite altimeter fusion for improving the marine gravity field. Earth Planets Space 76, 129 (2024). https://doi.org/10.1186/s40623-024-02070-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40623-024-02070-z

Keywords