 Full paper
 Open Access
Temporal resolution of internal magnetic field modes from satellite data
 João Domingos^{1, 2, 3}Email authorView ORCID ID profile,
 Maria Alexandra Pais^{1, 2},
 Dominique Jault^{3} and
 Mioara Mandea^{4}
 Received: 18 September 2018
 Accepted: 31 December 2018
 Published: 10 January 2019
Abstract
Keywords
 Geomagnetic field
 Virtual observatories
 Principal component analysis
 Swarm constellation
 CHAMP satellite
Introduction
Over more than a century, the temporal variations of the Earth’s magnetic field have been mainly monitored by continuous recordings in geomagnetic observatories. Although the number of observatories has steadily increased, their geographic coverage remains highly uneven. To overcome this difficulty, satellite data are now available, albeit for only the past two decades. Recent magnetic field models have been constructed from measurements obtained from low Earth orbiting satellites carrying magnetometers, mainly Ørsted, CHAMP, SACC and Swarm. Unfortunately, these satellite missions are short in comparison with the correlation times of the Earth’s core magnetic field (Hulot and Le Mouël 1994). Moreover, there may be gaps related to instrumental failure, or nonconsecutive missions. Lack of satellite vectorial data between September 2010 (deorbiting of CHAMP) and November 2013 (launch of Swarm mission) has contributed to the decrease in spatial resolution of main field models during that period (Beggan and Whaler 2018).
Most analyses of the variability of the main field have been based on time series of spherical harmonic coefficients. Lhuillier et al. (2011) estimated the correlation time of the main nondipole field at \(415/\ell\) years where \(\ell\) is the harmonic degree. In addition to its slow variation, the main magnetic field also has decadal and interannual fluctuations, and satellite data covering the last 20 years can be very useful to investigate this range of periods. Close to 1year timescale, the contribution of the external field is present and quite strong and a proper separation is not yet possible. Finally, at the highfrequency end of the main field spectrum, its amplitude is too small to be perceived at observatories and satellites.
The time resolution of series of spherical harmonic coefficients from geomagnetic field models has naturally been investigated as a function of \(\ell\). Models based on observatory records had lower time resolution for higher degree \(\ell\): as an example, Langel et al. (1986) represented the secular variation from 1903 to 1982 using cubic Bsplines with up to six nodes for degrees 1–4 but only one internal node for degrees 5 and 6. The situation has improved with the availability of satellite data, and yet, for degrees 5–8, Gillet (2018) remarks that only signals with periods above about 6 years can be retrieved from timevarying current satellite field models. The time resolution of satellite field models is better for lower degrees and worse for higher ones (compare the spectrum of the second and third time derivatives of the field in Figure 2 of Lesur et al. (2010)). This is a consequence of the regularisation techniques used to produce the models (Finlay et al. 2016). It is noteworthy that the lower variability of fine scale structures in models of the Earth’s magnetic field results from the nature of the inverse modelling and does not reflect the time variability of small scale features of the actual field. This observation has motivated the development of models of the timevarying Earth’s magnetic field that do not involve a projection of the geomagnetic potential on spherical harmonics (Olsen and Mandea 2007).
Principal component analysis (PCA), also known as principal orthogonal decomposition [see, e.g. Taira et al. (2017) for a recent overview], is a widely used statistical tool to extract a minimal number of energetic coherent structures from a time sequence of scalar maps. It has the advantage of describing the spacetime evolution of a physical quantity as the superposition of a few decorrelated components (or modes), a property referred to as dimensionality reduction. It is one among different databased modal techniques used as a prospective analysis of real or simulated data. Besides, in certain cases, physically meaningful structures displaying the symmetries of underlying source mechanisms may emerge from PCA modes. By directly using records of the geomagnetic field at fixed locations (observatories) to conduct a PCA analysis, it may be possible to extract features—with small length scales and/or short timescales—left aside when constructing spherical harmonic models.
As a matter of fact, PCA analysis has been employed to calculate spatiotemporal patterns of the external component of the field from geomagnetic observatory data (Balasis and Egbert 2006; Shore et al. 2016), giving only an ancillary role to spherical harmonic modelling. Furthermore, PCA may allow for the separation between the internal and the external field (which is key to detect rapid variations of the internal field). The analysis of the geomagnetic field vector requires a slight modification of the standard PCA method in order to ensure that the spatial modes for the three components are each subject to the same time variation. We shall refer to it as vectorial PCA. Shore et al. (2016) performed such an analysis using groundbased geomagnetic data from 1997 to 2009 keeping only the five quietest days in each month, after removal of internal main field and crustal contributions. The different modes that they identified have a clear physical significance. Their mode 1 describes the magnetospheric ring current and is strongly fluctuating. The mode 2 represents an annual oscillation of the global system of currents and mode 3 describes ionospheric signals with semiannual period.
The two studies of Balasis and Egbert (2006) and Shore et al. (2016) were concerned with the determination of external magnetic field variations. In the context of main field studies, however, global coverage is crucial. This explains why the first attempts at applying the PCA method to the main field have relied on spherical harmonic coefficients as the quantity to be analysed either directly (Chulliat and Maus 2014) or after projection to produce maps of the radial field at the coremantle boundary (Pais et al. 2015a). Pais et al. (2015a) derived explicitly the mathematical conditions when the two approaches are equivalent. From CHAMP models centred on epochs ranging from 2002 to 2009, Chulliat and Maus (2014) retrieved a standing wave of period about 6 years. Pais et al. (2015a) used the CM4 model for 1960–2002 (Sabaka et al. 2004), which is mainly based on observatory data, to obtain largescale modes and discuss the relationship between their time evolution and the occurrence of geomagnetic jerks. In these studies, the time variability of the modes increases with their rank, a result that is compatible with the \(\sim f^{4}\) energy spectrum of the main field (De Santis et al. 2003; Lesur et al. 2018).
Using PCA in the context of dynamical systems theory, to reconstruct the state space of a system from experimental time series of some univariate stochastic variable, Gibson et al. (1992) showed that discrete Legendre polynomials can very closely approximate the set of orthogonal basis functions that describe time variation. In a recent study, McKinnon et al. (2016) used explicitly the first four Legendre polynomials to project a data matrix on variability of daily maximum and minimum summer temperatures over the Northern Hemisphere, because they are so similar in shape with the time basis resorting from PCA.
Mandea and Olsen (2006) introduced the concept of “Virtual Observatories” (VO) at the satellite altitude in order to describe the core magnetic field and its secular variation. It consists in computing the secular variation directly from the satellite data, as done at ground observatories, but on a dense and regular grid, overcoming the uneven distribution of observatories. Each VO “measurement” averages satellite data inside a defined volume and during a specific time window typically 30 days long. For each VO, Mandea and Olsen (2006) used CHAMP measurements, which they reduced to a common altitude. The correction rests on the hypothesis that the residuals of the measurements (obtained by the subtraction of an a priori spherical harmonic field model) are given by a potential field varying linearly in space. Assuming that over 1 month the external field contributions can be neglected, they estimated the parameters of the residuals and used them to compute a mean magnetic field residual for each VO and each month. The resulting VO time series correlate well with the corresponding groundbased observatories time series (Olsen and Mandea 2007). In contrast with measurements at real ground observatories, VO series are free from local crustal effects. Beggan et al. (2009) followed up the approach developed by Mandea and Olsen (2006) and created VO datasets from both Ørsted and CHAMP satellite measurements. They found that external fields cannot be entirely removed and produce, as a result, unrealistic secular acceleration of the main field. Saturnino et al. (2018) illustrated an alternative methodology with 2 years of Swarm data. They neither selected the data nor removed a model for the external field. They applied instead an equivalent source dipole technique. They accounted for the recorded magnetic field by an ensemble of timevarying dipoles located at the coremantle boundary and reconstructed VO series at a uniform altitude from these timevarying dipoles.
We undertake here a PCA analysis of the VO datasets recently created by Barrois et al. (2018). They calculated separate datasets from CHAMP and Swarm measurements which are as free as possible from contributions of the external field. Both time series have a time resolution of 4 months with almost no gaps despite strict selection criteria of the magnetic measurements.
The next section (“Data” section) consists in a description of the VO datasets. It is followed in “Methods” section by a summary of the PCA method that we employed. Our main results are presented in “Results” section, where we compare the modes calculated from CHAMP and Swarm data, respectively, and a discussion follows in “Discussion” section, focusing on the issue of time resolution. The paper ends by pointing out, in “Conclusions and perspectives” section, the main conclusions and some interesting future developments.
Data
Virtual observatories
Here, we use two sets of recent VO series, similar to the one presented by Barrois et al. (2018) with some small changes including the number of VOs: one of them was calculated from measurements provided by the CHAMP vector magnetometer (from July 2000 to September 2010) and the other one was computed from three Swarm vector magnetometers (from November 2013 to December 2017). For the selfconsistency of this paper, let us summarise the applied procedure.
To calculate the magnetic field values at each VO location, firstly a data selection needs to be applied in order to reduce the noise and aliasing due to the highfrequency currents in the magnetosphere and the ionosphere, and charges circulating near and across the satellites orbit. The initial dataset consists of CHAMP MAGL3 and Swarm Level 1b MAGL, version 0501 at 15second sampling rate data. Large outliers deviating more than 500 nT from the CHAOS6x3 model (Finlay et al. 2016), were removed. The next step in data selection consists in considering only nighttime (Sun at most \(10^\circ\) above horizon) and geomagnetically quiet conditions (\(K_p < 3^0\), \(\mathrm {d}RC/\mathrm {d}t < 3\) nT/h, \(E_m \le 0.8\) mV/m, \(B_z > 0\) and \(B_y < 10\) nT, where \(K_p\) is the geomagnetic activity index, RC is the ring current disturbance index, \(E_m\) is the merging magnetic field at the magnetopause, \(B_y\) and \(B_z\) are components of the interplanetary magnetic field, see Barrois et al. (2018) for a precise definition of these parameters). For the CHAMP dataset, alongtrack sums and differences were used, while the Swarm dataset benefits from using both alongtrack and crosstrack sums and differences, which provide a signal less perturbed by external sources.
To minimise other possible unmodelled sources, several processing steps were applied and the contributions from different sources removed from data: (1) the magnetospheric field (up to degree \(\ell =2\)) and its Earthinduced counterpart as given by the CHAOS6x3 model; (2) the ionospheric and its induced fields as given by the CM4 model (Sabaka et al. 2004); (3) the timedependent internal field for spherical harmonic degrees \(\ell \le 20\) and the static internal field for \(\ell > 20\) given by the CHAOS6x3 model. Note that these corrections involve different coordinate systems. Both the core and ionospheric fields are estimated in the geographic coordinate system. The fields due to near Earth magnetospheric sources, including the ring current, are calculated in the solar magnetic coordinate system (with the z axis defined as the dipole axis, and the SunEarth line contained in the xz plane, see, e.g. Laundal and Richmond 2017). The fields due to remote magnetospheric sources are calculated in the geocentric solar magnetic coordinate system (x axis along the EarthSun line, dipole axis in the xz plane).
In order to divide the surface of a sphere into nearly equal area regions, the Recursive Zonal Equal Area Sphere Partitioning Toolbox (Leopardi 2006) was used, giving 300 VOs for the dataset provided by Chris Finlay and Magnus Hammer (personal communication). In the vicinity of each VO, the residual field can then be represented as the gradient of a scalar potential V (determined as a linear combination of cubic polynomials in local Cartesian coordinates) inside a cylinder, separately. For each VO position and at a given epoch all data inside a cylinder of radius 700 km and within 4 months were used to build a local data vector. The three components of the residual field and the locations of the observation points are calculated in a local Cartesian coordinate system with the origin at the VO position.
Data selection for PCA
The VOs data from CHAMP satellite and Swarm constellation of three satellites were used to perform a PCA analysis of the magnetic field. VO data are provided for a fixed set of spatial coordinates, which is a requirement of PCA decomposition. In order to work with the longest possible simultaneous series in a fixed number of VOs, the period under study is shorter than the satellite mission periods, from November 2001 to November 2009 for CHAMP and from March 2014 to November 2017 for Swarm. Both datasets consist of vector magnetic field values every 4 months, for a total of 25 epochs for CHAMP and 12 epochs for Swarm. The number of grid points for CHAMP is \(N_P=292\), since 8 of the original VOs presented recurrent gaps.
For Swarm, a different set was constructed, since field values representative of the core field were systematically missing at high latitude VOs, due to strong external perturbations. For this reason, Swarm dataset consists of only \(N_P=258\) VOs between \(60^{\circ }\) and \(60^{\circ }\) latitude (subauroral grid). Same as in Shore et al. (2016), PCA is applied on a data matrix with \(B_r\), \(B_\theta\) and \(B_\phi\) components (where \((r,\theta ,\phi )\) are spherical polar coordinates).
Although Swarm data are more precise, the longer time series from CHAMP allows to expect a better resolution of modes that have annual to decadal characteristic timescales. The equal area distribution of grid points makes it unnecessary to apply a latitudinal weighting factor, which would have been the case for the geographic regular grid spacing approach or for the existing network of ground observatories (Shore et al. 2016). The altitude \(R_{VO}\) of the VOs obtained from CHAMP and Swarm data is 300 km and 500 km, respectively. This slightly complicates the comparison between the modes obtained from the two datasets.
Methods
Implementation of principal component analysis
How close to \(\mathbf {X}\) is (2) depends on how many terms we keep in the expansion, i.e. on k, but also on the intrinsic nature of the signal. As an example, when applying PCA to the main field (MF), its secular variation (SV) and its secular acceleration (SA) during the time period 1960 to 2002, the first mode in each independent analysis explains over 90% of the MF variance, about 70 % of the SV variance and only 30 % of the SA variance (Pais et al. 2015a). The variance explained by modej, describing how variable is the (zero mean) parameter \(y^j\) among different realisations, is given by \(\sum _{i=1}^M (y_i^j)^2/(M1)\). This variance is the element (j, j) of diagonal matrix \(\mathbf {C}_Y=\mathbf {Y}^T \mathbf {Y}/(M1)\), \(\lambda ^j\), which is also the nonnegative real eigenvalue j of covariance matrix \(\mathbf {C}_X\) once the ordering by decreasing value has been applied. Note that \(\lambda ^j\) and \(\alpha ^j\) in (2) are related through \(\lambda ^j=(\alpha ^j)^2/ (M1)\).
Spherical harmonic models
The eccentric dipole approximation
The field can be approximated as a point dipole moving with respect to the Earth’s centre, with its magnetic moment rotating relative to the geographic frame. Then, the field variation is simulated considering only the eccentric dipole component as given by Schmidt (1934) and Bartels (1936) who imposed the invariance of the dipole moment: the eccentric dipole sodefined has the same moment and the same orientation as the centred dipole defined from the three Gauss coefficients of degree 1. The position \(\mathbf {r}_{ED}\) of the eccentric dipole (i.e. the offset between the dipole centre and the Earth’s centre) is then chosen to minimise the squared quadrupole moment as seen from \(\mathbf {r}_{ED}\) [see, e.g. Lowes (1994) and references therein]. Bartels (1936) gave the equations used to determine \(\mathbf {r}_{ED}\) [see his equations (14) and (15) reproduced in FraserSmith (1987)], which depend only on Gauss coefficients of degrees 1 and 2.
On spheres \(r=R\) outside the Earth’s surface and centred at the Earth’s centre, an eccentric dipole located at \(\mathbf {r}_{ED}\), with \(s=r_{ED}/R\ll 1\) produces a field dominated by its components of degree 1 and 2. James and Winch (1967) showed that a dipole of moment \(\mathcal {M} \hat{m}\) located at \(\mathbf {r}_{ED}\) is equivalent to an infinite sum of centred multipoles, all with axes along \(\mathbf {r}_{ED}\) except for the first term (aligned along \(\hat{m}\)). The strength of different multipoles scales with degree \(\ell\) as \(\mathcal {M} s^{\ell 1}/{(\ell 1)}!\), where \(s\sim 0.08\) for the present field and at the satellite altitude.
Application of PCA to VO series
Results
The PCA is applied to 8 years of CHAMP data, from November 2001 to November 2009, and 44 months of Swarm data, from March 2014 to November 2017, with a sampling interval of 4 months. We have chosen to remove the average field (including the crustal field) prior to calculating the PCA. Keeping it would have introduced the mean field, which is approximately a zonal dipole, as another spatial mode.
Identification of the principal components as Legendre polynomials
The (normalised) time series corresponding to the first four modes from CHAMP are represented in Fig. 1. We discuss them together with the time series for modes 1, 2 and 4 from Swarm (see Fig. 2). It appears that these principal components are well approximated by polynomials of degree increasing with the mode order.
Using eigenvalues of the covariance matrices from CHAMP and from Swarm VO series, we obtain \(\tau _W^c \sim\) 30–70 year from Eq. (4), much larger than each corresponding dataset window \(\tau _W\), of 8 and 3.7 year. A more precise gauge for the smallwindow regime is \(\varepsilon =(\tau _W/\tau _W^c)^2\), that should be kept such that \(\varepsilon \ll 1\) in order that Legendre polynomials may be used as temporal basis functions (Gibson et al. 1992). This condition is verified in our study since \(\varepsilon \sim 0.01\).
The singular spectrum and the level of noise variance
The singular spectrum (the set of \({\lambda ^j}\) values) is shown for each dataset, in Fig. 3. After a rapid decrease, not properly represented neither by an exponential nor a power law, a knee follows and then an asymptotic slightly decreasing trend. This is often referred to as a “plateau” and is attained after only a few modes at a height level that is related to the amplitude of the Gaussian noise in the data (see, e.g. Gibson et al. 1992). For eigenvalues above the plateau, North’s rule of thumb (3) is used to locate the order in the PCA expansion where modes are no longer independently resolved and become degenerate. This occurs after order 4 for CHAMP and order 3 for Swarm datasets.
Differences between the two spectra in Fig. 3 might be due to different behaviour of the field in the two periods 2001–2009 and 2014–2017, different quality of the two datasets or different sample size, M. From results shown in Figs. 1 and 2, the order1 PC is a straight line, i.e. \(\mathbf {y}^1= \alpha ^1 \mathbf {y'}^1 = a (tt_0)\) where \(t_0\) is the mean epoch for each dataset. Note that, in those two figures, time t is in year/3 (4 months) and the slope is in units of (year/3)\(^{1}\). The parameter a is quite similar for the two datasets. In fact, from \(\mathbf {y'}^1\) plotted in Figs. 1 and 2 and the values of \(\alpha ^1\), we obtain \(a_{\mathrm {Swarm}}=1063\) nT/year and \(a_{\mathrm {CHAMP}}=1132\) nT/year, a difference of only 6%. We shall assume that at epochs that are not too far apart, the field rate of change has not significantly varied and that estimates of this rate do not depend on M. Taking into account that all \(\mathbf {y}^j\) have zero mean, the variance represented by the first mode, which is linear in time, scales with \(\tau _W\), the width of the time window, as \(\tau _W^2\) (i.e. \(M^2\)). It turns out that this approximate relationship explains well the relative amplitude of this mode calculated from CHAMP or from Swarm data (see Fig. 3). Conversely, eigenvalues (mode variances) in the noise plateau (\(\lambda ^j\) for \(j \gtrsim 6\)) are expected to be independent on M if they represent a white noise stochastic process. Therefore, the noise plateau does not depend much on the width of the time window. We can directly compare the noise levels for CHAMP and for Swarm data (see the grey lines in Fig. 3) as the number of CHAMP and Swarm VOs is almost the same. Estimated in nT, we find that the height of the noise level for Swarm data is about 0.4 times the value for CHAMP data.
Figure 3 also shows results of a test where we compare the PCA spectra using the full CHAMP dataset (8 years) with that using only the first 3.7 years, the same length as the Swarm dataset (see the thinner solid line). This nicely supports the previous assumption that the plateau level is independent of M. It also reproduces the predicted variation in \(\lambda ^1\). Finally, this test illustrates that more and more modes are resolved as the time window is extended: the amplitude of the resolved polynomial modes increases, while the noise level remains constant.
Assuming that the time series sample analytical functions, Gibson et al. (1992) found that the variance of the modes decreases exponentially with their order in the smallwindow regime. Figure 3 for CHAMP data shows a different law for the decrease of the eigenvalues of the first modes. It should be noted however that the geomagnetic field measurements are arguably better described as realisations of a stochastic process (e.g. Hellio et al. 2014) rather than as values taken by an infinitely derivable function. It would thus be interesting to learn how the eigenvalues in the singular spectrum are expected to decrease when the signal samples the solution of a stochastic differential equation.
Internal and external modes from resolved empirical orthogonal functions
Figure 4 shows the spatial features of the first four modes of the PCA decomposition of CHAMP VOs. The vector PCA decomposition implies that each mode is characterised by three different plots, one for each \(B_r\), \(B_\theta\) and \(B_\phi\) components. In order to recover the spatial distribution of scaled values for a given mode, over time, values in charts of Figs. 4 and 5 have simply to be multiplied by corresponding time series in Figs. 1 and 2 [see Eq. (2)]. In Fig. 5, we show four distinct modes for Swarm, three of them with temporal variations very similar to CHAMP and mode 3 with a much shorter period of 1 year. The grid for Swarm VOs has less representation in the auroral latitudes, due to very intense external fields there, so we do not show the EOFs at these latitudes.
Normalised misfit values comparing PCA modes with internal and external SH models, for CHAMP and Swarm
Satellite  Model  Misfit  

Mode 1  Mode 2  Mode 3  Mode 4  
CHAMP  Internal  0.003  0.043  0.140  0.256 
External  0.576  0.576  0.568  0.538  
Swarm  Internal  0.003  0.056  0.387  0.301 
External  0.553  0.555  0.386  0.506 
Spatial resolution of the successive modes
We represent in Fig. 6, for all modes, their average degree \(\ell _B\), which is directly related to a length scale, as a function of their time resolution, estimated from the degree of the appropriate Legendre polynomial. The time resolution \(T_B\) cannot be directly identified with a characteristic timescale as it clearly depends on the length \(\tau _W\) of the time series; the time \(T_B\) cannot exceed \(\tau _W\). As an example, both modes 1 from CHAMP and Swarm correspond to the secular variation of the field, which has a typical timescale much longer than the duration of the satellite records. We nevertheless observe that, globally, \(\ell _B\) increases as \(T_B\) decreases: the smallest features of the field have the shortest timescales as expected from a physical standpoint (Nataf and Schaeffer 2015). This conclusion rests heavily on modes that we do not trust so much, as indicated by the size of the symbols (the larger the symbol, the least the agreement with an internal field model). The strongest evidence yet comes from the comparison between the first two Swarm modes.
Discussion
Main PCA mode well explained as the variation of the eccentric dipole
The first mode of the PCA decomposition from either CHAMP or Swarm data closely follows a straight line that corresponds exactly to the Legendre polynomial of degree 1 (Figs. 1a, 2a). Furthermore, Fig. 6 indicates that the average harmonic degree of the spatial structure is very small, not far from 2. As a result, it does not come as a surprise that the spatial structure associated with mode 1 (in Figs. 4, 5), can be explained to a large extent using a simple model of a point dipole that moves with respect to the Earth’s surface, at the same time as its magnetic moment is rotating.
Figure 7 shows the secular variation due to the eccentric dipole evolution calculated from the CHAOS6 model on the top (difference between the field obtained at consecutive epochs) for epoch 2014, and on the bottom we show the secular variation computed directly from the IGRF12 model, for 2015–2020. The spatial signature we obtain for the moving eccentric dipole associated secular variation is clearly similar to the spatial pattern of mode 1 and to the IGRF12 model.
PCA mode 1 gives the average secular variation during the time interval of CHAMP and Swarm datasets. As a result, it compares well with the IGRF models for these time periods (Thébault et al. 2015).
Higher order time derivatives
The direct reading of computed PCs as Legendre polynomials of increasing degree is valid only while in the smallwindowwidth regime, which applies to our datasets, for which \(\tau _W<< \tau _W^c\). If \(\tau _W\) increases to the point that it gets close to or even larger than \(\tau _W^c\), PCs must be interpreted in a different way. Often, when using PCA on data matrices as (1), the computed PCs are expected to reveal characteristic timescales of the physical mechanisms contributing to those particular datasets. As an example, Domingos et al. (2017) retrieved a quasiperiodicity of \(\sim\) 11 year when applying PCA to the flux of high energy particles over the South Atlantic Anomaly, due to the solar forcing of the neutral atmosphere density. In the same way, if the VOs global signal includes a component due to a distinct source and with a correlation time short with respect to \(\tau _W\), we expect that PCA succeeds in extracting it, if its energy rises above the noise level. This is in fact what occurs in the mode 3 from Swarm database.
The second modes resolved in both CHAMP and Swarm datasets may be regarded as Secular Acceleration (SA) modes. Their importance comes from the fact that SA is seen as a window on the Earth’s core dynamics, since it is mainly the result of the interaction between the flow acceleration and the field, and flow acceleration is read in terms of contributing force terms (e.g. Aubert 2018). The corresponding PCs are very close to the discrete Legendre polynomial of degree2 (see Figs. 1b, 2b), which is precisely the required filter to calculate the second time derivative of the geomagnetic field. The spatialtime information retrieved through these modes allows for the characterisation of the components of the SA field, at the mean epoch of each dataset. In agreement with this interpretation, the EOF2 chart in Fig. 4d–f at 300 km altitude (mean epoch 2005.8), is very similar to the chart of SA at the Earth’s surface for 2006 as given by Chulliat and Maus (2014) (see their Figure 7, also built from CHAMP data). It is dominated by spatial structures of SH degree 3 (as expected from our Fig. 6). The secular acceleration at the Earth’s surface evolves rapidly and the whole process wherein the Atlantic equatorial patches of SA radial field change from one sign to the other takes about 4 year [see again Figure 7 of Chulliat and Maus (2014)]. This suggests that \(\tau _W\) from Swarm may be too small to properly resolve this mode and could explain why EOF2 is quite different between the two datasets (compare Figs. 4d–f, 5d–f), with a correlation coefficient of only 0.47. Almost the same correlation (0.46) can be found between mode 2 from Swarm and mode 3 from CHAMP, despite the time series corresponding to different order time derivatives.
Modes 2 and 3 from CHAMP have similar average SH degree \(\ell _B\) (in contrast with the typical increase of \(\ell _B\) with the order of the mode), and they are quite similar, up to a longitude shift. The amplitude of mode 3 is higher than predicted from the power law scaling put forward by Gibson et al. (see “The singular spectrum and the level of noise variance” section); yet, it is smaller than the amplitude of mode 2 (see Fig. 3) and the two modes cannot be simply combined to form a propagating wave. Their similarity is nevertheless intriguing, and we can wonder whether it hints at a nonstochastic time pattern, which might be compared, for example, to the geomagnetic oscillation (of period about 6 years) reported by Silva et al. (2012) in the Earth’s surface SA, mostly from the CM4 model (from 1960 to 2002.5). Other workers have searched too for periodic signals in the SA but at the coremantle boundary (CMB) instead of the Earth’s surface. Chulliat and Maus (2014) reported a standing wave, of dominant degree \(\ell =6\) at the CMB and of period about 6 years, in part of the equatorial belt from a principal component analysis of CHAMP data projected at the core surface [see also Chulliat et al. (2015)]. It is noteworthy, however, that continuation of the SA from the Earth’s surface \(r=R_E\) to the CMB \(r=R_C\) is fraught with difficulties. It entails amplification by a factor of \((R_E/R_C)^{\ell +2}\) and gives prominence to higher harmonic degrees (small scale structures) that have to be filtered out, typically above \(\ell =6\). All these studies argue for SA variability and equatorial symmetry of the radial SA in the Atlantic region equatorial belt. This agrees quite well with the geometry of the two modes 2 and 3 from CHAMP at West Africa (Fig. 4d, g). We now need longer and/or more accurate VO series to decide whether these two modes are really interconnected in the description of the same wavelike structure of the field, or if otherwise they do not differ significantly from the expectation of the stochastic null hypothesis producing the \(f^{4}\) spectrum of the main field [see Dommenget (2007) for the discussion of such a null hypothesis and the perspectives for future work outlined in “Conclusions and perspectives” section].
A \(f^{4}\) spectrum for the main field corresponds to a \(f^0\) spectrum for the secular acceleration, i.e. to a vanishing SA correlation time \(\tau _{SA}\), but this time has also been estimated from the ratio between SV and SA energy for each degree \(\ell\) separately (Lesur et al. 2008). Computed values from geomagnetic field models give \(\tau _{SA}\sim\) 10–20 year, independently of the SH degree (up to \(\ell \sim 10\) above which \(\tau _{SA}\) increases steeply). A similar value is obtained from the output of geodynamo simulations up to \(\ell \sim 10\) above which \(\tau _{SA}\) rapidly decreases (Christensen et al. 2012; Aubert 2018). From this perspective also, the window length \(\tau _W\) of the series that we have analysed is too short.
If modes 2 and 3 from CHAMP were not both related in building a deterministic structure, then the resolved mode 3 could still be read as the thirdorder time derivative of the geomagnetic field around the mean epoch 2005. It has a relatively large characteristic length scale \(\ell _B \sim 3\), with two main radial flux patches over Brazil and West Africa, at southern low latitudes. It imparts a special high time variability to SA in the nearequatorial region and in particular to the SA radial flux patch over West Africa which remains equatorially symmetric over time.
The fourth mode from Swarm in Fig. 5 shares the same cubic behaviour as the third mode from CHAMP (Figs. 1c, 2d), and some of the same symmetries can be seen in the EOF charts. In particular, the antisymmetry over the Americas in \(B_r\) (Figs. 4g, 5j) and the diagonally symmetric patches over Central America and South Africa in \(B_\theta\) (Figs. 4h, 5k), both stand out. In the Swarm dataset, this mode is not so well resolved and shows some degree of degeneracy (see Fig. 3).
Among all modes resolved by our PCA analysis, it remains to discuss mode 4 from CHAMP. It shows similarities to mode 2 (from CHAMP) but displays higher variability close to geomagnetic polar regions, possibly due to the superposed effect of external currents.
Modes of order higher than four remain unresolved below the noise level (see Fig. 3). We can expect that if we had access to longer series, we would retrieve from the PCA decomposition extra terms for the series of Legendre polynomials, with better definition.
Annual oscillation mode from Swarm
Mode 3 from Swarm shows a spatial structure pulsating with an approximately annual period, clearly pointing to a magnetic source influenced by the sun. Although the VO series have been cleaned as much as possible from the magnetic fields with sources in the ionosphere and the magnetosphere, there is thus some residual external signal remaining in the Swarm VO series. The spatial structure of mode 3, namely its radial component (Fig. 5g), is clearly distinct from the geometry of other EOFs since it is mostly symmetric about the rotation axis (more precisely, the geomagnetic dipole axis). It shows a band around the geomagnetic equator, characteristic of a zonal quadrupolar geometry (resulting from a \(P_2^0 (\cos \theta )\) term in the magnetic potential, see Eq. 6, in the geomagnetic dipole reference frame). Mode 3 from Swarm cannot be due to a solenoidal field of internal origin (relative to the VOs altitude) since the misfit to an internal potential field reported in Table 1 is high. Neither is it a solenoidal field completely external to the VO locations. Thus, contributions from both internal (ionosphere, induced fields in the crust and mantle) and external (magnetosphere) sources are present. Outside the auroral regions, which are not represented in our study, we report deviations reaching \(\pm 2.6\) nT (obtained by multiplying the time series in Fig. 2c by charts in Fig. 5g, see Eq. (2)). This value is much reduced compared to amplitudes between 10 and 20 nT for Malin and Isikara (1976) and Shore et al. (2016). It thus appears that much of the annual oscillation has been removed from the VO series.
The annual oscillation observed at the Earth’s surface (Malin and Isikara 1976), before any treatment to minimise it, also presents a dominant quadrupolar structure. However, the geographic coordinate system used in Malin and Isikara (1976) and in the present study is arguably not the most appropriate system to represent magnetic fields with sources in the magnetosphere. As a matter of fact, Shore et al. (2016) calculated EOFs in the solar magnetic system to make the external field as static as possible in their chosen reference frame. They nevertheless also detected an annual oscillation and interpreted it as the result of a NorthSouth oscillation of the system of external currents arising from the seasonal motion of the Sun in the solar magnetic coordinate system. Shore et al. (2016) showed the annual oscillation (see their Figure 6b) as a function of the magnetic local time defined in this coordinate system. They did not state whether the signal would be static in the geocentric solar magnetic system used by Barrois et al. (2018) to calculate, in addition to the solar magnetic system, some of the external field subtracted from the VO series. Transformation between the solar magnetic system and the geographic coordinate system involves longitudinal averages that explain well the dominantly axial structure of the mode 3 that we obtained from Swarm data.
Even though the CHAMP dataset covers a larger time window, we have found a wellresolved annual mode in Swarm series only. In Fig. 3, the plateau height associated with the noise level decreases from CHAMP to Swarm from about \(8.9\times 10^{2}\) to \(1.3\times 10^{2}\) nT\(^2\). The noise level in Swarm has decreased below the annual mode energy level allowing it to appear as a resolved mode. In fact, the annual mode can also be found in CHAMP but at rank 6–7, with amplitude well below the noise level. These two modes (6 and 7) have a correlation factor of 0.549 and 0.528, respectively, with the annual mode from Swarm (mode 3).
Finally, the 4month resolution is rather long compared to the 1year period of the mode discussed in this section. At an intermediate stage of this study, we examined VO series with monthly resolution. We found again a mode with annual periodicity and similar geometry as the mode investigated here.
Conclusions and perspectives
This study is an effort to retrieve new information on the Earth’s main magnetic field from recent geomagnetic data. We showed that PCA is an interesting alternative to the widely used SH by decomposing the signal into decorrelated modes. PCA seems to more easily retrieve a behaviour where the most rapid oscillations come from the smallest scale features, which is expected on a physical ground.
The PCA decomposition is particularly useful in cases where it allows to distinguish among different physical source contributions (Shore et al. 2016). The fit of internal and external potential field models to PCA modes guided us in a plausible separation between internal and mixed modes. We identified three clearly nondegenerate modes of internal origin from CHAMP data and two from Swarm data. The time series of these modes (PCs) can be represented by Legendre polynomials of increasing degree. The correspondence between the principal components of rank 4 with Legendre polynomials and the partial success of the modelling of the associate EOFs as internal potential fields give us also some confidence in the interpretation of the modes of rank 4 obtained from CHAMP and Swarm. Finally, from Swarm data only, we observe a mode with an annual period and quadrupolar geometry of the radial component (mode 3).

The spatial structure of an eccentric dipole moving towards Asia with amplitude increasing linearly in time is a good approximation to the SV of the main field at first order. After an interval of about 10 years between \(\sim\) 2005 and \(\sim\) 2015 (from mean epoch of CHAMP to mean epoch of Swarm), mode 1 from CHAMP still explains 60% of the time variability of the field during the Swarm period.

We found that the mode 3 obtained from CHAMP data is well resolved. It corresponds to the thirdorder time derivative at the middle epoch of the CHAMP series (November 2005). Once fitted into a spherical harmonic model, the mode can be downward continued to the CMB taking care to minimise the amplification of small scale noise. Interpreting the magnetic field changes with the radial induction equation at the core surface, we can undertake the estimation not only of the surface flow velocity and of the flow acceleration but also of the secondorder time derivative of the velocity. This is all the more interesting as recent studies differ in their estimation of the core flow acceleration over the period 2000–2016. Livermore et al. (2017) found that a high latitude nonaxisymmetric jet has dramatically increased over this period, by a factor 3, whereas Barrois et al. (2018) found a much more modest intensification.

Dommenget (2007) suggested evaluating dataderived EOF modes against modes synthesised from a stochastic null hypothesis. He argued that a spatial firstorder autoregressive process (AR1) can be used as the null information for the spatial structure of the climate variability. Similarly, we may use a temporal secondorder autoregressive process (AR2) as the null hypothesis for the magnetic field variability (based on the \(f^{4}\) magnetic energy spectrum). Dommenget (2007) showed EOF modes for realisations of the twodimensional AR1 stochastic process and introduced the idea of distinct EOFs, i.e. the modes that are the most distinguished from the modes of the null hypothesis. We may use the same approach, now in the time domain, to search for principal components associated with waves in the core, especially the torsional oscillations with about 6 years period (Gillet et al. 2010).

In our tests, we found that we need a VO series of length at least 1.67 year to retrieve the annual mode from Swarm [see also Domingos et al. (2017) for an example where the 11year solar cycle is extracted from a 16.5yearlong series]. Using the same ratio between the length of the time series and the period of the mode, we tentatively suggest that we need at least 10 years of highquality data to retrieve the mode associated with torsional oscillations in the core, which is expected to reach amplitude of \(\sim 2\) nT/year at Earth’s surface (Cox et al. 2016).

VOs were devised in order to monitor the internal field and as such the emergence among the EOFs of an annual mode originating from external sources was unexpected. There remains in the VO series a signal arising from external sources with a significant amplitude of about 2 nT. The presence of this signal points to the need of improvements in the calculation of VO series. This effort could benefit from using, as a guide, mode 3 from Swarm that we computed here.
Declarations
Authors' contributions
JD carried out all computational tasks, made figures and wrote first draft of manuscript. AP supervised the work of JD in particular the implementation of PCA and exploitation of results. DJ worked on the interpretation of results and perspectives. MM provided expert advice with geomagnetic field data. All authors discussed the results, participated in the writing and commented on the manuscript. All authors read and approved the final manuscript.
Acknowledgements
We want to thank Chris Finlay and Magnus Hammer for providing the Virtual Observatory datasets used in this work, and Nicolas Gillet for reading a draft of this paper and making useful comments.
Competing interests
The authors declare that they have no competing interests.
Availability of data and materials
The datasets supporting the conclusions of this article are available in the figshare repository: https://doi.org/10.6084/m9.figshare.7068305.v1.
Funding
This work has been carried out with financial support for JD by CNES (Centre National d’Études Spatiales, France) and “UC  Universidade de Coimbra (DPA5616382)”. CITEUC is funded by FCT  Foundation for Science and Technology (UID/Multi/00611/2013) and COMPETE 2020  Operational Programme Competitiveness and Internationalization (POCI010145FEDER006922).
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Authors’ Affiliations
References
 Aubert J (2018) Geomagnetic acceleration and rapid hydromagnetic wave dynamics in advanced numerical simulations of the geodynamo. Geophys J Int 214:531–547View ArticleGoogle Scholar
 Balasis G, Egbert GD (2006) Empirical orthogonal function analysis of magnetic observatory data: further evidence for nonaxisymmetric magnetospheric sources for satellite induction studies. Geophys Res Lett 33:L11311View ArticleGoogle Scholar
 Barrois O, Hammer MD, Finlay CC, Martin Y, Gillet N (2018) Assimilation of ground and satellite magnetic measurements: inference of core surface magnetic and velocity field changes. Geophys J Int 215:695–712View ArticleGoogle Scholar
 Bartels J (1936) The eccentric dipole approximating the earth’s magnetic field. Terr Mag Atmos Elect 41:225–250View ArticleGoogle Scholar
 Beggan CD, Whaler KA (2018) Ensemble Kalman filter analysis of magnetic field models during the CHAMPSwarm gap. Phys Earth Planet Inter 281:103–110View ArticleGoogle Scholar
 Beggan CD, Whaler KA, Macmillan S (2009) Biased residuals of core flow models from satellitederived ‘virtual observatories’. Geophys J Int 177:463–475View ArticleGoogle Scholar
 Christensen UR, Aubert J (2006) Scaling properties of convectiondriven dynamos in rotating spherical shells and application to planetary magnetic fields. Geophys J Int 166:97–114View ArticleGoogle Scholar
 Christensen UR, Wardinski I, Lesur V (2012) Timescales of geomagnetic secular acceleration in satellite field models and geodynamo models. Geophys J Int 190:243–254View ArticleGoogle Scholar
 Chulliat A, Maus S (2014) Geomagnetic secular acceleration, jerks, and a localized standing wave at the core surface from 2000 to 2010. J Geophys Res Solid Earth 119:1531–1543View ArticleGoogle Scholar
 Chulliat A, Alken P, Maus S (2015) Fast equatorial waves propagating at the top of the Earth’s core. Geophys Res Lett 42:3321–3329View ArticleGoogle Scholar
 Cox G, Livermore PW, Mound J (2016) The observational signature of modelled torsional waves and comparison to geomagnetic jerks. Phys Earth Planet Inter 255:50–65View ArticleGoogle Scholar
 De Santis A, Barraclough DR, Tozzi R (2003) Spatial and temporal spectra of the geomagnetic field and their scaling properties. Phys Earth Planet Inter 135:125–134View ArticleGoogle Scholar
 Domingos J, Jault D, Pais MA, Mandea M (2017) The South Atlantic Anomaly throughout the solar cycle. Earth Planet Sci Lett 473:154–163View ArticleGoogle Scholar
 Dommenget D (2007) Evaluating EOF modes against a stochastic null hypothesis. Clim Dyn 28:517–531View ArticleGoogle Scholar
 Finlay CC, Olsen N, Kotsiaros N, Gillet N, TøffnerClausen L (2016) Recent geomagnetic secular variation from Swarm and ground observatories as estimated in the CHAOS6 geomagnetic field model. Earth Planets Space 68:112. https://doi.org/10.1186/s4062301604861 View ArticleGoogle Scholar
 FraserSmith AC (1987) Centered and eccentric geomagnetic dipoles and their poles, 1600–1985. Rev Geophys 25:1–16View ArticleGoogle Scholar
 Gibson JF, Doyne Farmer J, Casdagli M, Eubank S (1992) An analytic approach to practical state space reconstruction. Physica D 57:1–30View ArticleGoogle Scholar
 Gillet N (2018) Spatial and temporal changes of the geomagnetic field: insights from forward and inverse core field models. In: Geomagnetism, aeronomy and space weather: a journey from the earth’s core to the sun, chapter 9, IAGAGoogle Scholar
 Gillet N, Jault D, Canet E, Fournier A (2010) Fast torsional waves and strong magnetic field within the Earth’s core. Nature 465:74–77View ArticleGoogle Scholar
 Hannachi A, Jolliffe IT, Stephenson DB (2007) Empirical orthogonal functions and related techniques in atmospheric science: a review. Int J Climatol 27:1119–1152View ArticleGoogle Scholar
 Hellio G, Gillet N, Bouligand C, Jault D (2014) Stochastic modelling of regional archaeomagnetic series. Geophys J Int 199:931–943View ArticleGoogle Scholar
 Hulot G, Le Mouël JL (1994) A statistical approach to the Earth’s main magnetic field. Phys Earth Planet Inter 82:167–183View ArticleGoogle Scholar
 James R, Winch DE (1967) The eccentric dipole. Pure Appl Geophys 66:77–86View ArticleGoogle Scholar
 Jolliffe IT (2002) Principal component analysis, 2nd edn. Springer, New YorkGoogle Scholar
 Langel RA, Kerridge DJ, Barraclough DR, Malin SRC (1986) Geomagnetic temporal change: 1903–1982, a spline representation. J Geomag Geoelectr 38:573–597View ArticleGoogle Scholar
 Laundal K, Richmond AD (2017) Magnetic coordinate systems. Space Sci Rev 206(1–4):27–59View ArticleGoogle Scholar
 Leopardi P (2006) A partition of the unit sphere into regions of equal area and small diameter. Electron Trans Numer Anal 25:309–327Google Scholar
 Lesur V, Wardinski I, Rother M, Mandea M (2008) GRIMM: the GFZ Reference Internal Magnetic Model based on vector satellite and observatory data. Geophys J Int 173:382–394View ArticleGoogle Scholar
 Lesur V, Wardinski I, Hamoudi M, Rother M (2010) The second generation of the GFZ Reference Internal Magnetic Model. Earth Planets Space 62:765–773. https://doi.org/10.5047/eps.2010.07.007 View ArticleGoogle Scholar
 Lesur V, Wardinski I, Baerenzung J, Holschneider M (2018) On the frequency spectra of the core magnetic field Gauss coefficients. Phys Earth Planet Inter 276:145–158View ArticleGoogle Scholar
 Lhuillier F, Fournier A, Hulot G, Aubert J (2011) The geomagnetic secularvariation timescale in observations and numerical dynamo models. Geophys Res Lett 38:L09306View ArticleGoogle Scholar
 Livermore PW, Hollerbach R, Finlay CC (2017) An accelerating highlatitude jet in Earth’s core. Nat Geosci 10:62–68View ArticleGoogle Scholar
 Lowes FJ (1994) The geomagnetic eccentric dipole: fact and fallacies. Geophys J Int 118:671–679View ArticleGoogle Scholar
 Malin SRC, Isikara AM (1976) Annual variation of the geomagnetic field. Geophys J R Astron Soc 47:445–457View ArticleGoogle Scholar
 Mandea M, Olsen N (2006) A new approach to directly determine the secular variation from magnetic satellite observations. Geophys Res Lett 33:L15306View ArticleGoogle Scholar
 McKinnon KA, Rhines A, Tingley MP, Huybers P (2016) The changing shape of Northern Hemisphere summer temperature distributions. J Geophys Res Atmos 121:8849–8868View ArticleGoogle Scholar
 Nataf HC, Schaeffer N (2015) Turbulence in the core. In: Schubert G (ed) Treatise on geophysics, chapter 8.06, vol 8. Elsevier, OxfordGoogle Scholar
 North GR, Bell TL, Cahalan RF, Moeng FJ (1982) Sampling errors in the estimation of empirical orthogonal functions. Mon Weather Rev 110:699–706View ArticleGoogle Scholar
 Olsen N, Mandea M (2007) Investigation of a secular variation impulse using satellite data: the 2003 geomagnetic jerk. Earth Planet Sci Lett 255:94–105View ArticleGoogle Scholar
 Pais MA, Alberto P, Pinheiro FJG (2015a) Timecorrelated patterns from spherical harmonic expansions: application to geomagnetism. J Geophys Res Solid Earth 120:8012–8030View ArticleGoogle Scholar
 Pais MA, Morozova AL, Schaeffer N (2015b) Variability modes in core flows inverted from geomagnetic field models. Geophys J Int 200:402–420View ArticleGoogle Scholar
 Sabaka TJ, Olsen N, Purucker E (2004) Extending comprehensive models of the Earth’s magnetic field with Ørsted and CHAMP data. Geophys J Int 159:521–547View ArticleGoogle Scholar
 Sabaka TJ, Hulot G, Olsen N (2010) Mathematical properties relevant to geomagnetic field modeling. In: Freeden W, Nashed M, Sonar T (eds) Handbook of geomathematics, chapter 43. Springer, New York, pp 503–538View ArticleGoogle Scholar
 Saturnino D, Langlais B, Amit H, Civet F, Mandea M, Beucler E (2018) Combining virtual observatory and equivalent source dipole approaches to describe the geomagnetic field with Swarm measurements. Phys Earth Planet Inter 276:118–133View ArticleGoogle Scholar
 Schmidt A (1934) Der magnetische Mittelpunk der Erde und seine Bedeutung. Gerlands Beitr Geophys 41:346–358Google Scholar
 Shore RM, Whaler KA, Macmillan S, Beggan C, Velímský J, Olsen N (2016) Decadal period external magnetic field variations determined via eigenanalysis. J Geophys Res Space Phys 121:5172–5184View ArticleGoogle Scholar
 Silva L, Jackson L, Mound J (2012) Assessing the importance and expression of the 6 year geomagnetic oscillation. J Geophys Res Solid Earth 117:B10101View ArticleGoogle Scholar
 Taira K, Brunton SL, Dawson STM, Rowley CW, Colonius T, McKeon BJ, Schmidt OT, Gordeyev S, Theofilis V, Ukeiley LS (2017) Modal analysis of fluid flows: an overview. AIAA J 55:4013–4041View ArticleGoogle Scholar
 Thébault E, Finlay CC, Beggan CD, Alken P, Aubert J, Barrois O, Bertrand F, Bondar T, Boness A, Brocco L, Canet E, Chambodut A, Chulliat A, Coïsson P, Civet F, Du A, Fournier A, Fratter I, Gillet N, Hamilton B, Hamoudi M, Hulot G, Jager T, Korte M, Kuang W, Lalanne X, Langlais B, Léger JM, Lesur V, Lowes FJ, Macmillan S, Mandea M, Manoj C, Maus S, Olsen N, Petrov V, Ridley V, Rother M, Sabaka TJ, Saturnino D, Schachtschneider R, Sirol O, Tangborn A, TøffnerClausen L, Vigneron P, Wardinski I, Zvereva T (2015) International Geomagnetic Reference Field: the 12th generation. Earth Planets Space 67:79. https://doi.org/10.1186/s4062301502289 View ArticleGoogle Scholar
 Tsyganenko NA, Sitnov MI (2005) Modeling the dynamics of the inner magnetosphere during strong geomagnetic storms. J Geophys Res Space Phys 110:A03208View ArticleGoogle Scholar