Skip to main content

The Mie representation for Mercury’s magnetic field

Abstract

The parameterization of the magnetospheric field contribution, generated by currents flowing in the magnetosphere is of major importance for the analysis of Mercury’s internal magnetic field. Using a combination of the Gauss and the Mie representation (toroidal–poloidal decomposition) for the parameterization of the magnetic field enables the analysis of magnetic field data measured in current carrying regions in the vicinity of Mercury. In view of the BepiColombo mission, the magnetic field resulting from the plasma interaction of Mercury with the solar wind is simulated with a hybrid simulation code and the internal Gauss coefficients for the dipole, quadrupole and octupole field are reconstructed from the data, evaluated along the prospective trajectories of the Mercury Planetary Orbiter (MPO) using Capon’s method. Especially, it turns out that a high-precision determination of Mercury’s octupole field is expectable from the future analysis of the magnetic field data measured by the magnetometer on board MPO. Furthermore, magnetic field data of the MESSENGER mission are analyzed and the reconstructed internal Gauss coefficients are in reasonable agreement with the results from more conventional methods such as the least-square fit.

Introduction

Characterization of Mercury’s internal magnetic field is one of the primary goals of the BepiColombo mission (Benkhoff et al. 2010). The magnetic field in the vicinity of Mercury is composed of internal and external parts. The internal field originates in the dynamo-generated field, crustal remanent field and induction field, which are mainly dominated by the dipole, quadrupole and octupole field. The external field originates from currents flowing in the magnetosphere. For a clear separation of the internal magnetic field from the external field, each part of the magnetic field has to be modeled properly. Especially the parameterization of the external parts is of major importance since these parts contribute a significant amount to the total field within Mercury’s magnetosphere (Anderson et al. 2011).

Above the planetary surface, the internal part of the field is a current-free magnetic field which can be parameterized by the Gauss representation (Gauß 1839; Glassmeier and Tsurutani 2014). When only data in source-free regions without currents are analyzed, the Gauss representation also yields a parametrization for the external parts of the field (Olsen et al. 2010). But in the vicinity of Mercury significant currents are expected (Anderson et al. 2011, eg); and therefore, the Gauss representation is insufficient due to the current-generated magnetic field outside the planet. In this paper, we construct a parametric Mercury magnetic field model by extending the Gauss representation to the Gauss–Mie representation.

Previous studies parameterized the external parts by making use of a global magnetospheric model, such as the paraboloid model for Mercury’s magnetosphere (Alexeev et al. 2008), that has successfully been applied to the analysis of Mercury’s internal and external magnetic field (Alexeev et al. 2010; Johnson et al. 2021), as well as the modified Tsyganenko model (Korth et al. 2004). As an alternative for these models the Mie representation is useful for decomposing the magnetic field in current carrying regions in the vicinity of Mercury (Backus 1986; Olsen 1997; Olsen et al. 2010).

The Mie representation, also known as toroidal-poloidal decomposition, is based on the solenoidality of the magnetic field and enables to decompose the field into its toroidal and poloidal parts (Backus 1986; Backus et al. 1996; Kazantsev and Kardakov 2019). This representation has successfully been applied to several problems in space plasma physics, especially for the analysis of the Earth’s magnetosphere. For example, Engels and Olsen (1999) used the Mie representation for calculating the magnetic effect caused by a three dimensional current density model. Olsen (1997) applied the Mie representation to reconstruct ionospheric F-currents in the Earth’s magnetosphere from MAGSAT data. Furthermore, Bayer et al. (2001) introduced the wavelet Mie representation of the magnetic field to calculate current densities from MAGSAT and CHAMP data. This approach has been expanded on the modeling of the Earth’s magnetic field in terms of vector kernel functions (Mayer and Maier 2006). Kosik (1984) constructed a model for the Earth’s magnetosphere based on the Mie representation.

In this work, the magnetic field in the vicinity of Mercury is parameterized by a combination of the Gauss and the Mie representation (hereafter, Gauss–Mie representation), based on the works of Backus (1986), Backus et al. (1996) and Olsen (1997), for analyzing Mercury’s internal magnetic field. The desired internal Gauss coefficients for the dipole, quadrupole and octupole field are estimated with Capon’s method (Capon 1969). The Capon method serves as a powerful inversion method for linear inverse problems and was used by, e.g., Motschmann et al. (1996), Glassmeier et al. (2001), Narita et al. (2006) and Narita (2012) to evaluate spatial spectra of space plasma waves. As shown by Toepfer et al. (2020a) the Capon method can also be used in a generalized way to compare actual measurements with theoretical models. Here, we expand on this method. First of all, the mathematical foundations of the Mie representation, as derived by Backus (1986) and Backus et al. (1996) are revisited. Afterwards, the resulting thin shell approximation is applied to simulated magnetic field data, that are evaluated along the future data points of the Mercury Planetary Orbiter (MPO). This enables the judgement of the expectable inversion results from the data of the magnetometer (Glassmeier et al. 2010; Heyner et al. 2020) on board MPO. Finally, the model is applied to MESSENGER data and the reconstructed Gauss coefficients are compared with the results of former works by Anderson et al. (2012), Thébault et al. (2018) and Wardinski et al. (2019).

The Mie representation

The solenoidality of the magnetic field \(\underline{B}\) guarantees the existence of a vector potential \(\underline{A}\), so that

$$\begin{aligned} \underline{B}=\partial _{\underline{x}}\times \underline{A} \end{aligned}$$
(1)

holds, where \(\partial _{\underline{x}}\) is the spatial derivative. Using spherical coordinates with radius \(r\in \left[ 0,\infty \right)\), longitude \(\lambda \in \left[ 0,2\pi \right]\) and co-latitude angle \(\theta \in \left[ 0,\pi \right]\) the vector potential can be seperated into its component \(\Psi _T\underline{r}\) parallel to \(\underline{r}=r\,\underline{e}_r\), where \(\underline{e}_r\) is the unit vector in radial direction, and its components \(\underline{F}\times \underline{r}\) perpendicular to \(\underline{r}\), yielding

$$\begin{aligned} \underline{A}=\Psi _T\underline{r}+\underline{F}\times \underline{r}+\partial _{\underline{x}}\varphi \end{aligned}$$
(2)

with the scalar functions \(\Psi _T\) and \(\varphi\) as well as a vector field \(\underline{F}\) (Jacobs 1987; Krause and Rädler 1980). Because of \(\partial _{\underline{x}}\times \partial _{\underline{x}}\varphi =0\), the function \(\varphi\) can be chosen properly so that \(\partial _{\underline{x}}\times \underline{F}=0\) holds, and therefore

$$\begin{aligned} \underline{F}=\partial _{\underline{x}}\Psi _P \end{aligned}$$
(3)

with a scalar function \(\Psi _P\) without changing the magnetic field (Jacobs 1987; Krause and Rädler 1980). The vector potential results in

$$\begin{aligned} \underline{A}&=\Psi _T\underline{r}+\partial _{\underline{x}}\Psi _P\times \underline{r} \end{aligned}$$
(4)
$$\begin{aligned}&=\Psi _T\underline{r}+\partial _{\underline{x}}\times \left( \Psi _P\underline{r} \right) . \end{aligned}$$
(5)

Substituting \(\underline{A}\) into Eq. (1) delivers

$$\begin{aligned} \underline{B}=\partial _{\underline{x}}\times \left( \Psi _T\underline{r} \right) +\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \Psi _P\underline{r} \right) \right] , \end{aligned}$$
(6)

which is called the Mie representation of the magnetic field (Backus 1986; Backus et al. 1996).

The first term on the right hand side in Eq. (6)

$$\begin{aligned} \underline{B}_T=\partial _{\underline{x}}\times \left( \Psi _T\underline{r} \right) \end{aligned}$$
(7)

is the toroidal part of the field and the second term

$$\begin{aligned} \underline{B}_P=\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \Psi _P\underline{r} \right) \right] \end{aligned}$$
(8)

is the poloidal part of \(\underline{B}\).

From the definition of \(\underline{B}_T\) and \(\underline{B}_P\) it is clear that \(\underline{B}_T\) is perpendicular to \(\underline{B}_P\) and also perpendicular to \(\underline{r}\). Therefore, the toroidal part of the field does not have a radial component. Furthermore, poloidal magnetic fields are generated by toroidal currents and vice versa:

$$\begin{aligned} \partial _{\underline{x}}\times \underline{B}_T=\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \underline{r}\Psi _T \right) \right] \end{aligned}$$
(9)

is a poloidal vector and

$$\begin{aligned} \partial _{\underline{x}}\times \underline{B}_P=\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \underline{r}\Psi _P \right) \right] \right] =\partial _{\underline{x}}\times \left[ \left( -\partial _{\underline{x}}^2\Psi _P\right) \underline{r} \right] \end{aligned}$$
(10)

is a toroidal vector.

In curl-free regions where especially the poloidal current density \(\underline{j}_P\) vanishes, Ampère’s law reads as follows:

$$\begin{aligned} \mu _0 \underline{j}_P=\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \Psi _T \underline{r} \right) \right] =-\underline{r}\partial _{\underline{x}}^2\Psi _T+\partial _{\underline{x}}\partial _r\left( r\Psi _T \right) =0 \end{aligned}$$
(11)

or equivalently

$$\begin{aligned} \partial _{\underline{x}}\partial _r\left( r\Psi _T \right) =\underline{r}\partial _{\underline{x}}^2\Psi _T. \end{aligned}$$
(12)

Therefore, the gradient of \(\partial _r\left( r\Psi _T \right)\) has only a radial component, leaving us with

$$\begin{aligned} \partial _\lambda \Psi _T=0=\partial _\theta \Psi _T , \end{aligned}$$
(13)

so that the function \(\Psi _T\) solely depends on the radial distance from the center \(\left( \Psi _T=\Psi _T(r)\right)\) and thus, the toroidal magnetic field vanishes

$$\begin{aligned} \underline{B}_T=\partial _{\underline{x}}\times \left( \Psi _T \underline{r} \right) =\left( \partial _{\underline{x}}\Psi _T \right) \times \underline{r}=\partial _r\Psi _T\underline{e}_r\times \underline{r}=0. \end{aligned}$$
(14)

On the other hand, when the toroidal current density vanishes

$$\begin{aligned} \mu _0\underline{j}_T&=\partial _{\underline{x}}\times \partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \Psi _P\underline{r} \right) \right] =\partial _{\underline{x}}\times \left[ -r\left( \partial _{\underline{x}}^2\Psi _P\right) \underline{e}_r \right] \end{aligned}$$
(15)
$$\begin{aligned}&=\frac{1}{r\sin (\theta )}\partial _\varphi \left[ -r\partial _{\underline{x}}^2\Psi _P \right] \underline{e}_\theta -\frac{1}{r}\partial _\theta \left[ -r\partial _{\underline{x}}^2\Psi _P \right] \underline{e}_\varphi \end{aligned}$$
(16)
$$\begin{aligned}&=0 \end{aligned}$$
(17)

or equivalently

$$\begin{aligned} \partial _\varphi \partial _{\underline{x}}^2\Psi _P=0 \end{aligned}$$
(18)

and simulanously

$$\begin{aligned} \partial _\theta \partial _{\underline{x}}^2\Psi _P=0 \end{aligned}$$
(19)

the function \(\partial _{\underline{x}}^2\Psi _P\) solely depends on the radius. Therefore, the poloidal field

$$\begin{aligned} \underline{B}_P=\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \Psi _P\underline{r} \right) \right] =-\underline{r}\partial _{\underline{x}}^2\Psi _P+\partial _{\underline{x}}\partial _r\left( r\Psi _P \right) \ne 0 \end{aligned}$$
(20)

in general remains finite in current-free regions.

Relation to the Gauss representation

When magnetic field data in curl-free regions (where \(\partial _{\underline{x}}\times \underline{B}=0\) holds) are analyzed, there exists a scalar potential \(\Phi\) so that the magnetic field can be written as

$$\begin{aligned} \underline{B}=-\partial _{\underline{x}}\Phi =-\partial _r\Phi \,\underline{e}_r-\frac{1}{r}\partial _\theta \Phi \,\underline{e}_\theta -\frac{1}{r\sin (\theta )}\partial _\lambda \Phi \,\underline{e}_\lambda , \end{aligned}$$
(21)

which is known as the Gauss representation of the magnetic field (Gauß 1839; Glassmeier and Tsurutani 2014). Simultanously, the Mie representation in due consideration that the toroidal magnetic field vanishes in curl-free regions is given by

$$\begin{aligned} \underline{B}&=\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \Psi _P\underline{r} \right) \right] \end{aligned}$$
(22)
$$\begin{aligned}&=\frac{1}{r\sin (\theta )}\left[ -\partial _\theta \left( \sin (\theta )\partial _\theta \Psi _P \right) -\frac{1}{\sin (\theta )}\partial _\lambda ^2\Psi _P \right] \underline{e}_r \end{aligned}$$
(23)
$$\begin{aligned}&\quad +\frac{1}{r} \partial _\theta \partial _r\left( r\Psi _P \right) \underline{e}_\theta +\frac{1}{r\sin (\theta )} \partial _\lambda \partial _r\left( r\Psi _P \right) \underline{e}_\lambda \end{aligned}$$
(24)

Comparison of coefficients with Eq. (21) for \(\underline{e}_\theta\) shows that

$$\begin{aligned} \frac{1}{r}\partial _r\partial _\theta \left( r\Psi _P \right) =-\frac{1}{r}\partial _\theta \Phi \end{aligned}$$
(25)

and analogously for \(\underline{e}_\lambda\)

$$\begin{aligned} \frac{1}{r\sin (\theta )} \partial _r\partial _\lambda \left( r\Psi _P \right) =-\frac{1}{r\sin (\theta )}\partial _\lambda \Phi . \end{aligned}$$
(26)

Consequently, when \(\Psi _P\) is known, the scalar potential is given by

$$\begin{aligned} \Phi =-\partial _r\left( r\Psi _P \right) . \end{aligned}$$
(27)

Comparison of the \(\underline{e}_r\)-coefficients delivers

$$\begin{aligned} \frac{1}{r\sin (\theta )}\left[ \partial _\theta \left( \sin (\theta )\partial _\theta \Psi _P \right) +\frac{1}{\sin (\theta )}\partial _\lambda ^2\Psi _P \right] =\partial _r\Phi \end{aligned}$$
(28)

or equivalently

$$\begin{aligned} \partial _S^2\Psi _P=\frac{1}{r}\partial _r\Phi , \end{aligned}$$
(29)

where

$$\begin{aligned} \partial _S^2=\frac{1}{r^2\sin (\theta )}\partial _\theta \left( \sin (\theta )\partial _\theta \ \right) +\frac{1}{r^2\sin ^2(\theta )}\partial _\lambda ^2 \end{aligned}$$
(30)

is the angular part of the Laplacian. For a given \(\Phi\), the function \(\Psi _P\) can be found by solving Eqs. (27, 29) simultanously.

As a consequence, the scalar function \(\Psi _P\) and the scalar potential \(\Phi\) are equivalent in curl-free regions and the Mie representation transists into the Gauss representation. Thus, the Gauss representation can be understood as a special case of the Mie representation.

Parameterization of the magnetic field

Assuming that the conductivity of Mercury’s mantle is negligibly small (like lunar regolith (Zharkova et al. 2020)), the planetary contribution to the field outside Mercury is purely poloidal. The currents flowing in the magnetosphere generate poloidal and toroidal magnetic fields that superpose with the curl-free planetary magnetic field. To be able to separate the planetary magnetic field out of the measured field and to parameterize it via the Gauss coefficients, a combined parametrization composed of the Mie and the Gauss representation (Gauss–Mie representation), which is based on the works of Backus (1986) and Olsen (1997) is necessary.

Suppose that the magnetic field in the vicinity of Mercury is measured within a spherical shell \(S(a,c)\) with inner radius \(a>R_\text {M}\), where \(R_\text {M}\) indicates the radius of Mercury, outer radius c and mean radius \(b=\frac{1}{2}\left( a+c \right)\) as displayed in Fig. 1. The shell can be constructed independently of the orbit’s geometry by conceptually covering the orbit of the spacecraft. Furthermore, the shell may include current carrying regions. Although the Mie representation enables us to analyze those currents, we focus on the analysis of Mercury’s internal magnetic field.

Fig. 1
figure 1

Cross section of the spherical shell \(S(a,c)\) with inner radius \(a\), outer radius c and mean radius \(b=(a+c)/2\). The inner sphere with radius \(R_\text {M}\) symbolizes Mercury

Due to the underlying geometry, the space around Mercury can be decomposed into three disjoint radial zones:

  • points in the region \(r<a\) below the shell

  • points in the region \(r>c\) above the shell

  • points in the region \(a \le r \le c\) inside the shell layer

Making use of the superposition principle the total magnetic field \(\underline{B}\) measured inside the shell layer (\(a \le r \le c\)) is a composition of the field \(\underline{B}_{\underline{j}\in \left[ a,c \right] }\) generated by currents flowing inside the shell and the field \(\underline{B}_{\underline{j}\notin \left[ a,c \right] }\) generated by currents flowing outside the shell. Again considering the underlying geometry, the second part can be divided into an internal part \(\underline{B}^i\) resulting from currents flowing in the region \(r<a\) and an external part \(\underline{B}^e\) resulting from currents flowing in the region \(r>c\), so that

$$\begin{aligned} \underline{B}_{\underline{j}\notin \left[ a,c \right] }=\underline{B}^i+\underline{B}^e. \end{aligned}$$
(31)

As \(\underline{B}^i\) and \(\underline{B}^e\) have their sources beyond the shell they are purely poloidal and especially nonrotational within the shell. Thus, there exist scalar potentials \(\Phi ^i\) and \(\Phi ^e\) so that the field can be parameterized in the shell via the Gauss representation resulting in

$$\begin{aligned} \underline{B}^i= -\; \partial _{\underline{x}}\Phi ^i \end{aligned}$$
(32)

and

$$\begin{aligned} \underline{B}^e= - \; \partial _{\underline{x}}\Phi ^e, \end{aligned}$$
(33)

where the scalar potentials are given by (Gauß 1839; Glassmeier and Tsurutani 2014)

$$\begin{aligned} & \Phi ^i=R_\text {M}\sum _{l=1}^\infty \sum _{m=0}^l\left( \frac{R_\text {M}}{r} \right) ^{l+1} \left[ g_l^m\,\cos (m \lambda )+ h_l^m\,\sin (m \lambda )\right] P_l^m\left( \cos (\theta ) \right) \end{aligned}$$
(34)

and

$$\begin{aligned} &\Phi ^e=R_\text {M}\sum _{l=1}^\infty \sum _{m=0}^l\left( \frac{r}{R_\text {M}} \right) ^{l}\left[ q_l^m\,\cos (m \lambda )+ s_l^m\,\sin (m \lambda )\right] P_l^m\left( \cos (\theta ) \right) . \end{aligned}$$
(35)

The expansion coefficients \(g_l^m\) and \(h_l^m\) are the internal Gauss coefficients, the coefficients \(q_l^m\) and \(s_l^m\) are the external Gauss coefficients and \(P_l^m\) are the Schmidt normalized Legendre polynomials of degree l and order m. Since Mercury’s internal magnetic field is dominated by the internal dipole, quadrupole and octupole fields, the series expansions in Eqs. (34, 35) will be truncated at the degree \(l=3\) for the practical application.

It should be noted that the internal field \(\underline{B}^i\) is canonically described in a Mercury-Body-Fixed co-rotating coordinate system (MBF), whereas the external field \(\underline{B}^e\) is canonically described in a Mercury-Solar-Orbital system (MSO) with the x-axis orientated towards the sun, the z-axis orientated parallel to the rotation axis, i.e. antiparallel to the internal dipole moment, and the y-axis completes the right-handed system (Heyner et al. 2020). Let \(\underline{x}_{\text {MSO}}=\left( x_{\text {MSO}},y_{\text {MSO}},z_{\text {MSO}}\right) ^T\) define the MSO coordinate system and let \(\underline{x}_{\text {MBF}}=\left( x_{\text {MBF}},y_{\text {MBF}},z_{\text {MBF}}\right) ^T\) be the coordinates of the co-rotating MBF system. Then, the internal parts of the field are given by

$$\begin{aligned} \underline{B}^i(\underline{x}_{\text {MBF}})=-\partial _{\underline{x}_{\text {MBF}}}\Phi ^i(\underline{x}_{\text {MBF}})=\underline{\underline{H}}^i(\underline{x}_{\text {MBF}})\,\underline{g}^i, \end{aligned}$$
(36)

whereas the external parts are described in the MSO system, i.e.

$$\begin{aligned} \underline{B}^e(\underline{x}_{\text {MSO}})=-\partial _{\underline{x}_{\text {MSO}}}\Phi ^e(\underline{x}_{\text {MSO}})=\underline{\underline{H}}^e(\underline{x}_{\text {MSO}})\,\underline{g}^e, \end{aligned}$$
(37)

where the terms of the series expansion are arranged into the matrices \(\underline{\underline{H}}^i\) and \(\underline{\underline{H}}^e\) and the corresponding Gauss coefficients are summarized into the vectors \(\underline{g}^i\) and \(\underline{g}^e\).

The co-rotating MBF system can be transformed into the MSO system via

$$\begin{aligned} \underline{x}_{\text {MSO}}=\underline{\underline{A}}\, \underline{x}_{\text {MBF}}, \end{aligned}$$
(38)

where \(\underline{\underline{A}}\) describes a rotation matrix around the z-axis depending on the angular velocity of Mercury’s self-rotation measured within the MSO system.

For the practical application it is convenient to describe both parts of the field in one coordinate system, for example the MSO system. The transformed data are given by

$$\begin{aligned} \underline{B}\left( \underline{x}_{\text {MSO}}\right)&=\underline{\underline{A}}\,\underline{B}^i\left( \underline{x}_{\text {MBF}}\right) +\underline{B}^e\left( \underline{x}_{\text {MSO}}\right) \end{aligned}$$
(39)
$$\begin{aligned}&=\underline{\underline{A}}\,\underline{\underline{H}}^i(\underline{x}_{\text {MBF}})\,\underline{g}^i+\underline{\underline{H}}^e(\underline{x}_{\text {MSO}})\,\underline{g}^e \end{aligned}$$
(40)

in the MSO system.

Since for the first validation the model will be applied to simulated magentic field data, it is useful to match the coordinate system of the parametrization with the coordinate system of the simulation. Therefore, in the following all parts of the magnetic field are described in a Mercury-Body-Fixed anti-solar orientated coordinate system (MASO) with coordinates \(\underline{x}=(x,y,z)^T\), where the x-axis is orientated towards the nightside of Mercury (away from the sun), the z-axis is orientated antiparallel to the internal dipole moment and the y-axis completes the right-handed system, so that

$$\underline{x} = \left[ {\begin{array}{*{20}c} { - 1} & 0 & 0 \\ 0 & { - 1} & 0 \\ 0 & 0 & 1 \\ \end{array} } \right]\underline{x} _{{{\text{MSO}}}} .$$
(41)

As already mentioned in the introduction ("Introduction" section) there is no current-free shell-like region around Mercury (Olsen et al. 2010). The currents flowing in the shell generate toroidal \(\underline{B}_T^{sh}\) and poloidal \(\underline{B}_P^{sh}\) magnetic fields which superpose with \(\underline{B}^i\) and \(\underline{B}^e\). Thus, the total measured field within the shell is composed of four parts given by

$$\begin{aligned} \underline{B}&=\underline{B}_{\underline{j}\notin \left[ a,c \right] }+\underline{B}_{\underline{j}\in \left[ a,c \right] } =\underline{B}^i+ \underline{B}^e+\underline{B}^{sh}_T+\underline{B}^{sh}_P \end{aligned}$$
(42)
$$\begin{aligned}&=-\partial _{\underline{x}}\Phi ^i-\partial _{\underline{x}}\Phi ^e+\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \underline{r}\Psi _P^{sh} \right) \right] +\partial _{\underline{x}}\times \left( \underline{r}\Psi _T^{sh} \right) , \end{aligned}$$
(43)

where each part of the field is described either by a scalar potential \(\Phi ^i\), \(\Phi ^e\) or a scalar function \(\Psi _P^{sh}\), \(\Psi _T^{sh}\).

The scalar potentials \(\Phi ^i\) and \(\Phi ^e\) are already parameterized by the Gauss coefficients. In the following, a proper parameterization for the scalar functions \(\Psi _T^{sh}\) and \(\Psi _P^{sh}\) is required. Because of the underlying spherical geometry it is straightforward to expand the functions into spherical harmonics

$$\begin{aligned} \Psi _T^{sh}=\sum _{l=1}^\infty \sum _{m=0}^l\left[ a_l^m(r)\,\cos (m \lambda )+ b_l^m(r)\,\sin (m \lambda )\right] P_l^m\left( \cos (\theta ) \right) \end{aligned}$$
(44)

and

$$\begin{aligned} \Psi _P^{sh}=R_\text {M}\sum _{l=1}^\infty \sum _{m=0}^l\left[ c_l^m(r)\,\cos (m \lambda )+ d_l^m(r)\,\sin (m \lambda )\right] P_l^m\left( \cos (\theta ) \right) , \end{aligned}$$
(45)

where \(a_l^m(r)\), \(b_l^m(r)\), \(c_l^m(r)\) and \(d_l^m(r)\) are the expansion coefficients which in general depend on the radius r and again \(P_l^m\) are the Schmidt normalized Legendre polynomials. Since the toroidal and poloidal fields can be locally generated by currents flowing in the shell the radial dependences of the fields and the expansion coefficients, respectively, are unknown.

Series expansion of the coefficients

Since the exact radial dependence of the expansion coefficients \(a_l^m(r)\) and \(b_l^m(r)\) is unknown, it is useful to expand these functions into a Taylor series in the vicinity of the mean radius b of the shell. Within this series expansion it is advisable not to incorporate the effect of all components of the poloidal current density to the toroidal magnetic field at once. Here, we first concentrate on the radial component of the current density and consider the horizontal components in higher orders of the Taylor series.

The toroidal magnetic field \(\underline{B}_T^{sh}\) is generated by poloidal currents \(\underline{j}_P\) (cf. Eq. (11)). Ampère’s law yields

$$\begin{aligned} \mu _0\underline{j}_P&=\partial _{\underline{x}}\times \underline{B}_T^{sh}=\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \underline{r}\Psi _T^{sh} \right) \right] \end{aligned}$$
(46)
$$\begin{aligned}&=\frac{1}{r\sin (\theta )}\left[ - \partial _\theta \left( \sin (\theta )\partial _\theta \Psi _T^{sh}\right) -\frac{1}{\sin (\theta )}\partial _\lambda ^2\Psi _T^{sh} \right] \underline{e}_r \end{aligned}$$
(47)
$$\begin{aligned}&\quad +\frac{1}{r}\partial _\theta \partial _r\left( r\Psi _T^{sh} \right) \underline{e}_\theta +\frac{1}{r\sin (\theta )}\partial _\lambda \partial _r\left( r\Psi _T^{sh} \right) \underline{e}_\lambda . \end{aligned}$$
(48)

The components of the horizontal \(\underline{e}_\theta\)- and \(\underline{e}_\lambda\)-direction are proportional to \(\partial _r\left( r\Psi _T^{sh} \right)\). Therefore, the ansatz

$$\begin{aligned} a_l^m(r)=\frac{R_\text {M}}{r}\,\left[ a_l^m+a_{\,l}^{\prime \,m}\rho +{\mathcal {O}}(\rho ^2) \right] \end{aligned}$$
(49)

and

$$\begin{aligned} b_l^m(r)=\frac{R_\text {M}}{r}\,\left[ b_l^m+b_{\,l}^{\prime \,m}\rho +{\mathcal {O}}(\rho ^2) \right] , \end{aligned}$$
(50)

where \(\rho =\frac{ r-b }{R_\text {M}}\) and \(a_l^m\), \(a_{\,l}^{\prime \,m}\), \(b_l^m\), \(b_{\,l}^{\prime \,m}\) are constants for each pair of l and m, is utilized. In the first order of the Taylor series expansion in the vicinity of the mean radius b, where \(\Psi _T^{sh}\sim \frac{1}{r}\), the horizontal components of \(\underline{j}_P\) vanish and only the contributions of the radial currents driving the toroidal magnetic field are considered (Olsen 1997). Using higher orders of the Taylor series, also the contributions of the horizontal components of \(\underline{j}_P\) to the toroidal magnetic field in the vicinity of the mean radius b can be incorporated.

The scalar function of the toroidal magnetic field results in

$$\begin{aligned} \Psi _T^{sh}=\frac{R_\text {M}}{r}\sum _{l=1}^\infty \sum _{m=0}^l\left[ \alpha _l^m\ +\alpha _{\,l}^{\prime \,m}\,\rho +{\mathcal {O}}(\rho ^2)\right] P_l^m\left( \cos (\theta ) \right) , \end{aligned}$$
(51)

where \(\alpha _l^m=a_l^m\,\cos (m \lambda )+ b_l^m\,\sin (m \lambda )\) and \(\alpha _l^{\prime \,m}=a_{\,l}^{\prime \,m}\,\cos (m \lambda )+ b_{\,l}^{\prime \,m}\,\sin (m \lambda )\) (Olsen 1997). Thereby, each order of the Taylor series is linked with an additional set of expansion coefficients \(a_l^m\), \(b_l^m\), \(a_{\,l}^{\prime \,m}\), \(b_{\,l}^{\prime \,m}\) and so on which can be reconstructed from the data in analogy to the Gauss coefficients.

From a mathematical point of view the scalar function \(\Psi _P^{sh}\) of the poloidal magnetic field \(\underline{B}_P^{sh}\) can be parametrized analogously to the toroidal counterpart. But within the reconstruction procedure the poloidal fields that are generated by toroidal currents flowing inside the shell cannot be distinguished from the internally and externally driven poloidal fields, since these fields follow the same topological structure. But when the half thickness of the shell, defined by \(h=(c-a)/2\) is smaller than the length scale on which the toroidal currents change in radial direction, the shell is called a thin shell and the scalar function \(\Psi _P^{sh}\) of the poloidal field \(\underline{B}_P^{sh}\) vanishes within this thin shell approximation (Backus 1986; Backus et al. 1996) as illustrated in the following section.

The thin shell approximation

The thin shell approximation (Backus 1986; Backus et al. 1996) finally allows the separation of the poloidal field into its internal and external contributions.

Conferring to Eq. (6), the Mie representation for the magnetic field in the whole space \({\mathbb {R}}^3\) is given by

$$\begin{aligned} \underline{B}=\partial _{\underline{x}}\times \left( \Psi _T\underline{r} \right) +\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \Psi _P\underline{r} \right) \right] . \end{aligned}$$
(52)

Following Ampère’s law the current density \(\underline{j}\) is also solenoidal and can as well be parameterized via the Mie representation resulting in

$$\begin{aligned} \mu _0\underline{j}=\partial _{\underline{x}}\times \left( \Gamma _T\underline{r} \right) +\partial _{\underline{x}}\times \left[ \partial _{\underline{x}}\times \left( \Gamma _P\underline{r} \right) \right] \end{aligned}$$
(53)

with related scalar functions \(\Gamma _T\) and \(\Gamma _P\).

Since the poloidal part of the current density corresponds with the curl of the toroidal magnetic field, the comparision of Eq. (53) with Eq. (9) shows that the scalar function \(\Psi _T\) and \(\Gamma _P\) are the same

$$\begin{aligned} \Psi _T=\Gamma _P. \end{aligned}$$
(54)

Analogously, the toroidal part of the current density corresponds with the curl of the poloidal magnetic field and the comparision of Eq. (53) with Eq. (10) shows that the functions \(\Psi _P\) and \(\Gamma _T\) are related via

$$\begin{aligned} \partial _{\underline{x}}^2\Psi _P=-\Gamma _T, \end{aligned}$$
(55)

so that the function \(\Psi _P\) is given by the Green’s function method

$$\begin{aligned} \Psi _P(\underline{r})=\frac{1}{4\pi }\int _{{\mathbb {R}}^3}\frac{\Gamma _T(\underline{r}^{\prime })}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }. \end{aligned}$$
(56)

Due to the underlying geometry the toroidal current density \(\underline{j}_T\) flowing in the whole space can be written as the sum of the toroidal currents \(\underline{j}_T^i\) flowing in the region \(r<a\), the toroidal currents \(\underline{j}_T^e\) flowing in the region \(r>c\) and the toroidal currents \(\underline{j}_T^{sh}\) flowing inside the spherical shell in the region \(a\le r \le c\), so that

$$\begin{aligned} \underline{j}_T=\underline{j}_T^i+\underline{j}_T^e+\underline{j}_T^{sh}. \end{aligned}$$
(57)

Thereby, the Mie representation of each part is given by

$$\begin{aligned}&\underline{j}_T^i=\partial _{\underline{x}}\times \left( \Gamma _T^i\underline{r}\right) \end{aligned}$$
(58)
$$\begin{aligned}&\underline{j}_T^e=\partial _{\underline{x}}\times \left( \Gamma _T^e\underline{r}\right) \end{aligned}$$
(59)
$$\begin{aligned}&\underline{j}_T^{sh}=\partial _{\underline{x}}\times \left( \Gamma _T^{sh}\underline{r}\right) \end{aligned}$$
(60)

with \(\Gamma _T^i=\Gamma _T\,\chi _{\left[ r<a \right] }\), \(\Gamma _T^e=\Gamma _T\,\chi _{\left[ r>c \right] }\) and \(\Gamma _T^{sh}=\Gamma _T\,\chi _{\left[ a<r<c \right] }\), where

$$\begin{aligned} \chi _I(x)=\left\{ \begin{array}{ll} 1 &{} ,\, x\in I\\ 0 &{} ,\, x\notin I \end{array}\right. \end{aligned}$$
(61)

is the indicator function of the interval I. Using this segmentation the scalar function of the poloidal field can be rewritten as

$$\begin{aligned} \Psi _P(\underline{r})&= \frac{1}{4\pi }\int _{V(r{\prime }<a)}\frac{\Gamma _T^i(\underline{r}^{\prime })}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }\nonumber \\&\quad +\frac{1}{4\pi }\int _{V(r{\prime }>a)}\frac{\Gamma _T^e(\underline{r}^{\prime })}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }\nonumber \\&\quad +\frac{1}{4\pi }\int _{S(a,c)}\frac{\Gamma _T^{sh}(\underline{r}^{\prime })}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }. \end{aligned}$$
(62)

Thus, the part of the scalar function that corresponds to the poloidal magnetic field which is generated inside the shell is given by

$$\begin{aligned} \Psi ^{sh}_P(\underline{r})=\frac{1}{4\pi }\int _{S(a,c)}\frac{\Gamma _T^{sh}(\underline{r}^{\prime })}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }. \end{aligned}$$
(63)

Since \(2b=a+c\) and \(2h=c-a\) the bounds of integration can be rewritten as \(a=b-h\) and \(c=b+h\), so that

$$\begin{aligned} \Psi _P^{sh}(\underline{r})=\frac{1}{4\pi }\int _{S(b-h,b+h)}\frac{\Gamma _T^{sh}(\underline{r}^{\prime })}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }. \end{aligned}$$
(64)

Analogously to the scalar function \(\Psi _T\), the function \(\Gamma _T\) can be expanded into a Taylor series in the vicinity of the mean radius b, resulting in

$$\begin{aligned} \Gamma _T^{sh}(\underline{r}^\prime )=\sum _{n=0}^\infty \frac{1}{n!}\partial ^n_{r^\prime } \Gamma _T^{sh}\big |_{r^\prime =b}(r^\prime -b)^n. \end{aligned}$$
(65)

Substituting the Taylor series into the function \(\Psi _P^{sh}\) delivers

$$\begin{aligned} \Psi _P^{sh}(\underline{r})=\frac{1}{4\pi }\sum _{n=0}^\infty \frac{1}{n!}\int _{S(b-h,b+h)}\partial ^n_{r^\prime } \Gamma _T^{sh}\big |_{r^\prime =b}\frac{(r^\prime -b)^n}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }. \end{aligned}$$
(66)

For the further evaluation of the integral it is assumed that the derivatives of the toroidal currents with respect to r are bounded, i.e., there exists a constant \(L>0\), so that

$$\begin{aligned} L^n \big | \partial ^n_r \underline{j}_T \big |\le \big | \underline{j}_T \big | \end{aligned}$$
(67)

for \(n\in {\mathbb {N}}\). Thus, L represents the length scale on which the toroidal currents change in radial direction. From

$$\begin{aligned} \underline{j}_T&=\partial _{\underline{x}}\times \left( \Gamma _T\underline{r} \right) \\ &=\frac{1}{\sin (\theta )}\partial _\lambda \Gamma _T\,\underline{e}_\theta -\partial _\theta \Gamma _T \,\underline{e}_\lambda\\ & =\left( \frac{1}{\sin (\theta )}\underline{e}_\theta \partial _\lambda -\underline{e}_\lambda \partial _\theta \right) \Gamma _T \end{aligned}$$
(68)

and therefore

$$\begin{aligned} \partial _r^n\underline{j}_T=\left( \frac{1}{\sin (\theta )}\underline{e}_\theta \partial _\lambda -\underline{e}_\lambda \partial _\theta \right) \partial _r^n \Gamma _T \end{aligned}$$
(69)

it follows that

$$\begin{aligned} L^n \big | \partial ^n_r \Gamma _T^{sh} \big |\le \big | \Gamma _T^{sh} \big | \end{aligned}$$
(70)

or equivalently

$$\begin{aligned} \big | \partial ^n_r \Gamma _T^{sh} \big |\le \frac{\big | \Gamma _T^{sh} \big |}{L^n}. \end{aligned}$$
(71)

Since \(r^\prime \in \left[ b-h,b+h \right]\), each summand within the Taylor series can be estimated upwards via

$$\begin{aligned} \big |\partial ^n_{r^\prime } \Gamma _T^{sh}(b)\big |\cdot \big |r^\prime -b\big |^n\le \frac{\big | \Gamma _T^{sh}(b)\big |}{L^n}\cdot h^n \end{aligned}$$
(72)

delivering for Eq. (66)

$$\begin{aligned} &\big | \Psi _P^{sh}(\underline{r}) \big |\le \frac{1}{4\pi }\sum _{n=0}^\infty \frac{1}{n!}\left( \frac{h}{L} \right) ^n\,\max _{\theta ^\prime ,\lambda ^\prime }\big \{\big | \Gamma _T^{sh}(b)\big | \big \}\int _{S(b-h,b+h)}\frac{1}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }. \end{aligned}$$
(73)

The integral in Eq. (73) may be evaluated for any \(\underline{r}\in {\mathbb {R}}^3\) but we restrict to \(\underline{r}\) inside the shell as only there the magnetic field is measured. Then, the remaining integral results in

$$\begin{aligned}\begin{aligned} \int _{S(b-h,b+h)}\frac{1}{\big | \underline{r}-\underline{r^{\prime }} \big |}\,\text {d}^3r^{\prime }=&\frac{4\pi }{3}\,\frac{1}{r}\left[ r^3-(b-h)^3 \right] +2\pi \left[ (b+h)^2-r^2 \right] , \end{aligned}\end{aligned}$$
(74)

utilizing that the coordinate system can be chosen properly so that \(\theta ^\prime\) defines the angle between \(\underline{r}\) and \(\underline{r}^\prime\). Therefore

$$\begin{aligned} |\Psi _{P}^{{sh}} (\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{r} )| \le \sum\limits_{{n = 0}}^{\infty } {\frac{1}{{n!}}} \left( {\frac{h}{L}} \right)^{n} {\mkern 1mu} \max _ {\theta ^{\prime } ,\lambda ^{\prime }} \left\{ {|\Gamma _{T}^{{sh}} (b)|} \right\}\left\{ {\frac{1}{3}\left[ {r^{2} - \frac{{(b - h)^{3} }}{r}} \right] + \frac{1}{2}\left[ {(b + h)^{2} - r^{2} } \right]} \right\} . \\ \end{aligned}$$
(75)

For all \(r\in \left[ b-h,b+h \right]\), the function

$$\begin{aligned} f(r)=\frac{1}{3}\left[ r^2-\frac{(b-h)^3}{r} \right] +\frac{1}{2}\left[ (b+h)^2-r^2\right] \end{aligned}$$
(76)

is non-negative and reaches its maximum value at \(r=b-h\) with the related function value \(f(b-h)=2bh\). Therefore, an upper bound for \(\Psi _P^{sh}\) can finally be estimated as

$$\begin{aligned} \big | \Psi _P^{sh}(\underline{r}) \big |&\le 2bh \sum _{n=0}^\infty \frac{1}{n!}\left( \frac{h}{L} \right) ^n\,\max _{\theta ^\prime ,\lambda ^\prime }\big \{\big | \Gamma _T^{sh}(b)\big | \big \} \\ &=2bL\sum _{n=0}^\infty \frac{1}{n!}\left( \frac{h}{L} \right) ^{n+1}\,\max _{\theta ^\prime ,\lambda ^\prime }\big \{\big | \Gamma _T^{sh}(b)\big | \big \} . \end{aligned}$$
(77)

When \(h\ll L\), the spherical shell is called a thin shell (Backus 1986; Backus et al. 1996) and the scalar function \(\Psi _P^{sh}\) of the poloidal magnetic field \(\underline{B}_P^{sh}\) vanishes. The scalar function \(\Psi _T^{sh}\), however, remains finite for all h as shown in Appendix A. Thus, if the shell may be regarded as thin, then the contribution of the toroidal currents in this shell to the poloidal magnetic field may be neglected. The poloidal magnetic field is mainly driven by currents beyond the shell. The contribution of the poloidal currents in this shell to the toroidal magnetic field may not be neglected.

From a first point of view the thin shell approximation is not an intuitive approximation. Considering the above presented nature of poloidal and toroidal magnetic fields it can be understood as follows:

The toroidal magnetic field only exists within current carrying regions (cf. Eq. 14) and thus, it is measurable only within these regions. Therefore, the spatial extent of the regions where the poloidal currents flow does not influence the strength of the toroidal magnetic field. It solely depends on the strength of the poloidal current density. In contrast to the toroidal magnetic field, the poloidal magnetic field is also measurable in current-free region. Thus, the poloidal field is a superposition of fields generated by currents flowing inside and outside the shell as well as currents flowing within the shell. This superposition is verified in Eq. (62). Therefore, the amount of the poloidal field generated by currents flowing within the shell has to be compared with the amount of the internal/external contributions. Furthermore, the poloidal field does not solely depend on the strength of the toroidal current density, since for the evaluation of the integrals also the volume where the current density flows is vital. Thus, a small toroidal current density that flows within a large volume outside/inside the shell can have a larger contribution to the field measured within the shell than a stronger current flowing within the thin shell.

Application of the thin shell approximation

The thin shell approximation is applied to parameterize the magnetic field in the vicinity of Mercury by the Gauss–Mie representation to reconstruct Mercury’s internal magnetic field. The internal and external scalar potentials \(\Phi ^i\) and \(\Phi ^e\) are expanded into spherical harmonics up to third degree and order, representing the internal and external dipole, quadrupole and octupole fields. The scalar function \(\Psi _T^{sh}\) of the toroidal magnetic field \(\underline{B}_T^{sh}\) is expanded into spherical harmonics up to second degree and order and additionally into a first order Taylor series for the radius. The scalar function \(\Psi _P^{sh}\) of the poloidal field \(\underline{B}_P^{sh}\) is neglected within the thin shell approximation. Therefore, the total field is parameterized by 46 expansion coefficients, i.e. Gauss internal dipole (\(g_1^0\), \(g_1^1\), \(h_1^1\)), Gauss internal quadrupole (\(g_2^0\), \(g_2^1\), \(h_2^1\), \(g_2^2\), \(h_2^2\)), Gauss internal octupole (\(g_3^0\), \(g_3^1\), \(h_3^1\), \(g_3^2\), \(h_3^2\), \(g_3^3\), \(h_3^3\)), Gauss external dipole (\(q_1^0\), \(q_1^1\), \(s_1^1\)), Gauss external quadrupole (\(q_2^0\), \(q_2^1\), \(s_2^1\), \(q_2^2\), \(s_2^2\)), Gauss external octupole (\(q_3^0\), \(q_3^1\), \(s_3^1\), \(q_3^2\), \(s_3^2\), \(q_3^3\), \(s_3^3\)), toroidal coefficients (\(a_1^0\), \(a_1^1\), \(b_1^1\), \(a_2^0\), \(a_2^1\), \(b_2^1\), \(a_2^2\), \(b_2^2\), \(a_{\,1}^{\prime \,0}\), \(a_{\,1}^{\prime \,1}\), \(b_{\,1}^{\prime \,1}\), \(a_{\,2}^{\prime \,0}\), \(a_{\,2}^{\prime \,1}\), \(b_{\,2}^{\prime \,1}\), \(a_{\,2}^{\prime \,2}\), \(b_{\,2}^{\prime \,2}\)). These 46 coefficients are estimated with Capon’s method (Capon 1969; Toepfer et al. 2020a, b). The Capon method and the underlying model are tested against simulated data and MESSENGER in situ data around Mercury.

Hybrid simulation of Mercury’s magnetosphere

For the first application of the thin shell approximation simulated magnetic field data are analyzed. The magnetic field resulting from the plasma interaction of Mercury with the solar wind is simulated with the hybrid code AIKEF (Müller et al. 2011), that has successfully been applied to several problems in Mercury’s plasma interaction, (Exner et al. 2018, 2020, e.g.). The internal Gauss coefficients \(g_1^0=-190\,\,\text {nT}\) (dipole field), \(g_2^0=-78\,\,\text {nT}\) (quadrupole field) and \(g_3^0=-20\,\,\text {nT}\) (octupole field) (Anderson et al. 2012; Thébault et al. 2018; Wardinski et al. 2019) are implemented in the simulation code. The interplanetary magnetic field with a magnitude of \(B_{\text {IMF}}=20\,\,\text {nT}\) is orientated along the vector \((x,y,z)^T=(0.0 , 0.43 , 0.9)^T\) in the MASO frame. The solar wind velocity of \(v_{\text {sw}}=400\,\,\text {km}/\text {s}\) points along the x-axis (away from the Sun) and the solar wind proton density number was chosen to \(n_{\text {sw}}=30\,\,\text {cm}^{-3}\). The resulting magnitude of the magnetic field and the corresponding current density in the x-z-plane are displayed in Figs. 2, 3.

Fig. 2
figure 2

Simulated magnitude of the magnetic field \(\underline{B}\) in the x-z-plane. The white lines represent the magnetic field lines and the grey circle of radius \(1\,R_\text {M}\) symbolizes Mercury

Fig. 3
figure 3

Simulated magnitude of the current density \(\underline{j}\) in the x-z-plane. The grey circle of radius \(1\,R_\text {M}\) symbolizes Mercury

The internal dipole field dominates the geometry of Mercury’s magnetosphere. Yet the quadrupole field in terms of the apparently shifted dipole field is visible. The influence of the octupole field is not clearly noticeable on the field lines in the figure. Furthermore, the distribution of the simulated current density shows that there exist no completely current-free region around Mercury.

Reconstruction of the Gauss coefficients from simulated data

For the reconstruction of the internal Gauss coefficients implemented in the simulation code, first of all, magnetic field data at a distance of \(0.2\,R_\text {M}\) from Mercury’s surface are evaluated. The data are retrieved along meridional circular orbits around Mercury. The orbital plane is rotated about the rotation axis (z-axis) from \(- \;50^{^\circ }\) (afternoon/post-midnight sector) over \(0^\circ\) (noon/midnight, x-z-plane) to \(50^\circ\) (morning/pre-midnight sector). For this synthetically generated ideal case in terms of the thin shell approximation, the spherical shell that covers the circular orbits has a vanishing thickness \(h=0\) so that the application of the thin shell approximation is surely valid. The reconstructed internal Gauss coefficients are listed in Table 1. The optimal diagonal loading parameter which determines how the data are weighted within the application of Capon’s method results in \(\sigma _{opt.}\approx 1000\,\text {nT}\) (Toepfer et al. 2020b).

Table 1 Implemented and reconstructed Gauss coefficients for the dipole, quadrupole and octupole field resulting from the simulated data along the circular orbits at a distance of \(0.2\,R_\text {M}\) from Mercury’s surface

The deviation \(\big |\underline{g}^{int}-\underline{g}_C^{int} \big |\) between the reconstructed internal coefficients \(\underline{g}_C^{int}\) and the implemented internal coefficients \(\underline{g}^{int}\) results in \(4.0\,\,\text {nT}\), i.e. \(\big |\underline{g}^{int}-\underline{g}_C^{int} \big |/\big |\underline{g}^{int} \big |\approx 1.9\%\) and thus, the implemented coefficients are reconstructed from the data with good precision.

Concerning the BepiColombo mission the generated circular orbits are idealized cases which are not realizable in practice. To investigate the applicability of the thin shell approximation for elliptical orbits we analyze the magnetic field data along the prospective orbits of MPO. The orbits are generated in analogy to the circular orbits, i.e., with the same longitudinal extend. Along this trajectories the distances of the data points from Mercury’s surface vary from \(0.12\,R_\text {M}\) up to \(0.6\,R_\text {M}\) resulting in a shell thickness of \(2h\approx 0.48\,R_\text {M}\). Although the shell is now much thicker the thin shell approximation works successfully. The reconstructed Gauss coefficients are displayed in Table 2. The optimal diagonal loading parameter for the application of Capon’s method results in \(\sigma _{opt.}\approx 1000\,\,\text {nT}\).

Table 2 Implemented and reconstructed Gauss coefficients for the dipole, quadrupole and octupole field resulting from the simulated data along the future MPO orbits. For comparision in the last columns the reconstructed Gauss coefficients resulting from the sole parameterization of the internal parts \(\underline{B}^i\) (Gauss internal, 15 coefficients considered within the model) as well as the coefficients resulting from the parameterization of the internal \(\underline{B}^i\) and external poloidal fields \(\underline{B}^e\) (Gauss internal external, 30 coefficients considered within the model) are presented

The deviation between the reconstructed and the implemented internal coefficients results in \(4.1\,\,\text {nT}\), i.e. \(1.9\%\) and thus, these values agree with the coefficients reconstructed from the data evaluated along the circular orbits. Since the data are evaluated along the MPO orbits, it is expectable that also Mercury’s internal octupole field will be reconstructed with good precision from the data of the magnetometer on board MPO.

It should be noted that the extension of the underlying model by the parameterization of the external parts of the field using the Gauss–Mie representation improves the results significantly. When only the internal parts of the field \(\underline{B}^i\) are considered in the model, the deviation between the implemented and the reconstructed coefficients results in \(29.7\,\,\text {nT}\) or 14.4%, respectively (cf. Table 2). Additional parameterization of the external poloidal fields \(\underline{B}^e\) using the external scalar potential \(\Phi ^e\) yields a deviation of \(10.0\,\,\text {nT}\) or 4.9%, respectively. Thus, for the analysis of Mercury’s internal magnetic field, the Gauss–Mie representation is a suitable alternative to the application of global magnetospheric models.

Validity of the thin shell approximation

The coefficients reconstructed from the data evaluated along the planned MPO trajectories are basically in agreement with that from the data along the circular orbits although the shell covering the MPO orbits has a finite thickness.

To investigate the limits of the thin shell approximation within the hybrid simulation, the thickness of the shell is increased incrementally. Referring to Eq. (67), the length scale L is estimated via radial variation of the current distribution

$$\begin{aligned} L=\frac{\big | \underline{j}_H \big |}{\big |\partial _r \underline{j}_H \big |} \end{aligned}$$
(78)

for the horizontal currents \(\underline{j}_H=\underline{j}-\left( \underline{j}\cdot \underline{e}_r \right) \underline{e}_r\) which are used as a proxy for the toroidal currents at each point along the orbit. Since L represents a local quantity that varies for each point along the orbit, whereas the half thickness h is a global quantity, the set of resulting length scales is averaged over the number of points along the orbit resulting in the mean length scale \(\langle L \rangle\). It should be noted that also the poloidal currents have horizontal components. Thus, the estimation of the length scale L with the horizontal currents \(\underline{j}_H\) as a proxy for the toroidal currents is not exact, but it is sufficient for a qualitative discussion.

The deviations between the ideal coefficients implemented in the simulation and the reconstructed coefficients for varying values of \(h/\langle L \rangle\) are displayed in Table 3.

Table 3 Deviation between the implemented and the reconstructed Gauss coefficients for varying ratios of the half thickness h to the mean current length scale \(\langle L \rangle\)

When h approaches \(\langle L \rangle\) the deviations are greater than \(12\,\text {nT}\). This deviation is of the same order when the parameterization is restricted to the Gauss representation. But for the data points along the MPO orbits, where \(h/\langle L \rangle \approx 0.47\), the application of thin shell approximation is justified.

Since a shell of thickness \(2h\approx 0.48\,R_\text {M}\) is called thin, the name thin shell is misleading, although this naming has been adopted within the literature. Conferring to the global length scale of \(1\,R_\text {M}\) the shell of thickness \(0.48\,R_\text {M}\) is not thin, but compared with the current length scale L it is. Therefore, the term thin has to be understood in a mathematical sense.

Reconstruction of the Gauss coefficients from MESSENGER data

The Gauss–Mie representation has successfully been validated for the simulated data. For the reconstruction of the Gauss coefficients from the MESSENGER data only data points in the northern hemisphere within a distance of \(r\le 1.5\,R_\text {M}\) and \(x> - 0.4\,R_\text {M}\) from Mercury’s surface can be considered because the orbits do not cover the southern hemisphere properly.

For the reconstruction of Mercury’s internal magnetic field, the data from nine pairs of MESSENGER orbits with different orientations of the periapsis between 10. August 2012 and 14. July 2014 are analyzed. The combination of the orbits improves the model resolution compared to the analysis of single orbits. A discussion of the model resolution, as provided by Connerney (1981), can be found in the Appendix B. As a proof of concept, only a small subset of the whole MESSENGER data set is analyzed. In the case of a small and noisy data set the performance of the estimator can be improved by seperating the data set into several subsets (Meir 1994). Therefore, the Gauss coefficients are reconstructed for each pair of orbits and the results are averaged over the nine pairs. Outliers were not included within the averaging. The resulting mean values are listed in Table 4. The reconstructed coefficients for each pair of orbits and the standard deviations of the mean values are listed in Table 5 of Appendix C.

Table 4 Gauss coefficients for the internal dipole, quadrupole and octupole field and for the external dipole field reconstructed from MESSENGER data. In the last colum the ranges of Gauss coefficients reconstructed from MESSENGER data by Anderson et al. (2012), Thébault et al. (2018) and Wardinski et al. (2019) are shown

The mean values of the reconstructed internal Gauss coefficients and the external dipole coefficients are in feasible agreement with the values provided by Anderson et al. (2012), Thébault et al. (2018) and Wardinski et al. (2019). Furthermore, the related standard deviations for each coefficient are within the range of the variations resulting from the time varying magnetic field (Wardinski et al. 2019). To classify the coefficients reconstructed from the simulated magnetic field data (Table 2) in terms of that resulting from the MESSENGER data, the simulated data are furthermore evaluated along the MESSENGER trajectories. The reconstructed Gauss coefficients are listed in Table 6 of Appendix C. These coefficients are in agreement with the results presented in Table 4. Since the reconstructed coefficients resulting from the data evaluated along the MPO trajectories are in better agreement with the implemented coefficients than the coefficients resulting from the data evaluated along the MESSENGER trajectories, the restriction of the data points to the northern hemisphere and the related degradation of the model resolution influences the results significantly (Heyner et al. 2020, cf). Furthermore, it should be noted that the analysis of the length scales of the current densities (cf. "Validity of the thin shell approximation" section) cannot be performed for the MESSENGER orbits, because the current densities in the vicinity of the orbits are unknown.

Summary and outlook

In the vicinity of Mercury no completely current-free region is present. Therefore, the Gauss representation does not yield a proper parametrization of Mercury’s magnetospheric field. Extension of the Gauss representation to the Gauss–Mie representation allows a more complete characterization of Mercury’s internal and magnetospheric field.

For the parameterization of the magnetic field the orbit where the magnetic field is measured, is conceptually covered by a spherical shell. Due to the underlying geometry the total measured magnetic field is a superposition of internal and external poloidal fields generated by toroidal currents flowing outside the spherical shell as well as toroidal and poloidal fields generated by currents flowing within the shell. Thereby, each component of the field is either described by a scalar potential (\(\Phi ^i\), \(\Phi ^e\)) or a scalar function (\(\Psi _P^{sh}\), \(\Psi _T^{sh}\)). These potentials and functions can be expanded into spherical harmonics. When the thickness of the spherical shell is smaller than the length scale on which the toroidal current density changes in radial direction the shell is called a thin shell. Then the poloidal field generated by currents flowing inside the shell is negligible compared to the poloidal field generated by currents flowing beyond the shell, whereas the toroidal field remains finite. In the case of the planned MPO orbits, the thin shell approximation is a reasonable choice.

For the application of the thin shell approximation the internal Gauss coefficients for the dipole, quadrupole and octupole field are implemented in the simulation code AIKEF and the magnetic field data resulting from the plasma interaction of Mercury with the solar wind are simulated in the vicinity of Mercury. The data are evaluated along the planned MPO orbits and the 46 expansion coefficients, describing the internal, external and the toroidal field are reconstructed with Capon’s method. Since the reconstructed internal Gauss coefficients are in good agreement with that implemented in the simulation code, the parameterization of the magnetic field using the Gauss–Mie representation is a suitable alternative to the application of global magnetospheric models. Even the implemented Gauss coefficient for the octupole field of Mercury can be reconstructed accurately and therefore, it is expectable that Mercury’s internal octupole field will be reconstructed with high precision from the magnetometer data on board MPO. Thus, it is worthwile to investigate the analysis of higher multipoles, such as hexadecapole, from the data along the MPO trajectories.

Furthermore, the thin shell approximation is applied to reconstruct Mercury’s internal magnetic field from the data of the MESSENGER mission. The results are in reasonable agreement with former works. Since only the data points in the northern hemisphere are vital for the analysis of the MESSENGER data, the symmetrically distributed MPO orbits will deliver a better model resolution than the MESSENGER orbits.

Concerning the BepiColombo mission the combination of the Gauss representation with the Mie representation is a useful model for the analysis of Mercury’s internal magnetic field. As the BepiColombo mission consits of two elements, the planetary orbiter and the magnetospheric orbiter, measurements of any gradients of Mercury’s magnetic field are possible which may lead to further improvements of the methods presented here. Besides the analysis of the internal magnetic field, the reconstructed coefficients for the toroidal magnetic field can be used for calculating poloidal current systems, e.g. field aligned currents, in the vicinity of the orbit where the data are evaluated.

Availability of data and materials

Simulation data can be provided upon request.

References

Download references

Acknowledgements

The authors are grateful for stimulating discussions and helpful suggestions by Alexander Schwenke and the technial support by Willi Exner.

Funding

Open Access funding enabled and organized by Projekt DEAL. We acknowledge support by the German Research Foundation and the Open Access Publication Funds of the Technische Universität Braunschweig. The work by Y. Narita is supported by the Austrian Space Applications Programme at the Austrian Research Promotion Agency under contract 865967. D. Heyner and K.-H. Glassmeier were supported by the German Ministerium für Wirtschaft und Energie and the German Zentrum für Luft- und Raumfahrt under contract 50 QW1501.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed conception and design of the study; ST and UM wrote the first draft of the manuscript; all authors contributed to manuscript revision. All authors read and approved the final manuscript.

Corresponding author

Correspondence to S. Toepfer.

Ethics declarations

Competing interests

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: The thin shell approximation for the toroidal magnetic field

As derived in "The thin shell approximation" section, the scalar function of the poloidal magnetic field can be estimated upwards as

$$\begin{aligned} \big | \Psi _P^{sh}(\underline{r}) \big |\le 2bL\sum _{n=0}^\infty \frac{1}{n!}\left( \frac{h}{L} \right) ^{n+1}\,\max _{\theta ^\prime ,\lambda ^\prime }\big \{\big | \Gamma _T^{sh}(b)\big | \big \} . \end{aligned}$$
(79)

When \(h\ll L\), the function \(\Psi _P^{sh}\) vanishes. Now the influence of the thin shell approximation for the toroidal magnetic field \(\underline{B}_T^{sh}\) is discussed.

The scalar function \(\Psi _T^{sh}\) of the toroidal magnetic field \(\underline{B}_T^{sh}\) can be expanded analogously into a Taylor series in the vicinity of the mean radius b

$$\begin{aligned} \Psi _T^{sh}(\underline{r})=\Gamma _P^{sh}(\underline{r})=\sum _{n=0}^\infty \frac{1}{n!} \partial _r^n \Gamma _P^{sh}(r)\big |_{r=b} \left( r-b \right) ^n, \end{aligned}$$
(80)

using that the scalar functions \(\Psi _T^{sh}\) and \(\Gamma _P^{sh}\) are the same (cf. Eq. 54).

If we furthermore assume that also the radial derivatives of the poloidal current density are bounded so that

$$\begin{aligned} \big | \partial _r^n \Gamma _P^{sh} \big |\le \frac{\big | \Gamma _P^{sh} \big |}{L^n}, \end{aligned}$$
(81)

the function \(\Psi _T^{sh}(\underline{r})\) can be estimated upwards as

$$\begin{aligned} |\Psi _{T}^{{sh}} (\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{r} )| \le & \sum\limits_{{n = 0}}^{\infty } {\frac{1}{{n!}}} |\partial _{r}^{n} \Gamma _{P}^{{sh}} (b)| \cdot |r - b|^{n} \\ \le & \sum\limits_{{n = 0}}^{\infty } {\frac{1}{{n!}}} \frac{{|\Gamma _{P}^{{sh}} (b)|}}{{L^{n} }}{\mkern 1mu} |r - b|^{n} . \\ \end{aligned}$$
(82)

Using that \(\Psi _T^{sh}(\underline{r})\) is solely evaluated within the spherical shell, where \(r\in \left[ b-h,b+h \right]\) and therefore \(\big | r-b\big |\le h\) results in

$$\begin{aligned} \big |\Psi _T^{sh}(\underline{r})\big |\le \sum _{n=0}^\infty \frac{1}{n!} \left( \frac{h}{L} \right) ^n \big |\Gamma _P^{sh}(b)\big |. \end{aligned}$$
(83)

When \(h\ll L\), the function \(\Psi _T^{sh}\) remains finite.

Appendix B: Model resolution

For the analysis of Mercury’s internal magnetic field from the MESSENGER data only data points in the northern hemisphere can be considered. This limitation impairs the condition number \(\kappa\) of the shape matrix \(\underline{\underline{H}}\) from \(\kappa \approx 226\) for the data points along the MPO orbits to \(\kappa \approx 10^8\) for the data points along a single MESSENGER orbit, where the shape matrix describes the spacial distribution of the data points. The condition number is defined as the ratio of the largest and the smallest singular value of the shape matrix. Therefore, some of the 46 singular values, corresponding to the 46 expansion coefficients, of the shape matrix have to be dropped within the numerical calculation to improve the condition number. Within the low rank approximation (Eckart and Young 1936) only k singular values, where \(k\le 46\), can be considered and therefore, the shape matrix \(\underline{\underline{H}}\) is approximated by a shape matrix \(\underline{\underline{H}}_k\) which has a lower rank and a lower condition number.

Capon’s filter matrix

$$\begin{aligned} \underline{\underline{w}}^{\dagger }=\left[ \underline{\underline{H}}^{\dagger } \underline{\underline{M}}^{-1}\underline{\underline{H}} \right] ^{-1}\underline{\underline{H}}^{\dagger }\underline{\underline{M}}^{-1} \end{aligned}$$
(84)

which is the key parameter for calculating Capon’s estimator

$$\begin{aligned} \underline{g}_C=\underline{\underline{w}}^{\dagger }\underline{B} \end{aligned}$$
(85)

fulfills the distortionless constraint

$$\begin{aligned} \underline{\underline{w}}^{\dagger } \underline{\underline{H}} =\underline{\underline{I}}, \end{aligned}$$
(86)

where \(\underline{\underline{I}}\) is the identity matrix and \(\underline{\underline{M}}=\langle \underline{B}\circ \underline{B} \rangle\) describes the data covariance matrix of the magnetic field measurements \(\underline{B}\) (Toepfer et al. 2020a, b). As a consequence of the low rank approximation Capon’s filter matrix is modified to

$$\begin{aligned} \underline{\underline{w}}_k^{\dagger }=\left[ \underline{\underline{H}}_k^{\dagger }\, \underline{\underline{M}}^{-1}\underline{\underline{H}}_k \right] ^{-1}\underline{\underline{H}}_k^{\dagger }\,\underline{\underline{M}}^{-1} \end{aligned}$$
(87)

so that

$$\begin{aligned} \underline{\underline{R}}=\underline{\underline{w}}^{\dagger }_k \underline{\underline{H}} \ne \underline{\underline{I}}. \end{aligned}$$
(88)

The matrix \(\underline{\underline{R}}=\underline{\underline{w}}^{\dagger }_k \underline{\underline{H}}\) is called the model resolution matrix. Because of

$$\begin{aligned} \underline{g}_C^k=\underline{\underline{R}}\,\underline{g}, \end{aligned}$$
(89)

where \(\underline{g}_C^k\) denotes the estimator resulting from the consideration of k singular values and \(\underline{g}\) is the ideal coefficient vector implemented in the simulation code, the diagonal elements of \(\underline{\underline{R}}\) identify the resolution of each coefficient (Connerney 1981). When \(\underline{\underline{R}}=\underline{\underline{I}}\) each coefficient is resolved for \(100\%\). If the resolution is smaller than \(100\%\), there exist model parameter covariances.

The more singular values are considered within the estimation, the better the model resolution becomes, whereas the condition number of the shape matrix increases and thus, the influence of measurement errors increases.

To achieve a compromise between the resolution and the condition number, the coefficients are estimated for different numbers of singular values and the change of the coefficients resulting from k singular values to the coefficients resulting from \(k-1\) singular values is regarded. For the final computation the maximum number of singular values is chosen from which the reconstructed coefficients are almost constant. In Fig. 4 the procedure is exemplarily illustrated for the coefficient \(g_1^0\), which corresponds to the diagonal element \(R_{11}\) by analyzing the data of one pair of MESSENGER orbits.

Fig. 4
figure 4

Variation of the reconstructed coefficient \(g_1^0\) (red) and the related model resolution \(R_{11}\) (blue) with respect to the number k of singular values. For \(k\le 37\) (dashed line) the reconstructed values of \(g_1^0\) are almost constant

Changing the number of considered singular values incrementally from 46 to 37 the resulting coefficient changes significantly. For \(k\le 37\) the values are almost constant and therefore, 37 singular values, corresponding to a model resolution of 88% for the coefficient \(g_1^0\) are considered within the estimation.

Appendix C: Tables: Reconstructed Gauss coefficients from MESSENGER orbits

Table 5 Gauss coefficients for the dipole, quadrupole and octupole field reconstructed from selected pairs of MESSENGER orbits
Table 6 Gauss coefficients for the dipole, quadrupole and octupole field reconstructed from the simulated data evaluated at selected pairs of MESSENGER orbits

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Toepfer, S., Narita, Y., Glassmeier, K.H. et al. The Mie representation for Mercury’s magnetic field. Earth Planets Space 73, 65 (2021). https://doi.org/10.1186/s40623-021-01386-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40623-021-01386-4

Keywords