
Automatic classification of spread‐F types in ionogram images using support vector machine and convolutional neural network

Abstract

An ionogram image serves as valuable data for examining the bottomside characteristics and variability of the ionosphere. Spread-F, identified by diffuse echoes in ionogram images, indicates plasma irregularities in the ionospheric region and poses challenges for the efficient interpretation required in further applications. This study presents an automatic classification of spread-F types, in which ionogram images are classified automatically and preprocessing techniques are applied to improve classification performance. Two machine learning algorithms are used: a support vector machine (SVM) and a convolutional neural network (CNN). Based on 4,692 labelled ionogram images from the FMCW-type ionosonde at Chumphon station, Thailand, the CNN model with preprocessing outperforms the SVM alternative. The model classified the clear, frequency spread-F (FSF), range spread-F (RSF), strong spread-F (SSF), and unidentified classes with accuracies of 98.0%, 85.1%, 90.7%, 66.7%, and 99.2%, respectively. The proposed models successfully classified the ionogram image classes, and image filtering and data preprocessing proved useful for improving classification performance on ionogram images.


Introduction

The ionosphere is the upper part of the atmosphere that is ionized by solar radiation and forms a dense region of charged particles that changes over time (Bowman 1960). Its bottomside characteristics can be observed by different ionosonde systems, such as the frequency-modulated continuous wave (FMCW) ionosonde (Yao et al. 2012) and the digital ionosonde (Rao et al. 2022a). The FMCW-type ionosonde with a pulse compression technique used at Chumphon station, Thailand transmits continuous pulses over the high-frequency range of 2–30 MHz toward the ionospheric layers. The signals are reflected by the ionospheric plasma, the resulting echoes are recorded, and the ionosonde displays the recorded data on a graph called the ionogram (Nozaki 2009; Thammavongsy et al. 2020). An ionospheric disturbance in the F layer appears on the ionogram as spread-F traces, which characterize the presence of plasma irregularities and are typically observed during night-time when the ionosphere is less influenced by solar radiation. Equatorial spread-F specifically occurs in the equatorial region. There are several types of spread-F, each exhibiting distinct characteristics: frequency spread-F (FSF), range spread-F (RSF), mixed spread-F (MSF), and strong spread-F (SSF) (Wang et al. 2008). Figure 1 illustrates the characteristics of ionogram images, including a typical ionogram (Fig. 1a) and ionograms with the spread-F types. FSF in Fig. 1b is a horizontal spread of ionogram traces around the F-region critical frequency, which can be observed in various patterns. RSF in Fig. 1c is a vertical spread of ionogram traces over a wide range of frequencies. MSF in Fig. 1d is observed as a combination of the spreading characteristics of FSF and RSF. SSF in Fig. 1e is an intensified version of RSF, in which the spreading conditions are significantly expanded. In addition, if the FMCW-type ionosonde fails to receive a reflected signal or encounters an error during processing, it plots an ionogram with no trace, classified as the unidentified class in this study, as shown in Fig. 1f.

Fig. 1

Ionogram sample of a clear class, b FSF class, c RSF class, d SSF class, e MSF class, f unidentified class

The presence of this phenomenon makes the ionization density in the ionospheric layer irregular, and plasma depletions in some areas have relatively lower density than the surrounding areas. This affects the transmission of radio waves, leading to unclear and poor-quality signals, delays, and errors in recorded data (Thammavongsy et al. 2020). Consequently, it is essential to conduct ionospheric observations for various applications, such as developing alert systems that detect irregularities affecting radio communication systems and global positioning systems (GPS).

These observations are crucial for ensuring reliable and accurate performance of such systems. Nowadays, ionosonde operators are required to manage the processing of ionogram data, which involves scaling and categorizing thousands of ionogram images. To address this burden, machine learning is introduced. Machine learning is well suited to handling large amounts of data and can be applied effectively to automatic ionogram image classification tasks. In general, machine learning can be divided into three types: supervised learning, unsupervised learning, and reinforcement learning. Supervised learning trains a machine directly on labelled data; unsupervised learning trains on raw or unlabelled data; and reinforcement learning learns through decision-making that optimizes an outcome (Janiesch et al. 2021). Among these types, supervised learning is the most suitable for ionogram images because large amounts of labelled data are available (Luwanga et al. 2022).

For ionogram data management, Xiao et al. (2020) proposed a deep-learning method for ionogram automatic scaling (DIAS) to scale a large amount of ionosonde data. The DIAS model consists of encoder and decoder networks. The encoder network was evaluated with VGG16, ResNet50, and EfficientNet-b5 backbones, while the decoder network applied a feature pyramid network (FPN) module to enhance the scaling accuracy. The results showed that the DIAS model with a ResNet50 backbone and FPN module scaled ionograms with a precision of 95.79%, while the traditional method achieved only 88.67%. In 2021, De La Jara and Olivares presented a CNN method to detect ionospheric echoes in digital ionograms using three different models, evaluated by the Intersection over Union (IoU) between manual and automatic trace detection. The first model was fed filtered images; the second model was trained by adding manually extracted images after the first model; and the last model was trained on manually extracted data only. The models achieved IoU values of 0.174, 0.602, and 0.569, respectively, showing that manually extracted data can greatly improve the IoU in this task. Xue et al. (2022) presented an echo extraction method for three types of ionograms (vertical, oblique, and backscatter) using a CNN for classification and extraction with residual learning and a skip-connection structure, which improved the performance over the traditional method by 22.18%, 22.56%, and 6.67%, respectively. In the same year, Luwanga et al. proposed spread-F detection on digital ionogram images using SVM and CNN models with three base models: VGG16, InceptionV3, and ResNet50. The SVM model achieved a precision of 77.00%, and the CNN model with ResNet50 achieved a precision of 95.00%. However, the SVM model in that work showed poor performance and was abandoned for further evaluation, so the performance of SVM models on ionogram data remains inconclusive. In 2022, Rao et al. (2022b) proposed an auto-detection method for ionospheric irregularities in digital ionograms: a tool based on fuzzy relations that detects the height and frequency points in denoised digital ionogram images to identify their classes. The proposed method detected ionograms with efficiencies of 96.71%, 97.83%, 89.71%, 68.32%, and 93.39% for sporadic-E, FSF, RSF, MSF, and SSF events, respectively. Recently, Wang et al. (2023) presented a deep learning model for spread-F detection and classification. Over 100,000 digital ionogram images were used for training, evaluation, and testing on various models, with the ionograms cropped, resized, and augmented with simulated noise added to the original images. The results indicated that ResNet50 achieved a test accuracy of 92.36% in detecting and classifying ionograms into FSF, RSF, MSF, SSF, and no spread-F. Most of the ionograms used in the works mentioned above are digital ionograms, which differ from the ionograms used in this work; the latter are recorded by the FMCW system and contain a greater variety of noise. Additional image preprocessing methods are therefore introduced to deal with the unwanted noise and improve the data quality supplied to the models for classification and further applications.

Therefore, the main purpose of this work is to compare the performance of shallow and deep machine learning structures, namely SVM and CNN models, for the automatic classification of an ionogram data set, in order to alleviate the manual tasks associated with ionogram data from the FMCW-type ionosonde. In addition, preprocessing methods are investigated to improve model performance using different techniques, such as image filtering with image sharpening, image thresholding, median blur, gamma correction, fast non-local means, and bilateral filters. These techniques are utilized in the training, validation, and testing processes of both models.

Experiment setup

This section provides information on the experimental design, including data set preparation, image preprocessing methods, model details, and evaluation process to determine the most effective approach for improving the model performance.

Data set preparation

The ionogram data set used in this study is obtained from the FMCW ionosonde at Chumphon station, Thailand, under the administration of the Southeast Asia Low-latitude Ionospheric Network (SEALION) Project (Maruyama et al. 2007). The data collection period spans March to May and August to October of 2014–2016 and 2018–2020, chosen to account for seasonal changes in the phenomena. The data set of 4,692 ionogram images was manually classified by an expert inspector into the clear class, i.e., ionograms with no spread-F (1,320 images), the FSF class (760 images), the RSF class (1,620 images), the SSF class (126 images), and the unidentified class (866 images). A sample of each ionogram class is shown in Fig. 1.

Regarding spread-F occurrence, RSF events frequently occur near Chumphon station (Rungraengwajiake et al. 2013), so the RSF class has the highest number of samples among the spread-F classes. Conversely, SSF events rarely occur in the equatorial region, resulting in the lowest number of samples. The data set is then divided into three subsets: 70% for training, 15% for validation, and 15% for testing (Razzano and Cuoco 2018). Of the total images, the training set contains 3,285 images, and the validation and test sets contain 704 images each. When splitting the data set, it is important to ensure that there are sufficient samples for training while also avoiding too few samples for validation and testing. Since the data classes in this work are severely imbalanced, particularly the SSF class, there is a risk of having an insufficient number of samples for evaluating the model during validation and testing.
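As a concrete illustration, the 70/15/15 split can be implemented as a two-stage split with scikit-learn, as sketched below. The variable names are placeholders, and stratification by class is an assumption made here to preserve the imbalanced class proportions; the exact splitting procedure is not detailed in the text.

```python
# Hedged sketch: two-stage 70/15/15 split (stratification is an assumption).
from sklearn.model_selection import train_test_split

def split_dataset(image_paths, labels, seed=42):
    # Hold out 30% first, then split that portion into validation and test
    # halves so that each receives 15% of the full data set.
    x_train, x_rest, y_train, y_rest = train_test_split(
        image_paths, labels, test_size=0.30, stratify=labels, random_state=seed)
    x_val, x_test, y_val, y_test = train_test_split(
        x_rest, y_rest, test_size=0.50, stratify=y_rest, random_state=seed)
    return (x_train, y_train), (x_val, y_val), (x_test, y_test)
```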

Image preprocessing methods

The manipulation of the data set is considered before analyzing or using in other processes of machine learning tasks including image preprocessing and image filtering with the purpose to filter out noise and enhance the features in ionograms. Image filtering technique can be used to reduce noise, distortion and improve the overall information of the ionogram images (De La Jara and Olivares 2021). With proper image preprocessing, it can directly improve the performance and efficiency of the models in terms of computer vision (Xiao et al. 2020). In this work, there are seven preprocessing techniques to be evaluated which are described below.

  1. Image cropping: a technique to crop the image so that only the necessary part of the ionogram is retained. By cropping the F-layer region from the full ionogram image, the image contains only the important information.

  2. Image thresholding: a technique to separate regions in an image by determining a threshold value.

  3. Image sharpening: a filter that passes a specific kernel matrix over the image pixels, resulting in a sharpened image. The kernel matrix is defined as \(\left[\begin{array}{ccc}0& -1& 0\\ -1& 5& -1\\ 0& -1& 0\end{array}\right]\).

  4. Median blur: a technique for image denoising that replaces the value of each pixel with the median value of the neighboring pixels within a defined window.

  5. Gamma correction filter: a technique for adjusting the overall brightness and contrast of the image by setting the gamma value. Increasing the gamma value makes the pixel intensities brighter, while decreasing it results in a darker output.

  6. Fast non-local means filter (fast-NLmeans): a denoising technique that preserves the structure of the image by comparing the similarity between nearby image areas and computing the noise-free average value of each area.

  7. Bilateral filter: a filter for noise reduction that preserves important image structure by considering both the spatial proximity and the intensity similarity between pixels.

From the presented preprocessing methods, image cropping is used as a preliminary process for both the SVM and CNN models to remove unwanted data from the full-size ionogram images, while the other preprocessing techniques are applied and evaluated on the ionogram images separately to compare their effect on classification performance.
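For illustration, the filters listed above can be implemented with OpenCV as in the sketch below. The kernel sizes, threshold, gamma value, and filter strengths are illustrative assumptions rather than the values used in this work; only the sharpening kernel follows the matrix given in the text.

```python
# Hedged OpenCV sketch of the preprocessing filters; parameter values are assumed.
import cv2
import numpy as np

def crop_f_region(img, row0=0, row1=224, col0=0, col1=224):
    # 1) Image cropping: keep only the F-layer region (indices are placeholders).
    return img[row0:row1, col0:col1]

def threshold(img, t=127):
    # 2) Image thresholding: separate low- and high-intensity regions.
    _, out = cv2.threshold(img, t, 255, cv2.THRESH_BINARY)
    return out

def sharpen(img):
    # 3) Image sharpening with the 3x3 kernel given above.
    kernel = np.array([[0, -1, 0], [-1, 5, -1], [0, -1, 0]], dtype=np.float32)
    return cv2.filter2D(img, -1, kernel)

def median(img, ksize=5):
    # 4) Median blur: each pixel becomes the median of its neighbourhood.
    return cv2.medianBlur(img, ksize)

def gamma_correction(img, gamma=1.5):
    # 5) Gamma correction: gamma > 1 brightens, gamma < 1 darkens (8-bit input).
    lut = np.array([255.0 * (i / 255.0) ** (1.0 / gamma) for i in range(256)],
                   dtype=np.uint8)
    return cv2.LUT(img, lut)

def fast_nlmeans(img, h=10):
    # 6) Fast non-local means denoising for a single-channel image.
    return cv2.fastNlMeansDenoising(img, None, h, 7, 21)

def bilateral(img, d=9, sigma_color=75, sigma_space=75):
    # 7) Bilateral filter: smooth noise while preserving edges and traces.
    return cv2.bilateralFilter(img, d, sigma_color, sigma_space)
```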

Model details

This study applies SVM and CNN models to evaluate classification performance on this ionogram data set. The SVM has a simple architecture that determines a hyperplane to classify the data into classes using the kernel trick, whereas the CNN has a more complex architecture with multiple convolutional layers and neurons that analyze various features of the input data. The experiment is conducted on these two different types of ML model to compare the performance of the two structures in classifying this ionogram data set and to examine the impact of the proposed preprocessing methods on both models. Therefore, the improvement from the proposed methods is evaluated for both the SVM and CNN algorithms on the same data set, and the results are then compared. The structure of the image classification methodology of each model is shown in Fig. 2.

Fig. 2

Flowchart of ionogram classification of a SVM model, b CNN model

Parameters for model evaluation

The metrics for model evaluation in this study are as follows:

  1. Precision represents the accuracy of positive predictions, calculated as the ratio of true positives (ionograms with spread-F) to the sum of true positives and false positives:

    $${\text{Precision}}=\frac{{\text{True Positive}}}{{\text{True Positive}}+{\text{False Positive}}}$$

  2. Recall represents the ability to identify all positive instances, calculated as the ratio of true positives to the sum of true positives and false negatives:

    $${\text{Recall}}=\frac{{\text{True Positive}}}{{\text{True Positive}}+{\text{False Negative}}}$$

  3. F1-score is the balanced measure of model performance combining precision and recall:

    $${\text{F1 score}}=2\times \frac{{\text{Precision}}\times {\text{Recall}}}{{\text{Precision}}+{\text{Recall}}}$$

  4. Support represents the number of tested samples for each ionogram class.

  5. Accuracy represents the overall model performance, calculated as the ratio of correct predictions to the total number of tested samples:

    $${\text{Accuracy}}=\frac{{\text{True Positive}}+{\text{True Negative}}}{{\text{Total number of predictions}}}$$
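For reference, these per-class metrics can be computed with scikit-learn as in the minimal sketch below; the label arrays are toy placeholders, not results from this study.

```python
# Hedged sketch: per-class precision, recall, F1-score, support, and accuracy.
from sklearn.metrics import accuracy_score, classification_report

class_names = ["clear", "FSF", "RSF", "SSF", "unidentified"]
y_true = [0, 1, 2, 2, 3, 4]   # ground-truth class indices (toy example)
y_pred = [0, 1, 2, 1, 3, 4]   # model predictions (toy example)

print(classification_report(y_true, y_pred, target_names=class_names))
print("accuracy:", accuracy_score(y_true, y_pred))
```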

SVM model for image classification

SVM is a supervised machine learning algorithm used for data classification and regression. The algorithm determines the best hyperplane that divides the data into separate categories by maximizing the margin width, which is the gap between the closest data points and the hyperplane. The SVM can handle low- and high-dimensional data and both linear and non-linear data, using different kernel functions to transform the input data into a higher-dimensional space for separation (Brereton and Lloyd 2010; Raghavendra and Deka 2014). A linear kernel creates a linear boundary in the original feature space, while polynomial and Radial Basis Function (RBF) kernels use nonlinear transformations to achieve nonlinear boundaries in higher-dimensional spaces. These kernel tricks were evaluated to find the most suitable kernel for this ionogram data set.

Table 1 defines the parameters for tuning the SVM classification model in this work. All input data are cropped into three-dimensional tensors with a depth of one channel (grayscale images). The three main variables of the SVM are the C parameter, the gamma value, and the kernel function. The C parameter is the regularization variable that adjusts the cost of misclassifying a training sample; the gamma value determines the shape of the decision boundary; and the kernel function is a mathematical function that transforms the data into a higher-dimensional space for separation. As previously stated, the optimized parameters are obtained through hyperparameter tuning, which determines the optimal values for the model on a specific data set, typically by evaluating combinations of parameters and selecting the best-performing one. Another preprocessing method for the SVM classification model in this work is K-means clustering for data pre-separation. This clustering technique groups samples based on their characteristics, placing similar samples in the same group and distinguishing them from other groups. The objective is to cluster images with similar intensities while maximizing the differences between groups and minimizing the variations within groups (Pham et al. 2005). In this step, K-means clustering is tested with three different cluster sizes: 2, 3, and 4. Image filtering is then applied to each cluster to examine how the image filters affect the training and evaluation of the SVM model.

Table 1 SVM model architecture for ionogram image classification task
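A minimal sketch of this hyperparameter tuning is given below, assuming the cropped grayscale pixels are flattened into feature vectors; the grid values are illustrative and do not necessarily match Table 1.

```python
# Hedged sketch: grid search over C, gamma, and kernel for the SVM classifier.
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

def tune_svm(images, labels):
    # Flatten each 224x224 grayscale ionogram into a 1-D feature vector.
    X = np.asarray(images, dtype=np.float32).reshape(len(images), -1) / 255.0
    param_grid = {
        "C": [0.1, 1, 10],
        "gamma": ["scale", "auto"],
        "kernel": ["linear", "poly", "rbf"],
    }
    search = GridSearchCV(SVC(), param_grid, cv=3, n_jobs=-1)
    search.fit(X, labels)
    return search.best_params_, search.best_estimator_
```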

CNN model for image classification

A CNN is a type of neural network commonly used in deep learning for image recognition and classification tasks. It uses a series of convolutional layers to extract features from the input image, with pooling layers reducing their dimensionality to produce the extracted features of the input data. The model is then trained by feeding the network labelled input images and adjusting the network weights to minimize the difference between the predicted output and the label (Chollet 2022). The final step is the classification process in the fully connected (FC) layer, which consists of neurons that apply the trained weights, connect to every neuron of the previous layer, and apply activation functions to the weighted sums to produce the final image classification output (Yamashita et al. 2018).

In this work, a VGG16 pretrained model is used as the base model for feature extraction, as defined in Table 2. The VGG16 model was originally developed by the Visual Geometry Group at the University of Oxford; the "16" refers to the number of layers in the network, and the model has been trained on a large data set of images for computer vision and image classification tasks (Simonyan and Zisserman 2014). The primary setup, such as the input image size, the number of samples, and the image cropping, is the same as for the SVM model.

Table 2 VGG16 model architecture for ionogram image classification task

In addition, the CNN model is combined with ImageDataGenerator for data augmentation; this step is applied only to the CNN model because the process cannot be used with the SVM model. ImageDataGenerator is a utility function in the Keras library that generates batches of images with real-time augmentation during the training process of the CNN model. It applies data transformations (image filtering) during training and can effectively add training samples to the CNN model. The primary objective of the optimized model is to minimize training loss by adjusting the base model parameters. This involves an optimizer, which determines how the model parameters are updated. The learning rate is a hyperparameter controlling the step size during the learning process. The batch size represents the number of training samples processed in each iteration. The number of epochs indicates how many times the training process is repeated, allowing the model to learn and refine its performance. The number of dense layers defines the model capacity and complexity. Dropout is a model regularization technique that randomly drops neuron units during the training process to prevent overfitting. Lastly, the model performance is evaluated by classifying unlabelled ionogram images.
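A hedged sketch of this setup is shown below: a frozen VGG16 base, a small fully connected head, and an ImageDataGenerator whose preprocessing_function applies a filter to a fraction of the training images. The head sizes, the 20% filter probability, and the replication of grayscale ionograms to three channels (so that ImageNet weights can be used) are assumptions, not details confirmed in the text.

```python
# Hedged sketch: VGG16-based CNN with ImageDataGenerator applying a random filter.
import cv2
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16
from tensorflow.keras.preprocessing.image import ImageDataGenerator

def random_sharpen(img, p=0.2):
    # With probability p, sharpen the image (img is an HxWxC float array).
    if np.random.rand() < p:
        kernel = np.array([[0, -1, 0], [-1, 5, -1], [0, -1, 0]], dtype=np.float32)
        img = cv2.filter2D(img, -1, kernel)
    return img

base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False   # use VGG16 purely as a feature extractor

model = models.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(256, activation="relu"),
    layers.Dropout(0.3),
    layers.Dense(5, activation="softmax"),   # five ionogram classes
])
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),
              loss="categorical_crossentropy", metrics=["accuracy"])

train_gen = ImageDataGenerator(rescale=1.0 / 255,
                               preprocessing_function=random_sharpen)
# Usage (directory path is a placeholder):
# train_flow = train_gen.flow_from_directory("train/", target_size=(224, 224),
#                                            batch_size=32, class_mode="categorical")
```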

Results and discussions

This section presents the experimental results of the preprocessing methods on the ionogram images and of both classification models, as well as the parameter determination for the SVM and CNN models. Following this, the performance of both models for ionogram image classification is evaluated.

Image preprocessing method

For both the SVM and CNN models, the input images are cropped to the F-layer region of the ionosphere. As shown in Fig. 3, the original ionogram image of 738 by 500 pixels is cropped to a shape of 224 by 224 pixels with one grayscale channel to select the useful part of the ionogram data. Furthermore, downscaling the input data results in a faster training process.

Fig. 3

Ionogram image with a original size, b cropped size

SVM model performance

In this work, the parameters of the SVM model are first tuned using hyperparameter tuning, which refers to finding the optimal values that maximize model performance by evaluating each combination of C parameter, gamma value, and kernel trick on the test data set, as shown in Table 3.

Table 3 Hyperparameter tuning: the highest accuracy results of each kernel trick on SVM model

From Table 3, the tuned hyperparameters are a C parameter of 0.1, a gamma value of "auto", and a polynomial kernel. The next step is to apply these parameters to the SVM model and utilize the clustering technique to group the data set by similar image intensity. The K-means clustering technique is applied to separate the ionogram images, as shown in Table 4.

Table 4 Results of applying clustering technique to ionogram by descending the mean value of image intensity

Table 4 shows examples of ionogram images for the different cluster sizes. The cluster size determines the number of groups into which the data are separated, as indicated by the number of subclusters for 2, 3, and 4 clusters. If the cluster size is not aligned with the characteristics of the data set, it can lead to suboptimal separation. Thus, the cluster sizes are varied to examine the benefit of the clustering technique across various cluster sizes.

The number of clusters must be specified before grouping into subclusters; thus, the number of clusters is varied from 2 to 4, and all three cluster sizes are evaluated to determine the proper size. The experiment then applies the six different image filters to each cluster to examine the impact of the image filters at different cluster sizes.
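A sketch of this pre-separation step is shown below, assuming the mean grayscale intensity of each ionogram is used as the clustering feature; this follows the intensity-based grouping described for Table 4 but is otherwise an assumption.

```python
# Hedged sketch: group ionograms by mean intensity with K-means before filtering.
import numpy as np
from sklearn.cluster import KMeans

def cluster_by_intensity(images, n_clusters=3, seed=0):
    # One feature per image: its mean grayscale intensity.
    features = np.array([[np.mean(img)] for img in images])
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    return km.fit_predict(features)   # cluster id per image

# A chosen filter (e.g. sharpening) can then be applied only to images whose
# cluster id is in a selected subset, leaving the other clusters unchanged.
```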

Following this, the proposed image filtering techniques are introduced to improve the training and learning process of the SVM model. The purpose of image filtering is to transform and extract information from the original data, enhancing its utility and features for the model. Figure 4 shows an ionogram image with each image filter applied. In Fig. 4b, the sharpening filter sharpens the overall components in the image, making them more noticeable. In Fig. 4c, the thresholding filter produces a clear separation between low and high image intensities. In Fig. 4d, the median filter slightly blurs grain noise in the ionogram image. In Fig. 4e, gamma correction emphasizes the overall image intensity, similar to the thresholding filter. In Fig. 4f, the fast-NLmeans filter denoises the low-intensity areas while retaining the ionogram trace. In Fig. 4g, the bilateral filter fades the background noise while the ionogram trace remains.

Fig. 4

Samples of ionogram image with a no image filter, b image sharpening filter, c image thresholding filter, d median blur filter, e gamma correction filter, f fast-NLmeans filter, g bilateral filter

Then, the SVM model is evaluated for ionogram image classification using the clustering technique with 2, 3, and 4 clusters, and each scenario is subsequently tested by applying each image filter to every possible subcluster to thoroughly examine the outcomes, as listed in Tables 5, 6, and 7.

Table 5 SVM model classification accuracy with image filters and clustering size of 2
Table 6 SVM model classification accuracy with image filters and clustering size of 3
Table 7 Results of applying image filters with clustering size of 4 on SVM model

In Table 5, each image filter is applied to each cluster separately; "All clusters" means that the filter is applied to every sample in that evaluation. Applying the different image filters shows that the image sharpening filter achieved the highest classification accuracy of 86.5% when applied to the 1st cluster, while the thresholding filter on all clusters and the fast-NLmeans filter on the 1st cluster both achieved 86.4%. Utilizing the clustering technique improves model performance compared with not using it; however, the highest accuracy achieved with 2 clusters is only marginally better, so larger cluster sizes are evaluated.

Since there are multiple subclusters, a comprehensive evaluation is conducted over all possible cluster separation scenarios, allowing a detailed examination of the image filter impact on these subclusters, as illustrated in Table 6. The results show that applying the image sharpening filter to the 1st and 2nd clusters achieved an accuracy of 87.0%, followed by the fast-NLmeans filter on the same clusters with 86.5%. In this regard, enhancing the image quality of high-intensity ionograms can efficiently improve the accuracy of the SVM model with 3 clusters.

Table 7 shows the model classification accuracy for all possible cluster separations with 4 clusters. The image sharpening filter increases the accuracy to 86.8% when applied to the 1st and 3rd clusters, or to only the 3rd cluster. Distinctly, applying the median blur filter to the 4th cluster, which contains low-intensity ionograms, also improves the accuracy to 86.8%, the same as the image sharpening filter.

After applying the image filters and evaluating cluster sizes of 2, 3, and 4, the SVM model with 3 clusters and the image sharpening filter applied to the 1st and 2nd clusters achieved the highest classification accuracy of 87.0%, as shown in Table 8.

Table 8 Results of SVM model with 3 clustering and image sharpening filter

From Table 8, the FSF and SSF classes achieved lower accuracies than the others. The FSF phenomenon can be observed with various characteristics, so the model needs more FSF samples with higher diversity for the learning and training process, while the SSF phenomenon is significantly rarer, so the data set lacks an effective quantity of SSF samples. Moreover, the image sharpening filter also improved the accuracy for cluster sizes of 2 and 4, indicating that the SVM model performance improves when a filter is used to emphasize the overall image features.

CNN model performance

Since the CNN model generally has more parameters and a more complex structure than the SVM model, each parameter can be adjusted or calibrated to obtain the best setup for each task. In this work, the VGG16 pretrained model is selected as the base model for the CNN classifier. VGG16 has performed well in various classification competitions and has been used to detect spread-F occurrence in digital ionogram images (Luwanga et al. 2022). Thus, VGG16 is a reliable pretrained model with a suitable structure size to be evaluated in this work.

Table 9 shows the results of using VGG16 to classify the ionogram data. The model classifies ionogram images with an accuracy of 89.3%. The SSF class achieved quite low accuracy due to insufficient training and validation data.

Table 9 CNN model performance with original VGG16 structure

CNN model parameter tuning

Parameter tuning is used to observe the impact of different parameter settings on the model, such as adjusting the optimizer and learning rate and varying the size of the fully connected layers. Tuning can help the model capture and extract image features more effectively, improving learning, reducing overfitting, and enhancing computational efficiency (Kandel and Castelli 2020). The model structure was first modified by adjusting the optimizer and learning rate to observe the impact of each configuration and find the most suitable setup. The optimizers evaluated were Adam, Adagrad, and Stochastic Gradient Descent (SGD), while the learning rate was evaluated over the range of 0.01, 0.001, 0.0001, 0.00001, and 0.000001 (Isa et al. 2022), as shown in Table 10.

Table 10 Evaluation results of model optimizer and learning rate on CNN model

Table 10 shows that the Adam optimizer with a learning rate of 0.00001 achieved the highest accuracy of 89.3%, while Adagrad and SGD achieved 86.8% and 88.8%, respectively. Therefore, further experiments were conducted using the Adam optimizer with a learning rate of 0.00001, as this was the most suitable setup for the CNN model.
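The sweep over optimizers and learning rates can be sketched as below; the stand-in model and the random dummy data exist only so the sketch runs end-to-end and are not the configuration used in this work.

```python
# Hedged sketch: grid over optimizers and learning rates, tracking validation accuracy.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers, models

def build_model():
    # Small stand-in classifier; in the paper this role is played by the VGG16-based model.
    return models.Sequential([
        tf.keras.Input(shape=(224, 224, 1)),
        layers.Conv2D(8, 3, activation="relu"),
        layers.GlobalAveragePooling2D(),
        layers.Dense(5, activation="softmax"),
    ])

# Dummy data so the sketch runs; replace with the ionogram training set.
x = np.random.rand(16, 224, 224, 1).astype("float32")
y = tf.keras.utils.to_categorical(np.random.randint(0, 5, 16), 5)

optimizers = {"Adam": tf.keras.optimizers.Adam,
              "Adagrad": tf.keras.optimizers.Adagrad,
              "SGD": tf.keras.optimizers.SGD}
results = {}
for name, opt_cls in optimizers.items():
    for lr in [1e-2, 1e-3, 1e-4, 1e-5, 1e-6]:
        model = build_model()
        model.compile(optimizer=opt_cls(learning_rate=lr),
                      loss="categorical_crossentropy", metrics=["accuracy"])
        hist = model.fit(x, y, epochs=1, verbose=0, validation_split=0.25)
        results[(name, lr)] = hist.history["val_accuracy"][-1]

best_setup = max(results, key=results.get)   # the paper reports Adam with 1e-5 as best
```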

Fully connected layer modification

The modification is done by varying the fully connected head from a shallower to a deeper structure. A shallower structure requires more nodes, i.e., more neurons per fully connected layer, to process the data features and obtain similar performance, whereas a deeper architecture requires fewer neurons per layer but costs more computational units (Basha et al. 2020). Thus, the experiment tests different configurations of the fully connected layers of the CNN model with VGG16 to observe the impact on model performance.

The model configuration was varied from 4 to 9 fully connected layers, with the bottom layer of the model corresponding to the classification of the five ionogram classes, as shown in Table 11.

Table 11 Evaluation results of different sizes of fully connected layer on CNN model
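A sketch of how the fully connected head can be varied from 4 to 9 layers on top of a frozen VGG16 base is given below; the per-layer widths and tapering scheme are illustrative assumptions, with only the number of layers and the five-class output following the text.

```python
# Hedged sketch: build CNN heads with a variable number of fully connected layers.
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16

def build_classifier(n_fc_layers, n_classes=5):
    base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
    base.trainable = False
    model = models.Sequential([base, layers.Flatten()])
    # n_fc_layers dense layers in total; the last is the 5-class softmax output.
    width = 512
    for _ in range(n_fc_layers - 1):
        model.add(layers.Dense(width, activation="relu"))
        width = max(width // 2, 32)   # taper the widths (assumed scheme)
    model.add(layers.Dense(n_classes, activation="softmax"))
    return model

candidate_models = {n: build_classifier(n) for n in range(4, 10)}   # 4 to 9 layers
```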

After adjusting and tuning the model structures, the results showed that the deeper architectures achieved higher accuracy scores (Zhong et al. 2019), with slightly lower scores for the shallower architectures. The results also varied between training iterations of the CNN model, giving different accuracy scores, which means that more experiments with these configurations are needed. Nevertheless, these modifications enabled the CNN model to reach a higher accuracy for the ionogram classification task than the original architecture.

Data augmentation

ImageDataGenerator provides various properties to configure the data augmentation process. This augmentation allows the model to be trained with more diverse ionogram data and makes it possible to set the probability of applying a filter. This probability was evaluated over the range of 20%, 40%, 60%, 80%, and 100%. The experiment with ImageDataGenerator was conducted on the CNN model with 4 to 9 fully connected layers, following the layer optimization process, to observe the impact of each image filter precisely; the outcomes are listed in Table 12.

Table 12 Results of the CNN model with ImageDataGenerator with filters on different size of fully connected layer

Table 12 shows that the models with 7, 8, and 9 fully connected layers achieved the same accuracy of 92.0% with the fast-NLmeans, bilateral, and median blur filters, respectively, at 20% filter utilization. These models are then combined with a generalization technique, dropout, to assess further improvement.

Model regularization technique

After training and testing the CNN model with ImageDataGenerator, the model regularization technique called dropout is utilized. Dropout is mainly used to improve generalization and prevent overfitting. During the training process, dropout randomly selects a subset of neurons in each layer and temporarily disables them by setting their outputs to zero with probability P, while the remaining neurons are kept with probability Q. Dropout helps the CNN model learn more robust and generalizable representations (Mhaskar and Poggio 2016; Khan et al. 2019). The next experiment evaluates the impact of the dropout technique on the chosen models by testing dropout rates from 10 to 90% in steps of 10% on the selected CNN models with ImageDataGenerator, as shown in Table 13.

Table 13 Performance of CNN model with dropout on the selected models
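A minimal sketch of this dropout sweep is given below; the dense-head width and the flattened VGG16 feature size of 7 × 7 × 512 (for a 224 × 224 input) are assumptions for illustration.

```python
# Hedged sketch: sweep dropout rates from 10% to 90% on a dense classification head.
from tensorflow.keras import layers, models
import tensorflow as tf

def head_with_dropout(rate, n_classes=5):
    # Dense head applied to flattened VGG16 features; dropout sits between layers.
    return models.Sequential([
        tf.keras.Input(shape=(7 * 7 * 512,)),   # flattened VGG16 feature size (assumed)
        layers.Dense(256, activation="relu"),
        layers.Dropout(rate),                   # zero outputs with probability `rate`
        layers.Dense(n_classes, activation="softmax"),
    ])

heads = {r / 10: head_with_dropout(r / 10) for r in range(1, 10)}   # 0.1 to 0.9
```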

After evaluating the models with the dropout technique in Table 13, the results showed that the 7-layer model with 30% dropout achieved the highest accuracy of 92.9%, with F1-scores of 98.0 for the clear class, 85.1 for FSF, 90.7 for RSF, 66.7 for SSF, and 99.2 for the unidentified class, as shown in Table 14. The 8-layer model with 10% dropout followed with an accuracy of 92.2%. However, dropout failed to improve the 9-layer model, which achieved an accuracy of 91.8%, lower than the same model without dropout.

Table 14 Results of 7-layer CNN model with ImageDataGenerator and dropout (30%)

In this work, the dropout technique improved the performance of the 7- and 8-layer models, whereas the 9-layer model achieved a lower accuracy. This suggests that the 9-layer model requires no regularization, as dropout may cause significant information loss during training and hinder learning, resulting in lower accuracy and performance.

With the tuned CNN model augmented with the fast-NLmeans filter for feature extraction and the dropout technique for regularization, the model performance improved efficiently, achieving the highest accuracy of 92.9% in classifying ionogram images. Nonetheless, the model struggled to classify the SSF class effectively, because SSF is rarely observed at the station, giving an extremely low number of samples for training and validation and severely impacting the model performance. The FSF class also achieves low accuracy, possibly for the same reason.

SVM and CNN model performances comparison

After tuning and applying the data preprocessing methods to the SVM and CNN models, the CNN model performed better than the SVM model, with accuracies of 92.9% and 87.0%, respectively, as shown in Table 15. Both models achieved high accuracies in classifying the clear and unidentified classes, because these classes not only have the simplest characteristics and patterns but also a high number of samples, allowing both models to learn and perform very well. The CNN model exhibited higher performance than the SVM model for both the clear and unidentified classes.

Table 15 Model performance comparison between SVM and CNN models

Based on the results, the complexity of each spread-F characteristic presents a significant challenge for precise classification. The CNN model classifies the FSF class better than the SVM model by 13.2%. As mentioned, the FSF class has many varieties of characteristics, which is why it has a lower accuracy than the other spreading patterns. Conversely, the SVM model classified the RSF and SSF classes better than the CNN model by 3.2% and 7.6%, respectively. In the case of the RSF class, the CNN might have overfitted to noisy data, leading to false classifications. The SSF class contains a lot of noise, so neither model can learn its patterns and features sufficiently, and it also has the fewest training samples, resulting in low accuracy. Nevertheless, the CNN model classified all ionogram classes with better overall accuracy than the SVM model.

The SVM and CNN classification models in this work can classify five types of ionogram images, whereas the SVM and CNN with VGG16 models of Luwanga et al. (2022) classify two types of digital ionogram: ionograms with and without spread-F. In addition, the ionogram images recorded by the FMCW-type ionosonde at Chumphon station have a lower data resolution and require manual interpretation, which introduces human error in some cases, making it more complicated to compare the data and results between the FMCW-type and digital ionosondes. Moreover, if ionogram types other than those used in this research are employed, new models must be trained on data specific to those ionograms. Other kinds of ionogram images might have similar trace and spread-F patterns; however, the background noise and other plotting characteristics might differ because of the different recording stations and ionosonde types. Nonetheless, the weights obtained in this research, trained for ionogram classification, can potentially enhance the learning efficiency and classification accuracy of models applied to other kinds of ionogram images.

Conclusions

In this work, SVM and CNN models were evaluated as automatic ionogram image classification models to classify a total of 4,692 ionogram images into five classes: clear (1,320 images), FSF (760 images), RSF (1,620 images), SSF (126 images), and unidentified (866 images), using image filtering (image sharpening, image thresholding, median blur, gamma correction, fast-NLmeans, and bilateral filters) for feature extraction. The SVM model was first tuned using hyperparameter tuning to determine the appropriate parameters, and the K-means clustering technique was then applied before the image filters; the results showed that this method improved the classification performance. For the CNN model, the optimizer, learning rate, and size of the dense layers were tuned before utilizing ImageDataGenerator as the data augmentation technique, which improved the classification performance satisfactorily. From the analysis, the performances of the CNN and SVM models in classifying ionogram images are satisfactory. The CNN model achieved a classification accuracy of 92.9%, outperforming the SVM model at 87.0%. However, the SVM outperforms the CNN in classifying the RSF and SSF classes, especially the SSF class, due to the limited number of SSF samples for the CNN model: finding a hyperplane with a maximum margin prioritizes the characteristics of the data over the quantity of samples, enabling the SVM to generalize effectively even with limited data. On the other hand, the CNN outperforms in classifying the clear, FSF, and unidentified classes, benefiting from its more complex algorithm and the larger sample sizes of these classes compared with the SSF class. Regardless of the models and preprocessing techniques, the data set size remains crucial for model performance. Obtaining an ideal data set is challenging due to varying spread-F occurrence probabilities, but increasing the number of samples will certainly improve the classification performance. The classification models from this research can be utilized for the FMCW-type ionosondes in the SEALION Project (Maruyama et al. 2007) or other stations employing the same system. For ionograms obtained from other ionosonde types, the weights obtained in this work can be applied in further studies. In summary, the ionogram image classification models using SVM and CNN classify the ionogram images in the given data set admirably: the SVM model performs well in certain scenarios, while the CNN model exhibits stronger overall performance. In addition, these models will be useful for the automatic scaling of spread-F for the detection and classification of Equatorial Plasma Bubbles (EPBs), particularly during the approaching solar maximum, with practical implications for ionospheric science and applications, enhancing the ability to understand ionospheric phenomena.

Availability of data and materials

The ionogram data in this research are primarily provided by the National Institute of Information and Communications Technology (NICT) in Japan.

Abbreviations

SVM:

Support vector machine

CNN:

Convolutional neural network

NICT:

National Institute of Information and Communications Technology

SEALION:

Southeast Asia low-latitude ionospheric network

FSF:

Frequency spread-F

RSF:

Range spread-F

MSF:

Mixed spread-F

SSF:

Strong spread-F

EPB:

Equatorial plasma bubbles

FMCW:

Frequency-modulated continuous wave

GPS:

Global positioning systems

DIAS:

Deep-learning method for ionogram automatic scaling

FPN:

Feature pyramid network

IoU:

Intersection over union

VGG:

Visual geometry group

ResNet:

Residual network

EfficientNet:

Efficient neural network

ViT:

Vision transformer

MobileNet:

Mobile neural network

Fast-NLmeans:

Fast non-local means filter

SGD:

Stochastic gradient descent

RBF:

Radial basis function

FC:

Fully connected

References


Acknowledgements

Thanks to the National Institute of Information and Communications Technology (NICT), Japan, for providing the ionogram data. In addition, thanks to Mr. Phimmasone Thammavongsy for fruitful discussions and advice.

Funding

This research was financially supported by the Fundamental Fund of Khon Kaen University and the Research Fund of the Faculty of Engineering, Khon Kaen University (contract No. Mas. Ee-2566/1).

Author information

Authors and Affiliations

Authors

Contributions

Experimental designs were evaluated by PBT and AKT. The first draft of the manuscript was written by PBT, AKT, PSS, and PSN. The manuscript revisions were completed by PBT, AKT, and PSN. AST reviewed the manuscript. AST and AKT provided research funding support for this work. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Anan Kruesubthaworn.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article


Cite this article

Benchawattananon, P., Siritaratiwat, A., Supnithi, P. et al. Automatic classification of spread‐F types in ionogram images using support vector machine and convolutional neural network. Earth Planets Space 76, 56 (2024). https://doi.org/10.1186/s40623-024-02002-x


