A convolutional neural network-based classification of local earthquakes and tectonic tremors in Sanriku-oki, Japan, using S-net data

Takahashi, Hidenobu; Tateiwa, Kazuya; Yano, Keisuke; Kano, Masayuki

doi:10.1186/s40623-021-01524-y

Express Letter
Open access
Published: 15 October 2021

Earth, Planets and Space volume 73, Article number: 186 (2021)

Abstract

Low-frequency tremors have been widely detected in many tectonic zones and are often located adjacent to megathrust zones, indicating that their spatiotemporal evolution provides important insights into megathrust events. The envelope correlation method (ECM) is commonly used to detect tremors. However, the ECM also detects regular earthquakes, so the two types of signal must be separated after the initial detection. In addition, tremor signals are weak, so distinguishing tremors from noise is also an essential problem. We develop a convolutional neural network (CNN)-based method that uses a single S-net station located off the Sanriku region, Northeast Japan, to classify local earthquakes, tremors, and noise. Along the Japan Trench, especially in the region studied here, local earthquakes and tremors coexist within a small area, so the detection, location, and discrimination of these events are key to understanding the relationship between slow and regular earthquakes. The spectrograms of the three-component velocity waveforms recorded from 16 August 2016 to 14 August 2018 are used as the training and test datasets for the CNN. The CNN successfully classified 100%, 96%, and 98% of the earthquakes, tremors, and noise, respectively. We also successfully applied our method to continuous waveform data that include a tremor, to explore the feasibility of classifying tremors and noise in continuous streaming data. The output probabilities for the true classifications decrease with increasing epicentral distance and/or decreasing event magnitude. This highlights the need to train the CNN using tremors proximal to the seismic stations when detecting tremors with multiple stations.

Introduction

Slow earthquakes, which are characterized by a longer duration than regular earthquakes of the same magnitude, have been widely detected in many tectonic zones (e.g., Obara and Kato 2016). There have been increasing observational reports that slow earthquake activity has sometimes preceded megathrust earthquakes (Kato et al. 2012; Ito et al. 2013; Graham et al. 2014; Ruiz et al. 2014; Radiguet et al. 2016; Socquet et al. 2017; Voss et al. 2018). This activity has been detected adjacent to the coseismic source regions of some megathrusts, suggesting that slow earthquakes may potentially provide the necessary stress loading to trigger these megathrust ruptures. Therefore, the detection of slow earthquakes is one of the key criteria for advancing our understanding of tectonic processes in subduction zones.

Tectonic tremors are slow earthquakes with dominant frequencies of ~ 2–10 Hz. Tectonic tremors are often detected via the envelope correlation method (ECM) (e.g., Obara 2002) using inland and/or ocean-bottom seismometers. The ECM utilizes the similarity of the envelopes of the observed waveforms among different stations because it is usually difficult to identify clear P- and S-wave onsets in the tremor signal. While the ECM is a powerful tool for tremor detection, it is also effective in detecting regular earthquake signals. Therefore, a visual inspection of the seismic waveforms is necessary to identify the tremor signals, which are then used to create tremor catalogs.

A machine learning approach is one way to reduce the manual cost of waveform inspections, especially for large seismic data volumes (e.g., Kong et al. 2019). Nakano et al. (2019) developed a supervised convolutional neural network (CNN)-based method to classify local earthquakes, tremors, and noise using the spectrograms obtained by the DONET ocean seismometers deployed in the Nankai subduction zone, Southwest Japan, as input images for the CNN, and achieved an event classification accuracy of 99.5%. The known effectiveness of CNNs in extracting features from input images (e.g., Krizhevsky et al. 2012; Szegedy et al. 2015; Goodfellow et al. 2016) allowed Nakano et al. (2019) to tailor a CNN for tremor detection because tectonic tremors have characteristic dominant frequencies of 2–10 Hz.

The S-net seismic network, a seafloor observation network that covers a wide area of the Japan Trench subduction zone, has recently been established in Northeast Japan (Fig. 1a) (NIED 2019). Short-period velocity seismometers have been deployed at intervals of a few tens of kilometers in this network. Nishikawa et al. (2019) and Tanaka et al. (2019) have investigated local tremor activity using the S-net observations. While these previous studies detected tremors based on the ECM, we apply the CNN-based approach developed by Nakano et al. (2019) to the S-net velocity records. However, the S-net seismometers have a natural frequency of 15 Hz and are therefore less sensitive to the dominant frequencies of tremors (2–10 Hz) than the DONET seismometers (Nakano et al. 2019). We therefore modify the structure and retrain the parameters of the CNN developed by Nakano et al. (2019) for application to the S-net data. We attempt to classify known local earthquakes, tremors, and noise via the CNN using a single station to investigate how our CNN-based method works, with the ultimate goal of developing a comprehensive tremor-detection approach. We note that it is important to discriminate between local earthquakes and tremors along the Japan Trench subduction zone because regular and slow earthquakes coexist within a small region in spite of their different source processes (Ide 2008). In contrast to regular earthquake signals, tremor signals are weak, and tremors could easily be mislabeled as noise when the station is far from the source. Therefore, using a near-source station is key to detecting and classifying tremors accurately.

Data

This study utilizes a CNN for classifying the spectrograms of local earthquakes, tectonic tremors, and noise observed by S-net, which are hereafter labeled EQ, T, and N, respectively. We used the three-component velocity waveforms that were recorded at a single station, N.S4N21, at a sampling frequency of 100 Hz during the period from 16 August 2016 to 14 August 2018. This station was selected based on its location above the plate boundary where tremors frequently occur (Fig. 1a). Training datasets with labeled EQ, T, and N events are necessary since CNNs take a supervised approach. We therefore used the Japan Meteorological Agency (JMA) unified catalog and the tremor catalog created by Nishikawa et al. (2019) for EQ and T event identification, respectively. We selected the events whose epicenters were close to station N.S4N21, which include those located within the area indicated by the rectangle in Fig. 1a. We set the reference time of each image based on the second of the origin time for EQ events and on the minute of the origin time for T events. We used the minute rather than the second for tremors so that the window captures the whole excited part of the tremor signal, because, unlike for regular earthquakes, the cataloged origin time of a tremor does not necessarily precede the excitation of the signal. The training dataset of N was constructed by visually inspecting seismograms every 10 min for 1 day in each month (i.e., the 20th day of each month in this study) and choosing the times when the seismograms did not contain transient signals from tremors or regular earthquakes.

To create the images of EQ, T, and N, we selected 117.76-s time windows that started from each reference time defined above. The spectrograms, i.e., the power spectral density (PSD) as a function of time, were then calculated for the waveforms using twenty 20.48-s moving time windows in the 2–10 Hz frequency range with a lag of 5.12 s. We corrected the PSD for the amplitude characteristics of the sensor in the frequency domain by using a damping constant of 0.707 and a natural frequency of 15 Hz. Each spectrogram was normalized to ensure that the input image pixels were in the 0–1 range. Each spectrogram was composed of 20 × 165 pixels (time domain × frequency domain). Figure 1b shows examples of earthquake and tremor waveforms with their corresponding spectrograms. The average duration of T events in the catalog is about 44 s, as shown by Nishikawa et al. (2019), so the window contains the whole excited part of the tremor signal. We divided the data period into two datasets for the CNN analysis, with the 16 August 2016–2 December 2017 waveforms used for training and the 3 December 2017–14 August 2018 waveforms used for validation. Because images with low signal-to-noise ratios can cause mislabeling in the training dataset, we selected qualified images in each category (EQ, T, and N) from the training dataset as follows. We first calculated the difference between the maximum and minimum log₁₀(PSD) values for each spectrogram. We then selected the images whose differences were higher than the third quartile value for the local earthquakes and tremors, and the images whose differences were lower than the first quartile value for noise. Hereafter, we call these events selected events, and events that do not satisfy the abovementioned criteria non-selected events. We confirmed that, without this selection step, our CNN yielded a poor classification accuracy.
After this data selection step (Table 1), we obtained 210, 531, and 468 training events that were labeled EQ, T, and N, respectively. The numbers of labeled EQ, T, and N events are 91, 208, and 118, respectively, for the validation dataset (Table 1). Each event is composed of three-component PSD images. Finally, we normalized log₁₀(PSD) for each event image over the 0–1 range (see Fig. 1b for an example) to extract the features in the frequency domain.
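The windowing, instrument correction, normalization, and quartile-based selection described above can be sketched as follows. This is a minimal NumPy illustration under stated assumptions, not the authors' processing code: the function names, the Hann taper, the periodogram scaling, and the damped-oscillator form of the sensor response are all assumptions introduced here. With this simple binning the 2–10 Hz band contains 164 frequency bins, close to but not identical to the 165 reported in the text; the difference presumably reflects how the band edges were handled.

```python
import numpy as np

FS = 100.0                        # sampling frequency (Hz), from the text
WIN = 2048                        # 20.48-s sub-window, in samples
LAG = 512                         # 5.12-s lag, in samples
N_WIN = 20                        # number of sub-windows per image
TOTAL = WIN + (N_WIN - 1) * LAG   # 11776 samples = 117.76 s
F0, DAMPING = 15.0, 0.707         # natural frequency and damping, from the text


def velocity_response(freq, f0=F0, h=DAMPING):
    """Amplitude response of a damped-oscillator velocity sensor (assumed form)."""
    r = freq / f0
    return r**2 / np.sqrt((1.0 - r**2) ** 2 + (2.0 * h * r) ** 2)


def spectrogram(trace):
    """Return a (20, n_freq) image of log10(PSD) scaled to 0-1 and its dynamic range."""
    freqs = np.fft.rfftfreq(WIN, d=1.0 / FS)
    band = (freqs >= 2.0) & (freqs <= 10.0)
    taper = np.hanning(WIN)       # taper choice is an assumption
    rows = []
    for k in range(N_WIN):
        seg = trace[k * LAG : k * LAG + WIN]
        seg = (seg - seg.mean()) * taper
        # Periodogram estimate; constant factors cancel after the 0-1 scaling.
        psd = np.abs(np.fft.rfft(seg)) ** 2 / (FS * np.sum(taper**2))
        # Remove the sensor response (power is corrected by the squared amplitude).
        psd = psd[band] / velocity_response(freqs[band]) ** 2
        rows.append(np.log10(psd))
    img = np.array(rows)
    dyn_range = img.max() - img.min()   # max minus min of log10(PSD)
    return (img - img.min()) / dyn_range, dyn_range


def select(dyn_ranges, kind):
    """Boolean mask: above the third quartile for EQ/T, below the first quartile for N."""
    d = np.asarray(dyn_ranges, dtype=float)
    if kind == "N":
        return d < np.percentile(d, 25)
    return d > np.percentile(d, 75)
```

In use, `spectrogram` would be applied to each of the three components of an event, and `select` to the dynamic ranges collected within one category of the training dataset.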

Table 1 Matrix for the training and validation data of EQ–T–N events

Methods

CNNs (Fukushima 1980; LeCun et al. 1989) are a de facto standard for extracting non-linear features by learning a large number of digital filters in combination with fully connected neural networks. CNNs employ these learnable filters to provide a function that maps an input image into an output vector. The inputs for our CNN are two-dimensional (2D) images of the three-component spectrograms, and the output is a three-dimensional vector containing the EQ, T, and N probabilities for the input data.

CNNs consist of a series of layers that sequentially process the input data as follows (Fig. 2): (a) convolve the input images using a set of learnable filters; (b) downsample the convolution outputs; and (c) apply a non-linear transformation known as the activation function to the downsampled outputs. The first step (a) is referred to as the convolution layer. Let $\{{w}_{t}:t=1,\dots ,T\}$ be an input waveform consisting of T samples, and let $\{{h}_{i}:i=1,\dots ,F\}$ be a learnable filter of length F. The convolution with this filter can be represented as follows:

$$\sum_{i=1}^{F}{h}_{i}{w}_{t+i}, \quad t=1,\dots ,T. \tag{1}$$

Here, each filter $\{{h}_{i}:i=1,\dots ,F\}$ is optimized during the training step (i.e., a learnable filter), and specific features are identified in the original input images. The second step (b) is known as the pooling layer. The pooling layer drastically reduces the dimensionality (i.e., the number of parameters and computations in the network) to avoid overfitting due to excess parameters. This step also introduces both a local translation invariance and robustness to local perturbations (for details, see section 9.3 in Goodfellow et al. (2016) and references therein). We adopt max pooling, which is a pooling operation that calculates the maximum value for each point of each waveform that is convolved with a learnable filter. The max pooling operation with size $P$, on a waveform $\{{\tilde{w }}_{t} :t=1,\dots ,T\}$ returns:

$$\left\{\max\left\{{\tilde{w}}_{t+m} : m=0,\dots ,P-1\right\} : t=1,\dots ,T\right\}, \tag{2}$$

where we use zero padding (${\tilde{w }}_{T+i}=0$) for $i=1,\dots ,P$ to ensure that the output dimension of the pooling operation is the same as its input dimension. Max pooling highlights the sharp contrasts in the convolved waveform. The third step (c) is the activation layer, which enables the network to capture complex features by making the output non-linear. We adopt a rectified linear unit (ReLU; $x\mapsto {\text{max}}\{x,0\}$) (Fukushima 1980) as the activation function, which is commonly used in CNN-based seismological research (e.g., Perol et al. 2018; Ross et al. 2018).
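The three operations above can be sketched for a one-dimensional waveform as follows. This is a minimal NumPy illustration of Eqs. (1) and (2) and of the ReLU, not the authors' implementation. The function names `convolve`, `max_pool`, and `relu` and the toy input are assumptions introduced here. Zero padding of the waveform beyond sample T inside `convolve` is also an assumption, made because Eq. (1) references samples up to index T + F.

```python
import numpy as np


def convolve(w, h):
    """Eq. (1): out[t] = sum_{i=1..F} h_i * w_{t+i}, for t = 1..T (1-based indices)."""
    w = np.asarray(w, dtype=float)
    h = np.asarray(h, dtype=float)
    T, F = len(w), len(h)
    padded = np.concatenate([w, np.zeros(F)])   # assumed zero padding past sample T
    # In 0-based indexing, output k uses padded[k+1 : k+1+F].
    return np.array([np.dot(h, padded[k + 1 : k + 1 + F]) for k in range(T)])


def max_pool(x, P):
    """Eq. (2): out[t] = max(x_t, ..., x_{t+P-1}), with zeros appended so len(out) == len(x)."""
    x = np.asarray(x, dtype=float)
    padded = np.concatenate([x, np.zeros(P)])
    return np.array([padded[k : k + P].max() for k in range(len(x))])


def relu(x):
    """Rectified linear unit: x -> max(x, 0)."""
    return np.maximum(np.asarray(x, dtype=float), 0.0)


# Toy example: one filter of length F = 2, pooling size P = 2.
waveform = [1.0, 2.0, 3.0, 4.0]
filt = [1.0, -1.0]
convolved = convolve(waveform, filt)
pooled = max_pool(convolved, 2)
activated = relu(pooled)
```

The order used here (convolution, pooling, activation) follows steps (a) to (c) in the text. In the CNN itself the filters act on 2D spectrogram images rather than on a 1D waveform.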

We specified the CNN hyperparameters in this study as follows. Twenty-five learnable filters were employed in each of the first and second convolution layers, with filter lengths of 10.24 s and 0.29 Hz in the time and frequency domains, respectively. These parameters were chosen by comparing the performance with that obtained for other hyperparameter values: we tested numbers of learnable filters from 5 to 25, filter lengths in the time domain from 10.24 s to 51.2 s, and filter lengths in the frequency domain from 0.1 to 0.49 Hz. The max pooling lengths after the first and second convolution layers were 25.6 s in the time domain. Max pooling was not conducted in the frequency domain, following a previous study (Nakano et al. 2019). The number of units in the fully connected layer was 10. The batch size during the training step was 18. The learnable filters were trained by minimizing the cross-entropy loss via stochastic gradient descent optimization with momentum; a learning rate of 0.005 and a momentum of 0.9 were employed during the training step. The cross-entropy loss is a function that maps a pair of vectors, the true label vector $({y}_{2},{y}_{1},{y}_{0})$ (where ${y}_{2}=1$ for EQ and 0 otherwise; ${y}_{1}=1$ for T and 0 otherwise; and ${y}_{0}=1$ for N and 0 otherwise) and the output probability vector $({p}_{2},{p}_{1},{p}_{0})$ (with earthquake probability ${p}_{2}$, tremor probability ${p}_{1}$, and noise probability ${p}_{0}$), into $-{y}_{2}\log {p}_{2}-{y}_{1}\log {p}_{1}-{y}_{0}\log {p}_{0}$. Stochastic gradient descent optimization (Robbins and Monro 1951) with momentum (Qian 1999) uses a linear combination of the gradient multiplied by the learning rate and the previous update multiplied by the momentum as the next update. An l₂ penalty with a regularization parameter of 1.0 was added to the cross-entropy loss (Ng 2004).
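The loss and the update rule described above can be written out in a few lines. The sketch below is illustrative only: the function names are assumptions introduced here, the sign convention (the update is added to the parameter) is one common choice, and the l₂ penalty is omitted because its exact form is not given in the text.

```python
import numpy as np

LEARNING_RATE = 0.005   # from the text
MOMENTUM = 0.9          # from the text


def cross_entropy(y, p):
    """Cross-entropy for a one-hot label y = (y2, y1, y0) and probabilities p = (p2, p1, p0)."""
    y = np.asarray(y, dtype=float)
    p = np.asarray(p, dtype=float)
    return float(-np.sum(y * np.log(p)))


def sgd_momentum_step(param, grad, prev_update, lr=LEARNING_RATE, momentum=MOMENTUM):
    """Next update = momentum * previous update - learning rate * gradient."""
    update = momentum * prev_update - lr * grad
    return param + update, update


# A tremor (y1 = 1) to which the network assigns a tremor probability of 0.8.
loss = cross_entropy((0, 1, 0), (0.1, 0.8, 0.1))   # equals -log(0.8)

# Two successive updates of a single scalar parameter with a constant gradient of 2.0.
p1, u1 = sgd_momentum_step(1.0, 2.0, 0.0)
p2, u2 = sgd_momentum_step(p1, 2.0, u1)
```

With a constant gradient, the second update is larger in magnitude than the first because it carries 0.9 of the previous update. This accumulation is the effect the momentum term is meant to provide.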

Results

We begin the analysis by applying our CNN-based method to the validation dataset. Table 1 provides the confusion matrix for the EQ–T–N classification. We determine the predicted label for each input from the largest output probability; that is, our method labels the input according to the region in Fig. 3a in which its output probability vector falls. Our method successfully identified all of the EQ events (100%), and almost all of the T and N events (96% and 98%, respectively). Ternary plots of the output probabilities for each event type in the validation dataset are shown in Fig. 3b–d, with the output probabilities for each event type being concentrated mainly in the corner corresponding to the actual label of the event. Note that if we do not restrict the training and validation datasets to events selected based on signal-to-noise ratio, the rates of correct classification in the validation dataset drop to 98.6%, 80.9%, and 94.2% for EQ, T, and N, respectively, which suggests that the selection procedure is essential for robust classification.
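The labeling rule and the per-class rates quoted above can be reproduced from the output probabilities in a few lines. The following sketch uses a made-up set of four events purely for illustration; the probabilities, the function names, and the (EQ, T, N) column order are assumptions introduced here and are not values from the study.

```python
import numpy as np

LABELS = ("EQ", "T", "N")


def predict(probs):
    """Assign each event the label with the largest output probability."""
    return [LABELS[i] for i in np.argmax(np.asarray(probs), axis=1)]


def confusion_matrix(true_labels, predicted_labels):
    """Rows are true labels, columns are predicted labels, both in LABELS order."""
    index = {name: i for i, name in enumerate(LABELS)}
    matrix = np.zeros((3, 3), dtype=int)
    for t, p in zip(true_labels, predicted_labels):
        matrix[index[t], index[p]] += 1
    return matrix


def per_class_rate(matrix):
    """Fraction of each true class that received the correct label."""
    return matrix.diagonal() / matrix.sum(axis=1)


# Hypothetical output probabilities (EQ, T, N) for four events.
toy_probs = [
    [0.90, 0.05, 0.05],
    [0.20, 0.70, 0.10],
    [0.10, 0.30, 0.60],
    [0.10, 0.20, 0.70],
]
toy_true = ["EQ", "T", "T", "N"]
toy_pred = predict(toy_probs)
toy_matrix = confusion_matrix(toy_true, toy_pred)
toy_rates = per_class_rate(toy_matrix)
```

In this toy case one of the two tremors is labeled as noise, so the rate for T is 0.5 while the rates for EQ and N are 1.0.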

We then examined each event that received a label different from its catalog label (Events #1–#10) to assess whether it was truly misclassified. These misclassifications can be explained by the spectrogram features of the events; the X-component spectrograms of Events #1, #8, and #9 are shown in Fig. 3e. The spectrograms of the other misclassified events are summarized in Additional file 1: Fig. S1. Our CNN labeled Event #1 as an EQ event, whereas it is labeled as a T event by Nishikawa et al. (2019). A comparison of the Event #1 spectrogram with the typical EQ and T spectrograms in Fig. 1b shows that the Event #1 spectrogram has a peak in the relatively high-frequency range and resembles that of an EQ event. The Event #1 waveforms possess a P-wave onset at station N.S4N21 and S-wave onsets at neighboring stations, thereby implying that Event #1 is an EQ event (Additional file 1: Fig. S2a). Our CNN labeled Event #8 as N, whereas Event #8 was detected as a T event by Nishikawa et al. (2019). As shown in Fig. 3e, the Event #8 spectrogram does not contain signals with characteristic dominant frequencies in the 2–10 Hz range, but rather has a spectral peak at around 2–3 Hz. A possible reason for this misclassification is the large geometrical spreading due to the long epicentral distance of about 60 km. Tremor waveforms with a gradual rise time were observed at N.S4N24, the station nearest to the source (Additional file 1: Fig. S2b). Our CNN labeled Event #9 as a T event, whereas we labeled the event as N by visual inspection. The Event #9 spectrogram displays a signal with dominant frequencies in the 2–10 Hz range in the analyzed time window, which caused our CNN to give this event a T classification.

Next, we investigated how the output probabilities of the true event labels relate to the epicentral distance and to M_w. Figure 4 shows M_w versus epicentral distance for the local earthquakes and tremors, color-coded by output probability. The moment magnitudes of the EQ events range from 0.7 to 6.4, while those of the T events range from −1.33 to 0.35. Note that, as shown in Fig. 2, our CNN-based classification yields the probabilities of (EQ, T, N) for each event; to draw Fig. 4, we therefore picked the probability of the true event label from the output probabilities of each event. The presented images were selected via the procedure outlined in the “Data” section, and yielded high (> 0.5) output probabilities of the true event labels, as confirmed in the “Results” section. However, the number of local earthquakes with low probabilities (< 0.5) appears to increase in the non-selected dataset for events with larger epicentral distances and smaller magnitudes (blue histogram in Fig. 4a). The number of tremors with low probabilities (< 0.5) also appears to increase in the non-selected dataset with larger epicentral distances (blue histogram in Fig. 4b). We do not investigate the relationship between the M_w of tremors and their CNN probabilities in detail because the omission of the amplification factor from the tremor M_w estimations in Nishikawa et al. (2019) may have resulted in large uncertainties.

We used the signal-to-noise ratio as a threshold for selecting the training data. The training dataset includes many events with shorter epicentral distances since these events generally possess higher signal-to-noise ratios. Therefore, there was a higher probability of classifying the events in the validation dataset that possessed shorter epicentral distances into their true event labels (Fig. 4).

Discussion

In this section, we discuss three topics: the application to continuous data, the impact of noise on classification results, and the evaluation of our methods compared with other classifier methods.

We first discuss the application of our method to continuous data. We applied our method to 5-min continuous seismograms containing a tremor listed in Nishikawa et al. (2019). We successively created images with a lag time of 5.12 s, and applied our trained CNN to each image to obtain the probabilities for EQ, T, and N. Figure 5 shows the continuous spectrograms and the probabilities for EQ, T, and N. From the beginning of the observation at 13:48:00 (JST) to 13:48:50, no clear signal is visible in any of the images, each of which spans a time window of about 2 min. The probability of N is dominant in this period. When the probability of T begins to increase, at around 13:48:50, the corresponding image begins to contain the tremor excitation below 5 Hz. The high probability of T (> 0.9) continues for 1.5 min while the images include tremor signals. When the high probability of T ends, at around 13:50:30, the corresponding image contains the tremor signal at the edge of the time window. The tremor catalog of Nishikawa et al. (2019) determined the origin time of the tremor (13:50:37.8) based on the S-wave arrival time, taken as the maximum of the stacked envelopes after correcting for the source-to-station travel times. This origin time falls within the period when the probability of T is high. These results show that the CNN-based method successfully distinguished the tremor from the noise immediately before and after it, suggesting that CNN-based methods could be applied to continuous datasets in the future (e.g., to simultaneous occurrences, or to noise that cannot be explained as oceanic or cultural noise).
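The sliding-window procedure above can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: the 117.76-s window and 5.12-s lag are taken from the "Data" section, the function names are introduced here, and `dummy_classifier` is a hypothetical stand-in for the trained CNN that simply returns equal probabilities.

```python
import numpy as np

FS = 100.0             # sampling frequency (Hz)
WIN_SAMPLES = 11776    # 117.76-s image window
STEP_SAMPLES = 512     # 5.12-s lag between successive images


def window_starts(n_samples, win=WIN_SAMPLES, step=STEP_SAMPLES):
    """Start indices of all full windows that fit in a record of n_samples."""
    return list(range(0, n_samples - win + 1, step))


def classify_stream(trace, classifier):
    """Apply a classifier to each window; return start times (s) and probabilities."""
    starts = window_starts(len(trace))
    probs = np.array([classifier(trace[s : s + WIN_SAMPLES]) for s in starts])
    return np.array(starts) / FS, probs


def dummy_classifier(segment):
    """Hypothetical stand-in for the trained CNN: equal (EQ, T, N) probabilities."""
    return np.array([1.0, 1.0, 1.0]) / 3.0


# A 5-min record at 100 Hz holds 30000 samples.
record = np.zeros(30000)
times, probabilities = classify_stream(record, dummy_classifier)
```

For a 5-min record this yields 36 overlapping windows, the last of which starts 179.2 s after the beginning of the record. Each probability therefore describes a window of about 2 min rather than a single instant, which is why the high-probability interval for T begins and ends as the tremor enters and leaves the window.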

We next discuss the impact of noise on the classification results. We used a noise dataset spanning about 1 year as input, and as a result we successfully distinguished signals from noise with very high probabilities. However, as suggested by Takagi et al. (2020), the noise level shows changes of PSDs on the order of 2 above 1 Hz over 3 years at each S-net station along the Japan Trench. For a more reliable classification, we should consider the various noise patterns that occur during the observation period. To do so, we need to investigate temporal changes in noise characteristics in detail and select the training and validation noise datasets so that they are homogeneous.

We finally discuss the evaluation of our method compared with other classifiers. Single-station methods for detecting and classifying tectonic tremors have been proposed previously. Brudzinski and Allen (2007), Kao et al. (2007), and Sit et al. (2012) mainly focused on the dominant frequency of tremor (1–5 Hz) and successfully detected tremors in Cascadia using seismic records observed at a single station. These methods were only applied to hour-long tremor episodes. However, along the Japan Trench, tremors are detected with durations of dozens of seconds, and sometimes coexist with regular earthquakes. Therefore, we should focus not only on the dominant frequency band of tremor, but also on the length of the time window used for classification. Our CNN-based method can classify seismogram segments as short as about 120 s, suggesting a high temporal resolution of classification. Recently, Liu et al. (2019) distinguished tremors from local earthquakes and noise in Taiwan with high accuracies of 86.6% to 98.9% at three stations using a machine learning approach, the k-nearest neighbors classifier. In their method, 27 seismic features were calculated. They then identified the features that were most effective for separating tremor from noise, namely the maximum amplitude, number of peaks, and energy of the 2 to 8 Hz-filtered signals, and the number of peaks in the curve showing the temporal evolution of the median of the discrete Fourier transforms. These features could be used directly as inputs to CNN methods, which may lead to better classification.

Conclusion

We developed a CNN to classify local earthquakes, tremors, and noise that were observed by S-net in Northeast Japan. The CNN successfully classified 100%, 96%, and 98% of the local earthquakes, tectonic tremors, and noise, respectively. All of the misclassified events were thoroughly investigated to validate that our CNN-based approach yielded explainable classifications. We also showed an example of successful classification of continuous waveforms including tremor. The classification abilities for local earthquakes and tectonic tremors degraded as the epicentral distance increased and/or event magnitude decreased. The utilization of multiple stations appears to be a promising approach to circumvent this degradation, as the subsequent analysis would include a broad spatial distribution of earthquakes and tremors with various epicentral distances.

Availability of data and materials

We used S-net data provided by the National Research Institute for Earth Science and Disaster Resilience (http://www.hinet.bosai.go.jp/?LANG=en). The earthquake catalog is provided by JMA, and the tremor catalog is provided by Nishikawa et al. (2019). Both catalogs were downloaded from the “Slow Earthquake Database” (Kano et al. 2018; http://www-solid.eps.s.u-tokyo.ac.jp/~sloweq/).

Abbreviations

ECM: Envelope correlation method
CNN: Convolutional neural network
JMA: Japan Meteorological Agency
JST: Japan Standard Time
PSD: Power spectral density
ReLU: Rectified linear unit

References

Brudzinski MR, Allen RM (2007) Segmentation in episodic tremor and slip all along Cascadia. Geology 35:907–910. https://doi.org/10.1130/G23740A.1
Fukushima K (1980) Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36:193–202. https://doi.org/10.1007/BF00344251
Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge
Graham SE, DeMets C, Cabral-Cano E, Kostoglodov V, Walpersdorf A, Cotte N, Brudzinski M, McCaffrey R, Salazar-Tlaczani L (2014) GPS constraints on the 2011–2012 Oaxaca slow slip event that preceded the 2012 March 20 Ometepec earthquake, southern Mexico. Geophys J Int 197(3):1593–1607. https://doi.org/10.1093/gji/ggu019
Ide S (2008) A Brownian walk model for slow earthquakes. Geophys Res Lett 35:L17301. https://doi.org/10.1029/2008gl034821
Ito Y, Hino R, Kido M, Fujimoto H, Osada Y, Inazu D, Ohta Y, Iinuma T, Ohzono M, Miura S, Mishina M, Suzuki K, Tsuji T, Ashi J (2013) Episodic slow slip events in the Japan subduction zone before the 2011 Tohoku-Oki earthquake. Tectonophysics 600:14–26. https://doi.org/10.1016/j.tecto.2012.08.022
Kano M, Aso N, Matsuzawa T et al (2018) Development of a slow earthquake database. Seismol Res Lett 89:1566–1575. https://doi.org/10.1785/0220180021
Kao H, Thompson PJ, Rogers G, Dragert H, Spence G (2007) Automatic detection and characterization of seismic tremors in northern Cascadia. Geophys Res Lett 34:L16313. https://doi.org/10.1029/2007gl030822
Kato A, Obara K, Igarashi T, Tsuruoka H, Nakagawa S, Hirata N (2012) Propagation of slow slip leading up to the 2011 Mw 9.0 Tohoku-oki earthquake. Science 335:705–708. https://doi.org/10.1126/science.1215141
Kong Q, Trugman D, Ross Z, Bianco M, Meade B, Gerstoft P (2019) Machine learning in seismology: turning data into insights. Seismol Res Lett 90(1):3–14. https://doi.org/10.1785/0220180259
Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of neural information processing systems (NIPS 2012), 25:1097–1105
LeCun Y, Boser B, Denker J, Henderson D, Howard R, Hubbard W, Jackel L (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551. https://doi.org/10.1162/neco.1989.1.4.541
Liu YH, Yen TC, Chen KH, Chen Y, Yen YY, Yen HY (2019) Investigation of single-station classification for short tectonic tremor in Taiwan. J Geophys Res Solid Earth 124(8):8803–8822. https://doi.org/10.1029/2019JB017866
Nakano M, Sugiyama D, Hori T, Kuwatani T, Tsuboi S (2019) Discrimination of seismic signals from earthquakes and tectonic tremor by applying a convolutional neural network to running spectral images. Seismol Res Lett 90(2A):530–538. https://doi.org/10.1785/0220180279
National Research Institute for Earth Science and Disaster Resilience (NIED) (2019) NIED S-net. National Research Institute for Earth Science and Disaster Resilience. https://doi.org/10.17598/NIED.0007
Ng A (2004) Feature selection, L1 vs. L2 regularization, and rotational invariance. In: Proceedings of the twenty-first international conference on machine learning (ICML’04), Banff, Alberta, Canada. https://doi.org/10.1145/1015330.1015435
Nishikawa T, Matsuzawa T, Ohta K, Uchida N, Nishimura T, Ide S (2019) The slow earthquake spectrum in the Japan Trench illuminated by the S-net seafloor observatories. Science 365(6455):808–813. https://doi.org/10.1126/science.aax5618
Obara K (2002) Nonvolcanic deep tremor associated with subduction in southwest Japan. Science 296(5573):1679–1681. https://doi.org/10.1126/science.1070378
Obara K, Kato A (2016) Connecting slow earthquakes to huge earthquakes. Science 353(6296):253–257. https://doi.org/10.1126/science.aaf1512
Perol T, Gharbi M, Denolle M (2018) Convolutional neural network for earthquake detection and location. Sci Adv 4(2):e1700578. https://doi.org/10.1126/sciadv.1700578
Qian N (1999) On the momentum term in gradient descent learning algorithms. Neural Netw 12(1):145–151. https://doi.org/10.1016/S0893-6080(98)00116-6
Radiguet M, Perfettini H, Cotte N, Gualandi A, Valette B, Kostoglodov V, Lhomme T, Walpersdorf A, Cabral Cano E, Campillo M (2016) Triggering of the 2014 Mw 7.3 Papanoa earthquake by a slow slip event in Guerrero, Mexico. Nat Geosci 9:829–833. https://doi.org/10.1038/ngeo2817
Robbins H, Monro S (1951) A stochastic approximation method. Ann Math Stat 22(3):400–407
Ross Z, Meier M-A, Hauksson E (2018) P wave arrival picking and first-motion polarity determination with deep learning. J Geophys Res Solid Earth 123(6):5120–5129. https://doi.org/10.1029/2017JB015251
Ruiz S, Metois M, Fuenzalida A, Ruiz J, Leyton F, Grandin R, Vigny C, Madariaga R, Campos J (2014) Intense foreshocks and a slow slip event preceded the 2014 Iquique Mw 8.1 earthquake. Science 345(6201):1165–1169. https://doi.org/10.1126/science.1256074
Sit S, Brudzinski M, Kao H (2012) Detecting tectonic tremor through frequency scanning at a single station: application to the Cascadia margin. Earth Planet Sci Lett 353–354:134–144. https://doi.org/10.1016/j.epsl.2012.08.002
Socquet A, Valdes JP, Jara J, Cotton F, Walpersdorf A, Cotte N, Specht S, Ortega-Culaciati F, Carrizo D, Norabuena E (2017) An 8 month slow slip event triggers progressive nucleation of the 2014 Chile megathrust. Geophys Res Lett 44:4046–4053. https://doi.org/10.1002/2017GL073023
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of conference on computer vision and pattern recognition (CVPR 2015) 1–9
Takagi R, Toyokuni G, Chikasada N (2020) Ambient noise correlation analysis of S-net records: extracting surface wave signals below instrument noise levels. Geophys J Int 224:1640–1657. https://doi.org/10.1093/gji/ggaa548
Tanaka S, Matsuzawa T, Asano Y (2019) Shallow low-frequency tremor in the northern Japan Trench subduction zone. Geophys Res Lett 46(10):5217–5224. https://doi.org/10.1029/2019GL082817
Voss N, Dixon TH, Liu Z, Malservisi R, Protti M, Schwartz S (2018) Do slow slip events trigger large and great megathrust earthquakes? Sci Adv 4(10):eaat8472. https://doi.org/10.1126/sciadv.aat8472
Wessel P, Smith WHF, Scharroo R, Luis J, Wobbe F (2013) Generic mapping tools: improved version released. Eos Trans AGU 94(45):409–410. https://doi.org/10.1002/2013EO450001

Acknowledgements

We used S-net data provided by the National Research Institute for Earth Science and Disaster Resilience (NIED 2019). We used GMT (Wessel et al. 2013) to create the figure.

Funding

This study was supported by JP18K03796 and JP21K03694 Grant-in-Aid for Scientific Research (C), JP19H04620 Scientific Research on Innovative Areas “Science of Slow Earthquakes”, JST CREST Grant Number JPMJCR1763, MEXT Project for Seismology toward Research Innovation with Data of Earthquake (STAR-E) Grant Number JPJ010217, and JP21H05205 Grant-in-Aid for Transformative Research Areas (A) “Science of Slow-to-Fast Earthquakes”.

Author information

Hidenobu Takahashi
Present address: Central Research Institute of Electric Power Industry, 1646 Abiko, Abiko, Chiba, 270-1194, Japan

Authors and Affiliations

Graduate School of Science, Tohoku University, 6-3, Aramaki-aza-aoba, Aoba-ku, Sendai, 980-8578, Japan
Hidenobu Takahashi, Kazuya Tateiwa & Masayuki Kano
The Institute of Statistical Mathematics, 10-3 Midori cho, Tachikawa, Tokyo, 190-8562, Japan
Keisuke Yano

Authors

Hidenobu Takahashi
Kazuya Tateiwa
Keisuke Yano
Masayuki Kano

Contributions

HT and MK designed this paper. HT, KT, MK, and KY contributed to the interpretation and discussion. HT acquired and processed the data. KY designed the CNN. KT performed the classification using the CNN. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Hidenobu Takahashi.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

All the authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Spectrograms of Events #2, #3, #4, #5, #6, #7, and #10; the event locations are indicated in Fig. 3. The left, middle, and right panels for each event denote the X, Y, and Z components of the spectrogram, respectively.

Figure S2. X-component waveforms of Events #1 and #8 ((a) and (b), respectively) that were recorded at N.S4N21 (Fig. 1a) and the neighboring stations. The waveforms are bandpass filtered at 2–8 Hz.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Takahashi, H., Tateiwa, K., Yano, K. et al. A convolutional neural network-based classification of local earthquakes and tectonic tremors in Sanriku-oki, Japan, using S-net data. Earth Planets Space 73, 186 (2021). https://doi.org/10.1186/s40623-021-01524-y

Received: 05 June 2021
Accepted: 30 September 2021
Published: 15 October 2021
DOI: https://doi.org/10.1186/s40623-021-01524-y
