Testing various seismic potential models for hazard estimation against a historical earthquake catalog in Japan
- Wahyu Triyoso^{1, 2} and
- Kunihiko Shimazaki^{1}
https://doi.org/10.5047/eps.2011.02.003
- Received: 4 June 2010
- Accepted: 2 February 2011
- Published: 27 August 2012
Abstract
The classic zoning method and spatial smoothing of seismicity were used with seismicity, GPS, and late Quaternary fault data to develop time-invariant seismic potential models of shallow crustal earthquakes in the Japanese islands that were then tested against a 400-year Japanese historical earthquake catalog. The results demonstrated that the models so developed for seismic hazard estimation did not necessarily reproduce the observed seismicity. In some cases they were even worse than the Reference Model that assumes a uniform earthquake potential over all of the Japanese islands. A subsequent analysis of the original dataset once it had been divided into two subsets based on time indicated that the present-day spatial distribution of small earthquakes and surface horizontal strain are much affected by previous large earthquakes. Two sources of information were the most effective: regionalized seismicity of small earthquakes and the active fault data. Two models using each of them were not only successful, but also robust. A model combining the distributions of small and moderate-size earthquakes proposed by Frankel in 1995 was also effective for modeling the distributed sources, which are unrelated to the faults. In this study, we tested the spatial variation of the likelihood of large earthquakes with M ≥ 6.8.
Key words
- Seismic hazard
- seismic potential
- model testing
- crustal earthquake
1. Introduction
An earthquake potential model plays a key role in seismic hazard analysis. It has long been realized that an evaluation of a potential source model is essential to accurate earthquake forecasting, but realization of this technique has required a wait of many decades, until data on an adequate number of large earthquakes were available for evaluation. For moderate-sized earthquakes, regionalized earthquake likelihood models are now being prospectively tested (e.g., Jordan, 2006; Field, 2007; Schorlemmer et al., 2007; Nanjo et al., 2011). A time-invariant model can be used not only to study future earthquakes, but also those in the past. The aim of the study reported here was to evaluate a time-invariant source model retrospectively by testing it against a 400-year historical earthquake catalog in the Japanese islands.
Various models utilizing different methods have been developed to estimate seismic hazard. In this study, we use both the classic regionalization method (Cornell, 1968) as well as a spatial smoothing technique proposed by Frankel (1995), information on strain accumulation proposed by Ward (1994), and a method based on estimated slip rate and other parameters of late Quaternary faults (e.g., Wesnousky et al., 1984) for constructing source models for shallow crustal earthquake hazards in Japan.
For the construction of various long-term earthquake potential models, we used the following datasets: an instrumentally recorded Japan Meteorological Agency (JMA) catalog of small and moderate-size earthquakes for 1926–1997, a seismotectonic zoning map of Japan (Property and Casualty Insurance Rating Organization of Japan, 2000), GPS data of the Geographical Survey Institute for 1994–1999, and active fault data (Kumamoto, 1997).
The Akaike information criterion (AIC; Akaike, 1974) is used to quantitatively evaluate the models. Historical data on inland large earthquakes during the past 400 years are used to calculate the likelihood of realization of the spatial distribution of the large events for all models. The difference in AIC value is calculated between a certain model and the Reference Model that assumes a spatially uniform earthquake potential. A model with the largest difference in AIC was ultimately chosen as the most successful model.
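The comparison described above can be sketched as follows. This is a minimal illustration and not the authors' code. It assumes that each model is reduced to a grid of cell weights, that the log-likelihood is the sum of the log-probabilities of the cells containing the observed events, and that δAIC is defined as AIC(Reference) minus AIC(model), so that positive values favor the model. The function names, the toy grid, and the parameter counts are hypothetical.

```python
import math

def log_likelihood(cell_weights, event_cells):
    """Sum of log-probabilities of the cells in which events occurred.

    cell_weights: non-negative weights, one per grid cell (assumed layout).
    event_cells: cell indices, one per observed large earthquake.
    """
    total = sum(cell_weights)
    return sum(math.log(cell_weights[i] / total) for i in event_cells)

def aic(log_l, n_params):
    """Akaike information criterion: AIC = -2 ln L + 2 k."""
    return -2.0 * log_l + 2.0 * n_params

def delta_aic(model_weights, event_cells, n_params_model=0, n_params_ref=0):
    """AIC(reference) - AIC(model); the reference is uniform over all cells.

    The parameter counts default to zero, which is an assumption of this sketch.
    """
    uniform = [1.0] * len(model_weights)
    aic_ref = aic(log_likelihood(uniform, event_cells), n_params_ref)
    aic_model = aic(log_likelihood(model_weights, event_cells), n_params_model)
    return aic_ref - aic_model

# Toy example (hypothetical): four cells, three events in cells 0, 0, and 1.
toy_model = [0.5, 0.3, 0.1, 0.1]
toy_events = [0, 0, 1]
toy_delta = delta_aic(toy_model, toy_events)
```

With equal parameter counts, δAIC in this sketch is simply twice the log-likelihood ratio between the model and the uniform reference.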
Finally, we divide the historical earthquake datasets into two data subsets and test the models against each subset separately to evaluate the robustness of the models.
2. Data
Two earthquake catalogs are assembled from JMA catalogs for the construction of earthquake potential models. Since the target is a large shallow inland earthquake, for our catalogs we select earthquakes from the JMA catalogs that are not deeper than 20 km, and we exclude events occurring off shore. One catalog, referred to as the “small earthquake catalog”, includes events with magnitude ≥3.0 that occurred between 1980 and 1997. Relatively high-quality observations of small earthquakes started at the beginning of 1980. The second catalog, referred to as the “moderate-size earthquake catalog”, consists of earthquakes with a magnitude ≥5.0 that occurred from 1926 to 1997.
We use the zoning map of Property and Casualty Insurance Rating Organization of Japan (2000), which is mainly based on a seismotectonic-zoning map proposed by Kakimi et al. (1994, 2003).
The GPS data used in this study were provided by the Geographical Survey Institute (GSI) for the period 1994–1999. We first extract the long-term average velocity at each GPS site and then estimate the surface strain rate by using the least squares collocation technique (El-Fiky et al., 1997; Kato et al., 1998) in which Gaussian spatial smoothing with a correlation distance of 100 km was applied for noise reduction (Shimazaki and Zhao, 2000).
The characteristic earthquake model (Wesnousky et al., 1983; Schwartz and Coppersmith, 1984) is used to evaluate seismicity based on the late Quaternary fault data. A synthetic catalog of large shallow crustal earthquakes is produced from Kumamoto’s (1997) datasets of late Quaternary faults which contain data on the epicenter, magnitude, and annual frequency.
3. Models
3.1 Smoothing Models
Following the methodology proposed by Frankel (1995), we use a Gaussian function to smooth the seismicity. Model S is based on spatially smoothed a-values derived from the declustered small earthquake catalog. The a-value shows the activity level in the Gutenberg-Richter relation. We use a correlation distance of 50 km for the smoothing. In this model, events with a magnitude of ≥3 are assumed to illuminate areas of faulting that can produce a destructive earthquake (Frankel, 1995). Model M uses the declustered moderate-size earthquake catalog and a correlation distance of 75 km. This model assumes that a future large event will occur close to where moderate-size earthquakes have occurred in the past. Model U, which is identical to the Reference Model, assumes uniform seismic potential. The aim of this model is to quantify large earthquake potential in areas that have not shown significant seismicity during the period for which instrumental records are available, but which could very well produce a sizeable earthquake in the future (Frankel, 1995). The Smoothing F Model is a combination of the three models, i.e., Models S, M, and U. We adopt Frankel’s (1995) weighting factors of 0.5, 0.25, and 0.25 for Models S, M, and U, respectively. Similarly, the Smoothing S Model is constructed by combining Models S and U, with weighting factors of 0.75 and 0.25, respectively. The Smoothing M Model is a combination of Models M and U, with weighting factors of 0.75 and 0.25, respectively.
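The smoothing and weighting steps can be illustrated with a short sketch. It is not the authors' implementation. It assumes a normalized Gaussian kernel of the form exp(−d²/c²), with c the correlation distance, and it assumes that each component model is normalized to unit sum before weighting. The grid, the per-cell counts, and all function names are hypothetical.

```python
import math

def gaussian_smooth(values, coords, c_km):
    """Normalized Gaussian smoothing with kernel exp(-d^2 / c^2) (assumed form)."""
    smoothed = []
    for xi, yi in coords:
        num = 0.0
        den = 0.0
        for (xj, yj), v in zip(coords, values):
            w = math.exp(-((xi - xj) ** 2 + (yi - yj) ** 2) / c_km ** 2)
            num += w * v
            den += w
        smoothed.append(num / den)
    return smoothed

def normalize(values):
    """Scale the cell values so that they sum to one."""
    total = sum(values)
    return [v / total for v in values]

def combine(models, weights):
    """Cell-by-cell weighted sum of several normalized models."""
    n = len(models[0])
    return [sum(w * m[i] for m, w in zip(models, weights)) for i in range(n)]

# Hypothetical 10 x 10 grid of 10-km cells (cell-center coordinates in km).
coords = [(10.0 * i, 10.0 * j) for i in range(10) for j in range(10)]
n_cells = len(coords)

# Hypothetical per-cell activity for the two catalogs.
small_counts = [0.0] * n_cells
small_counts[12] = 20.0
small_counts[77] = 5.0
moderate_counts = [0.0] * n_cells
moderate_counts[77] = 3.0

model_s = normalize(gaussian_smooth(small_counts, coords, 50.0))     # 50 km
model_m = normalize(gaussian_smooth(moderate_counts, coords, 75.0))  # 75 km
model_u = [1.0 / n_cells] * n_cells                                  # uniform

smoothing_f = combine([model_s, model_m, model_u], [0.5, 0.25, 0.25])
smoothing_s = combine([model_s, model_u], [0.75, 0.25])
smoothing_m = combine([model_m, model_u], [0.75, 0.25])
```

Because Model U enters every combination with weight 0.25, no cell in a combined model falls below one quarter of the uniform probability, which keeps the likelihood finite for events in previously quiet cells.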
3.2 Zoning Models
The Zoning P Model utilizes the zoning map and a- and b-values in each zone as proposed by Property and Casualty Insurance Rating Organization of Japan (PCIRO) (2000). PCIRO’s zoning map is based on the seismo-tectonic map of Kakimi et al. (1994, 2003). PCIRO (2000) estimated the a- and b-values from seismicity for 1885 to 1995. We construct two other models on the basis of the small and moderate-size earthquake catalogs, referred to here as the Zoning S and Zoning M Models, respectively. Only the a-value in each zone is estimated from the catalogs since the b-value is assumed to be 0.85.
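As a hedged illustration of how a zone's a-value and the fixed b-value of 0.85 translate into a rate of large events, the sketch below uses the standard Gutenberg-Richter relation log10 N(≥M) = a − bM. The event count and the function names are hypothetical, and the text does not state that the authors computed rates in exactly this way.

```python
import math

def a_from_count(n_events, years, m_min, b_value):
    """Annual a-value such that the rate of events with M >= m_min is n_events / years."""
    return math.log10(n_events / years) + b_value * m_min

def gr_rate(a_value, b_value, magnitude):
    """Annual rate of events with magnitude >= `magnitude` (Gutenberg-Richter)."""
    return 10.0 ** (a_value - b_value * magnitude)

# Hypothetical zone: 180 events with M >= 3.0 in the 18 years 1980-1997.
b_fixed = 0.85
a_zone = a_from_count(180, 18, 3.0, b_fixed)
rate_m3 = gr_rate(a_zone, b_fixed, 3.0)    # recovers 10 events per year
rate_m68 = gr_rate(a_zone, b_fixed, 6.8)   # extrapolated rate of M >= 6.8
```

Since the b-value is the same in every zone, the relative potential of two zones for M ≥ 6.8 events equals their relative rate of small events in this sketch.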
3.3 GPS Model
The surface strain rates derived from the GPS data show relatively high rates along the Pacific coasts due to the subduction of the Pacific and Philippine Sea plates. Most of the accumulated strain will be released by large earthquakes off shore (Shimazaki, 1974). Thus, the subduction effects are removed for the evaluation of inland seismicity, as shown in the Appendix.
3.4 Fault Model
We spatially smooth the synthetic seismicity with a correlation distance of 75 km to obtain Model F. We then construct the Fault Model by combining Models F and U with weighting factors of 0.75 and 0.25, respectively.
4. Testing Models
4.1 Historical earthquake catalog
For testing the earthquake potential models we use about 400 years of data on Japanese historical earthquakes from 1596 to 2000 in Usami’s (2003) catalog. On Hokkaido, the northernmost of the four major Japanese islands, datasets on historical earthquakes cover only the past 150 years; therefore, Hokkaido is excluded from the assessment of the models. Events occurring off shore are also excluded, as are dependent events, i.e., aftershocks of the 1923 Kanto earthquake, deep earthquakes, and the Odawara earthquake of 1782, for which a tsunami was reported (Tsuji, 1986).
Figure 1 shows the distribution of the historical inland large earthquakes used for testing. The magnitudes of all earthquakes tested are ≥6.8 because, based on the cumulative magnitude-frequency distribution, we can judge that the data are more or less complete for this magnitude range. Data for 1926 through to 1997 are excluded because seismicity data for this time period were used to construct the models, and the historical data for testing should be independent of the data used in model construction.
However, as some after-effects of large historical earthquakes may exist, we divide the whole dataset into two subsets: one covering the period 1801–1925, and the other covering all other periods (left panel of Fig. 1). If a large historical event has century-long after-effects, some models may have a good correlation with data from the 19th to early 20th century. We also divide the data into two subsets (right panel of Fig. 1) on the basis of whether events are related to a late Quaternary fault or not (Odagiri and Shimazaki, 2001). The Fault Model should be able to successfully reproduce the fault-related data.
A total of 40 historical earthquakes are used: 18 took place between 1801 and 1925 and 22 occurred in other time periods; 17 correlated with late Quaternary faults and 23 were uncorrelated. If the two classifications are completely independent, we can expect seven to eight fault-related earthquakes for the period 1801–1925. However, there are ten such events. Thus, there exists a slight correlation between the fault-related earthquakes and the events occurring in 1801–1925.
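The figure of seven to eight expected events follows from the counts given above. The short calculation below is only a check of that arithmetic under the stated independence assumption. The variable names are illustrative.

```python
# Counts quoted in the text.
n_total = 40           # all historical earthquakes used
n_1801_1925 = 18       # events in 1801-1925
n_fault_related = 17   # events correlated with late Quaternary faults
observed_both = 10     # fault-related events in 1801-1925

# Under independence, the expected number in both groups is the product
# of the two marginal fractions times the total.
expected_both = n_total * (n_fault_related / n_total) * (n_1801_1925 / n_total)
excess = observed_both - expected_both
```

The expected value of about 7.65 lies between seven and eight, so the ten observed events represent a modest excess of roughly two events.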
4.2 Evaluation
5. Results
Table 1. Comparison of different models.
Models | δAIC |
---|---|
Smoothing F | 5.4 |
Smoothing S | 4.5 |
Smoothing M | −2.1 |
Fault | 10.0 |
Zoning P | −6.4 |
Zoning S | 9.6 |
Zoning M | −11.4 |
GPS | 0.1 |
Nonetheless, there exists a significant difference between models. The difference in δAIC between the best and the worst models is >20, which is equivalent to a difference of 10 in the log-likelihood. Nominally, the best model is roughly 20,000 times more likely than the worst model to reproduce the observed spatial distribution of large earthquakes.
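The factor of roughly 20,000 can be checked with a few lines. The sketch assumes that the models being compared have the same number of parameters, so that a difference in AIC equals twice the difference in log-likelihood. The variable names are illustrative.

```python
import math

# delta-AIC of the best (Fault) and worst (Zoning M) models in Table 1.
best_delta_aic = 10.0
worst_delta_aic = -11.4
spread = best_delta_aic - worst_delta_aic   # exceeds 20

# With equal parameter counts, a difference of 20 in AIC is 10 in ln L.
log_l_diff = 20.0 / 2.0
likelihood_ratio = math.exp(log_l_diff)     # about 22,026
```

Using the full spread from Table 1 rather than the rounded value of 20 would give an even larger nominal ratio.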
Table 2. Comparison of models for groups of events related and unrelated to a late Quaternary fault.
Models | δAIC Fault related | δAIC Not related |
---|---|---|
Smoothing F | −0.4 | 5.9 |
Smoothing S | 0.1 | 4.3 |
Smoothing M | −5.7 | 3.6 |
Fault | 6.1 | 3.8 |
Zoning P | −6.1 | −0.4 |
Zoning S | 4.5 | 5.1 |
Zoning M | −10.3 | −1.0 |
GPS | 4.0 | −3.9 |
Table 3. Comparison of models for groups of events occurring in 1801–1925 and other periods.
Models | δAIC 1801–1925 | δAIC Other periods |
---|---|---|
Smoothing F | 3.6 | 1.8 |
Smoothing S | 5.7 | −1.3 |
Smoothing M | −2.7 | 0.5 |
Fault | 7.8 | 2.1 |
Zoning P | −3.8 | −2.7 |
Zoning S | 5.1 | 4.4 |
Zoning M | −7.7 | −3.7 |
GPS | 5.1 | −5.1 |
6. Discussion
It may be argued that the calculated likelihood is inaccurate for a large event since its source zone could be much larger than the size of one cell (10 × 10 km) and that, therefore, not just one cell but all cells in the source zone should be included. However, the actual source zone of most historical earthquakes is unknown. The uncertainty of the likelihood would not be large since all of the models are spatially smoothed by the Gaussian function or composed of wide zones. The correlation distance used in this study ranges from 50 to 100 km, and the likelihood varies only slightly with these distances. Thus, we neglect the finite extent of the source zone in this study.
It is surprising that one-half of the models show poor results that are not much better than the Reference Model (Table 1). The Smoothing M and Zoning M Models are based on the moderate-size earthquake catalog. Although the observation period is much longer than that of the small earthquake catalog, 72 vs. 18 years, the results based on the moderate-size earthquake catalog are far less successful. Despite declustering, moderate-size earthquakes tend to cluster near large earthquakes during the observation period, i.e., 1926–1997, where historical earthquakes rarely occurred. In other words, we find no recurrence of large earthquakes in the same cell during the past 400 years. However, the Smoothing M and Zoning M Models do show high occurrence probability near the epicenters of large earthquakes for 1926–1997 and fail to reproduce the spatial distribution of large earthquakes in other time periods (Table 1).
The Smoothing S and Zoning S Models are based on the small earthquake catalog. Both models appear to be successful as a whole (Table 1), but they show contrasting results when the historical catalog is divided into two different time-periods (Table 3). The Zoning S Model seems robust, while the Smoothing S Model is not.
A comparison of the last two columns in Table 3 reveals that the GPS Model has the same time-dependency as the Smoothing S Model, namely, a successful result for 1801–1925, but a poor result for the other time-period. It is very likely that the large earthquakes in the earlier time period affect the activity of present-day small earthquakes and the surface strain. Since current seismicity near the source region of the 1891 Nobi earthquake still follows the Omori-Utsu aftershock formula (Utsu, 1961), century-long after-effects of large events may not be surprising. The viscous response of the lower crust would also cause lingering strain accumulation near the source region of a large earthquake, as was observed after the 1896 Riku-u earthquake (Thatcher et al., 1980). If the historical earthquake catalog were short, allowing only earthquakes between 1801 and 1925 to be used in the analysis, this effect could be overlooked, and a different and wrong conclusion could be reached. We should note that both present-day small earthquakes and surface strain are greatly affected by large earthquakes that occurred about a century ago.
It is to be expected that the Fault Model successfully reproduces historical seismicity (Table 1). However, it also successfully reproduces the data of earthquakes uncorrelated with late Quaternary faults (Table 2). This latter result is rather surprising since recent large shallow crustal earthquakes tend to occur in areas where no late Quaternary faults have been mapped and the importance of a “blind fault” is emphasized (e.g., Toda and Awata, 2008). However, the Fault Model is constructed by the spatial smoothing of synthetic events based on the late Quaternary fault data. In comparison with the correlation distance of 75 km used for the smoothing, those events took place not far from the nearest mapped fault and, therefore, high occurrence probabilities are estimated in areas of blind faulting. The Fault Model also shows robustness for the different time-period data (Table 3).
The Zoning S Model is the most robust (Tables 2 and 3) and successful model, possibly indicating that both the large quantity of small earthquakes and geological knowledge of zoning are important keys for predicting the spatial variation of large shallow crustal earthquakes. Detailed knowledge of the late Quaternary faults is unnecessary to construct this model.
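The robustness comparison can be reproduced from the tabulated values. The sketch below encodes the δAIC values of Tables 2 and 3 and lists the models whose δAIC is positive in all four subsets. Treating "positive in every subset" as the robustness criterion is an assumption of this sketch, not a definition given in the text.

```python
# delta-AIC per model: (fault related, not related, 1801-1925, other periods),
# copied from Tables 2 and 3.
delta_aic_by_subset = {
    "Smoothing F": (-0.4, 5.9, 3.6, 1.8),
    "Smoothing S": (0.1, 4.3, 5.7, -1.3),
    "Smoothing M": (-5.7, 3.6, -2.7, 0.5),
    "Fault": (6.1, 3.8, 7.8, 2.1),
    "Zoning P": (-6.1, -0.4, -3.8, -2.7),
    "Zoning S": (4.5, 5.1, 5.1, 4.4),
    "Zoning M": (-10.3, -1.0, -7.7, -3.7),
    "GPS": (4.0, -3.9, 5.1, -5.1),
}

# Assumed criterion: a model is "robust" if it beats the Reference Model
# (delta-AIC > 0) in every one of the four subsets.
robust_models = [
    name for name, values in delta_aic_by_subset.items()
    if all(v > 0 for v in values)
]

# Smallest delta-AIC across subsets, as a simple worst-case score.
worst_case = {name: min(values) for name, values in delta_aic_by_subset.items()}
```

Under this criterion only the Fault and Zoning S Models qualify, and the Zoning S Model has the highest worst-case value (4.4).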
The Smoothing F Model is also successful, although its δAIC for the entire historical dataset is not as high as those for the Fault and Zoning S Models. Frankel (1995) proposed using fault data and other data for a source model of large events with magnitude > 7.0 and using the spatial smoothing technique for smaller events. As such, it is acceptable to test the Smoothing F Model against the historical dataset of events unrelated to late Quaternary faults (Table 2). We find that the best model of distributed sources is the Smoothing F Model. The weighting factors used for combining different models seem to have an unpredictable effect because the δAIC of the Smoothing F Model is larger than the sum of those of the Smoothing S and Smoothing M Models (Table 1).
The Zoning P Model employs the original a- and b-values in each zone proposed by the Property and Casualty Insurance Rating Organization of Japan (2000). Tables 1, 2, and 3 show that the δAIC of this model is negative in all cases. Since the zoning method was used for distributed sources, it is logical to test the Zoning P Model against the historical data of events uncorrelated with late Quaternary faults (Table 2). However, δAIC is nearly zero and, consequently, the model does not provide much information on the spatial distribution of large events. Since the Zoning S Model is successful (Table 2), the zoning itself should not be blamed. Property and Casualty Insurance Rating Organization of Japan (2000) obtained the a- and b-values from seismicity data for 1885 to 1995. It is likely that the migration of large earthquakes over time is one cause of the poor result.
7. Conclusions
Eight seismic potential models of large shallow crustal events in the Japanese islands are constructed by zoning and spatial smoothing techniques with seismicity, GPS, and late Quaternary fault data and then tested against historical data. We find that the model based on a large quantity of small earthquake data combined with seismo-tectonic zoning is the most robust and successful model. The model based on the late Quaternary fault data is also reliable. Frankel’s (1995) method of combining catalogs of small and moderate-size earthquakes is found to be effective for distributed sources.
A century-long after-effect of large earthquakes on seismicity and surface strain is inferred from the results obtained from testing of the models based on seismicity and GPS data. If a historical catalog is not long enough, this effect may not be detected.
Large earthquakes have not recurred in the same place for the past 400 years. Thus, the earthquake potential model showing a similar spatial distribution to what we observe now for large earthquakes will give a rather poor forecast of future earthquakes.
Declarations
Acknowledgements
We thank Prof. Takashi Kumamoto for fault data and Dr. Eric Nana Oware for his help in handling the GPS data. We also acknowledge the comments of two reviewers (Prof. Honn Kao and an anonymous reviewer) and the editor (Dr. Kazu Z. Nanjo).
References
