Issue 
J. Space Weather Space Clim.
Volume 11, 2021
Topical Issue  Space Weather research in the Digital Age and across the full data lifecycle



Article Number  38  
Number of page(s)  16  
DOI  https://doi.org/10.1051/swsc/2021021  
Published online  30 June 2021 
Research Article
Operational Dst index prediction model based on combination of artificial neural network and empirical model
^{1}
Korea Meteorological Administration, 61 16gil Yeouidaebangro, Dongjakgu, Seoul 07062, Republic of Korea
^{2}
Chungnam National University, 99 Daehakro, Yuseonggu, Daejeon 34134, Republic of Korea
^{3}
Korea Astronomy and Space Science Institute, 776 Daedeokdaero, Yuseonggu, Daejeon 34055, Republic of Korea
^{4}
University of Science and Technology, 217, Gajeongro, Yuseonggu, Daejeon 34113, Republic of Korea
^{5}
Chungbuk National University, 1 Chungdaero, SeowonGu, Cheongju, Chungbuk 28644, Republic of Korea
^{6}
York University, 4700 Keele St., Toronto, ON M3J 1P3, Canada
^{7}
National Meteorological Satellite Centre, KMA, 6418, Guamgil, Gwanghyewonmyeonl, Jincheongun, Chungcheongbukdo 27803, Republic of Korea
^{*} Corresponding author: jjlee@kasi.re.kr
Received:
12
June
2020
Accepted:
23
May
2021
In this paper, an operational Dst index prediction model is developed by combining empirical and Artificial Neural Network (ANN) models. ANN algorithms are widely used to predict space weather conditions. While they require a large amount of data for machine learning, largescale geomagnetic storms have not occurred sufficiently for the last 20 years, Advanced Composition Explorer (ACE) and Deep Space Climate Observatory (DSCOVR) mission operation period. Conversely, the empirical models are based on numerical equations derived from human intuition and are therefore applicable to extrapolate for large storms. In this study, we distinguish between Coronal Mass Ejection (CME) driven and Corotating Interaction Region (CIR) driven storms, estimate the minimum Dst values, and derive an equation for describing the recovery phase. The combined Korea Astronomy and Space Science Institute (KASI) Dst Prediction (KDP) model achieved better performance contrasted to ANN model only. This model could be used practically for space weather operation by extending prediction time to 24 h and updating the model output every hour.
Key words: Space weather model / Dst index prediction / Artificial neural network
© W. Park et al., Published by EDP Sciences 2021
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1 Introduction
Largescale interplanetary disturbances from the Sun interact with Earth’s magnetic field, resulting in severe space weather events, such as geomagnetic storms (Gonzalez et al., 1994; Ohtani et al., 2000; Bhaskar & Vichare, 2019). The Dst index is a representative index of geomagnetic activity in the space weather community (Sugiura, 1964; Rangarajan, 1989; Daglis et al., 1999; Wanliss & Showalter, 2006), and there have been numerous attempts to predict the Dst index (Lyons, 1998; Birn et al., 2001; Raeder & Maynard, 2001; Rastätter et al., 2013; Eastwood et al., 2017). Empirical and artificial neural network (ANN) models have been handled as important methods for predicting the Dst index (Murayama, 1982; Feldstein, 1992; Boyle et al., 1997; Kugblenu et al., 1999; Weigel, 2010). Many previous studies established that the southward component of the interplanetary magnetic field (IMF) presents the main cause of geomagnetic storms (e.g., Burton et al., 1975; Gonzalez et al., 1994; Echer et al., 2005).
Burton et al. (1975) described the Dst index using the following empirical formula:
where Dst* is the Dst index corrected by solar wind dynamic pressure. The term Q(t) is a function that expresses the rate at which the ring current is intensified by the duskward solar wind electric field (represented in the geocentric solar magnetospheric coordinate system); P_{dyn} is the solar wind dynamic pressure, τ is the decay time when the ring current weakens by 1/e, and b and c are coefficients determined by observational data. As Q(t) is determined solely by the solar wind, we can accurately predict the Dst index using these formulas if we can observe or predict solar wind states. Many Dst models have been derived from this model (Lundstedt et al., 2002), and there have been numerous studies based on the above empirical formula (O’Brien & McPherron, 2000; Wang et al., 2003) since Burton et al. (1975). The model for predicting the Dst index 1 h ahead has been significantly improved by Temerin & Li (2002, 2006).
In the 1990s, advances in computer technology led to ANN for predicting the Dst index (Freeman & Nagai, 1993; Lundstedt & Wintoft, 1994; Watanabe et al., 2002). These studies, which used solar wind data as input, initially utilized a simple feedforward ANN. Wu & Lundstedt (1996) applied a more complex Elman recurrent neural network. More data from solar wind observatories, Wind, and Advanced Composition Explorer (ACE), have been archived and used to improve the results obtained from neural networks. Currently, we can predict the Dst index 1 h ahead with relatively high accuracy. However, it is not clear whether this is sufficient for space weather operation, as it is not easy to determine the forecast time required for space weather applications. Obviously, more than 1 h of forecast time is required for space weather applications, changing scheduled satellite operations and the air routes in a polar region, and sending a warning signal to a complex communication system.
As Earth’s magnetic field is disturbed almost simultaneously with solar wind variations, it is not easy to predict longterm changes in Earth’s magnetic field using only the solar wind data observed at Lagrange point, L1, without forecasting solar winds. Lazzús et al. (2017) created a model that combined ANN with particle swarm optimization and predicted the Dst index 1–6 h ahead. Gruet et al. (2018) combined a long shortterm memory recurrent neural network with a Gaussian process to predict the Dst index up to 6 h ahead. Naturally, the prediction accuracy tends to decrease sharply as prediction time increases. In this study, we also tried to extend the leading time of prediction up to 24 h differently.
The ACE mission, launched in 1997 and replaced by the Deep Space Climate (DSCOVR) satellite in 2015 for realtime operations, collected considerable solar wind data that can be used by ANN algorithms to predict the Dst index. Nevertheless, it is doubtful whether sufficient data is available for artificial intelligence and machine learning. Since geomagnetic storms with the Dst index of less than −100 nT occur only a few times a year during the solar maximum and significantly fewer times during the solar minimum, more than 20 years of solar wind data appear to be insufficient for predicting geomagnetic storms (Watari, 2017; Makarov, 2018).
This paper presents three Dst prediction models, simple ANN, empirical model, and combination model (Korea Astronomy and Space Science Institute (KASI) Dst Prediction, KDP).The simple ANN is the baseline model for comparison. Here, we just tried to follow the existing neural network algorithm, so there are no significant advances and show a comparable prediction performance to the previous models, such as shown in Section 3. The empirical model (equations) indicates the intensity of a magnetic storm according to the degree of solar wind changes. Two kinds of geomagnetic storms are considered separately; coronal mass ejection (CME) driven and corotating interaction region (CIR) driven storms in Section 4. This preemptive information can help improve Dst prediction accuracy because a space weather operator can determine in advance whether CIR or CME causes future geomagnetic storms. Finally, the present study proposes a combination of empirical and ANN models in Section 5. The predicted values by the empirical model are used as an input parameter to the ANN model. Thus we could improve the prediction performance better than other models.
The method combining empirical and ANN models is similar to that of Revallo et al. (2014, 2015); the difference is in the empirical model. They used time histories of solar wind and magnetosphere interaction proposed by Romashets et al. (2008) a neural network input data. They constructed an analytical representation of magnetic fields in the region where the solar wind interacted with the earth’s magnetosphere and used the results as an input parameter for predicting the Dst index. On the other hand, we directly derived the Dst index’s prediction values from the empirical equations and applied them to the ANN model. Thus, the KDP model could extend the prediction time up to 24 h while Revallo et al. (2014, 2015) predict the Dst index just 1 h ahead. See and compare the model performance in Figure 9.
2 Data
For the development of a model, we use the solar wind data collected by the ACE and DSCOVR satellite and the Dst index as input values for the prediction model from 1999 to 2017. The Dst index is obtained from NASA (https://omniweb.gsfc.nasa.gov/ow.html), and the hourly average solar wind data are acquired from CDAWeb (https://cdaweb.gsfc.nasa.gov/index.html). For the model’s operation, current Dst and solar wind data are downloaded from World Data Center for Geomagnetism, Kyoto (http://wdc.kugi.kyotou.ac.jp/dst_realtime/presentmonth/index.html) and NOAA (https://services.swpc.noaa.gov/products/solarwind/).
Tables A1 and A2 in Appendix list the CME and CIRdriven geomagnetic storms from February 1999 to September 2017. Here, the geomagnetic storm is defined with a minimum Dst index of less than −50 nT. The event start time is defined as the maximum Dst time at the main phase, and the event end time is the recovery time exceeds −30 nT. These storm lists are mainly obtained from published papers (referenced therein), and some CIRdriven storms are added in this study. The effects of CME and CIR are known to be different (Liemohn et al., 2010). CME can potentially generate large geomagnetic storms, while CIR generally triggers minor storms. The empirical model developed in this study distinguishes the CME and CIRdriven storms, and the empirical model results are used as the input parameter of the combination model.
3 Artificial neural network (ANN) model
This section introduces a simple ANN model and shows its performance. The model is combined with an empirical model in Section 5. The ANN used in this study is the feedforward network (Gardner & Dorling, 1998) known as the most simple ANNtype algorithm (Haykin, 1998). The training technique of weights and biases uses an error backpropagation learning algorithm with a gradient descent learning method. The basic structure of this ANN is composed of an input layer (I), a hidden layer (H), and an output layer (O) as shown in Figure 1. The hidden layer uses a nonlinear hyperbolic tangent function, H = tanh(I), and the output layer uses the O = H type linear function. The following equations express these layers:
where H_{j} and O_{k} are the nodes of the hidden and output layers, respectively, wh_{i,j}, wo_{j,k} are the weights of the hidden and output layers, and b_{i} and b_{k} are biases, respectively. The suffixes i, j, and k denote the node numbers in the input, hidden, and output layers, and N and M denote the total number of nodes in the input and hidden layers, respectively.
Fig. 1 Artificial neural network architecture consisting of input, hidden, and output layers. 
We use the input parameters listed in Table 1 for the input layer. Note that the simple ANN model uses the data collected for storm and quiet time and does not distinguish CME and CIRdriven storms. We use the current observations, and the differences between the current observations and the observations obtained 1 or 2 h earlier. The use of these differences as input data helps predict the Dst index trend and, therefore, plays an important role in predicting the Dst value more than 1 h ahead. The differences are estimated based on hourly averaged solar wind data. We also predict 1–24 h later Dst index as the target values of the output layer. The output layer has one node, as shown in Figure 1, which means the simple ANN model consists of 24 models for each hour predictions in advance.
Simple artificial neural network input parameters.
We train the model by adjusting hyperparameters such as the node numbers of the input and hidden layers, learning rate, and learning cycle to optimize the neural network. The optimization selects the best condition by calculating the correlation coefficient and root mean square error (RMSE) between the predicted and actual values for all hyperparameters. The equations are given as follows:
where Y_{pre} is a predicted value, Y_{real} is a measured value, and N is the number of values.
For performance optimization, all hidden layers with node numbers of 7–50, a learning rate of 0.0001–0.01, and learning cycles of 1200–1500 were trained and validated (see Table 2). For the ANN training, we used the data observed in 1999, 2001, 2004, 2007, 2008, 2011, 2012, 2014, 2016, and 2017. For the model validation, we use 2000, 2002, 2006, and 2013, and for the test, we use 2003, 2005, 2009, 2015. A total of 53,839, 28,526, and 25,421 data points are used for training, validation, and testing, respectively. The data covered storm and quiet times.
Optimized hyperparameters of ANN models
The red dots (ANN model) in Figure 2 show the RMSE and the correlation coefficients obtained by comparing the prediction results with the observed Dst index during the test period. The statistical analysis is conducted for the entire period without dividing it into the storm and quiet period regardless of distinguishing between CME and CIRdriven geomagnetic storms. Compared with other models, this figure also shows the results from the various ANN models reported in the literature (Lazzús et al., 2017). As seen in Figure 2, many models predict the Dst index just 1 h ahead, and more advanced models (Wu & Lundstedt, 1997; Stepanova et al., 2005) can predict up to 6 h, while our model attempts to predict the Dst index up to 24 h. The initial version of the ANN shows similar or slightly lower performance than other models because no improved techniques are used.
Fig. 2 Comparison of the simple ANN model performance with other models: correlation coefficient (up) and RMSE (down) derived for test period. 
4 Empirical model
The relationship between the solar wind and the Dst index could be derived using an arbitrary empirical formula (McPherron & O’Brien, 2001). In deriving the equations, we regarded a geomagnetic storm as having only one main phase and one recovery phase to simplify the relationship between the measured Dst index and input parameters. However, this model could also be used to predict the multipeak storms when combined with the ANN model. The empirical model consists of three equations: predicting the minimum Dst index, finding the time reaching Dst Minimum, and expressing the recovery phases.
Equations (7) and (8) are used to determine the minimum Dst index (Dst_{min}) from the current solar wind condition caused by CME and CIR, respectively. While Burton et al. (1975) expressed the ring current injection rate in terms of the solar wind speed (V) and the southward IMF (Bs), in this work, we add the solar wind density (N), total IMF (Bt), and current Dst index to complete the equation. The solar wind parameters on the righthand side of equations (7) and (8) are measurements when the model starts to run each hour. Recall that this model is designed to produce new prediction values every hour for the space weather operation. These equations were derived by a polynomial fitting algorithm in commercial software for the storm period and added exponential terms for adjusting large storm trends. In deriving these equations, we fixed some arbitrary coefficients and fit a function to the measurements to find the unfixed coefficients. We do not think we found the optimum values describing minimum Dst because the equations have six input parameters. Note that there is still a large inconsistency between prediction and measurements in Figure 3. Nevertheless, these empirical equations effectively build the combination model, as shown in Section 5.
Fig. 3 Predicted minimum Dst (blue) and corresponding measured minimum Dst (black) with respect to V × Bs for CMEdriven (left) and CIRdriven (right) storms. The red lines show the values calculated from equations (7) and (8) with Bt = Bs, V = 500 km/s, and N = 30/cm^{3}. 
At the initial storm phase, the predicted Dst_{min} is mainly controlled by solar wind measurements because Dst_{current} is generally a small value, and the term, does not significantly contribute to the Dst_{min} prediction. However, as a big storm progresses, only solar wind data are not enough to describe geomagnetic storm developments. The exponential term adjusts the estimation of Dst_{min}. Figure 3 shows the minimum Dst predicted by using equations (7) and (8) (blue symbols), measured minimum Dst (black symbols), and trend (red line) of equations (7) and (8) under specific conditions, Bt = Bs, V = 500 km/s, and N = 30/cm^{3}. In these figures, the predicted Dst_{min} is calculated at the initial phase of the storm when V × Bs is just over 0.49 mV/m, and several hours later, minimum Dst measurement is obtained. As shown in Figure 3, the frequency of smaller geomagnetic storms is higher. For the prediction algorithms based on artificial intelligence, a larger weighting factor is applied to more frequent storms. Therefore, the ANN algorithm shows the worse prediction accuracy for the greater storms. In contrast, the empirical models are derived by human intuition and attempt to find a correlation between minimum Dst index and input parameters, regardless of the amount of data.
Next, we calculate the time (T_{min}) required to reach the minimum Dst obtained from equations (7) and (8). If we could estimate the Dst variation (dDst) for 1 h, we can estimate T_{min} as the following equation:
As the Dst index represents the variation in the geomagnetic field due to ring current development, the amount of change in the Dst index is proportional to the injection rate of the ring current particles. In this study, we adopt Wang et al. (2003) empirical model to derive the rate of change in Dst per hour (dDst) during the storm main phase.
where P (in nPa) is the solar wind dynamic pressure and P_{0} and γ are parameters provided by Wang et al. (2003). These parameters depend on the Dst index, as shown in Table 3. Here, the Dst index refers to a measurement at the current time.
Parameters which used for calculating the ring current injection rate, Q.
For more information about the model, refer to Wang et al. (2003). Now, we obtain the dDst, and the prediction equation for the main phase can be expressed as follows:
Here, we assumed the Dst index decrease linearly to Dst_{min} from Dst_{current}. While the Dst index changes during the main phase can be predicted using equation (11), the recovery phase’s equation is required to be derived. Figure 4 is a superposed epoch plot of the Dst index for the storms’ recovery phases listed in Tables A.1 and A.2 (Appendix). Here the minimum Dst is normalized to −1. We examined several fitting functions to derive an equation describing the Dst index trends for the recovery phase. We found that an equation represented by two exponential functions, such as equation (12), effectively follows the recovery phase trend. Here, equation (12) presented by the red line is not the averaged Figure 4 but just a plausible expression derived by intuition. Other researchers can get a different equation.
Fig. 4 The trend of the recovery phase of geomagnetic storms and results obtained from empirical equations. The Dst index is normalized by the absolute value of the minimum Dst index. Black solid lines show the normalized Dst index of the 184 events listed in Tables A.1 and A.2 in Appendix, and the red line is the result obtained from the recovery phase equation (12). 
From equation (11), we can get the profile of Dst variation for the storm main phase and from equation (12) for the recovery phase from 1 h to 24 h ahead. During a storm is progressing, Dst_{min} is updated every time, and if the predicted Dst_{min} is less than the current Dst, the empirical model recognizes the storm is in the recovery phase. Otherwise, if the Dst_{min} is larger than the current Dst, then the algorithm predicts another storm main phase. Thus, this model can generate multipeak storm prediction. These values are used as input parameters of the ANN model.
5 Combination model
The KASI Dst prediction (KDP) model effectively predicts the Dst index by combining the empirical and ANN models. First, we estimate the Dst index from 1 h to 24 h ahead using empirical equations (11) and (12). The prediction results from the empirical model are combined with the measurements listed in Table 1. For example, the 6th ANN model’s input parameters are the 6 h ahead prediction value from equations (11) and (12) and the measurements in Table 1. Hence, it is as if there are 24 independent models for predicting Dst for each hour. Even if the predicted Dst from the empirical model is just one among 15 inputs, note ANN gives greater weight to the empirical predictions. Therefore, just adding the empirical model outputs on the ANN’s input data improves the final prediction performance. The method of training and validation is the same as the original ANN model. Table 4 shows the combination model’s hyperparameters.
Optimized hyperparameters of ANN of combination models.
Figure 5 shows an example of how the KDP model predicts the Dst index during geomagnetic storms. A halo CME was observed at 08:06 UT on 18 November 2003 by Solar and Heliospheric Observatory (SOHO) Large Angle and Spectrometric Coronagraph (LASCO) C2. At this time, the CME’s speed was approximately 1150 km/s, and it would reach the Earth after approximately 32 h by simply dividing the SunEarth distance (150,000,000 km) by the solar wind speed (1150 km/s). However, there were big uncertainties at this moment whether the geomagnetic storm would occur by this CME, how much the Dst index would decrease, how fast it would reach the minimum Dst, and how long it would take recovery. At this moment, space weather operators set the KDP as CME model and wait for the CME shock arrival. The ACE satellite observed the CME shock at 07:26 UT on 20 November, indicating that the arrival time was approximately 47 h, approximately 15 h later than expected. However, the model did not predict geomagnetic storms because the IMF B_{z} component was northward at the shock arrival. In the bottom panel of Figure 5, each color dots indicate the Dst values predicted 1 h at each hour, and the lines show the following predicted values for 24 h with onehour interval. Note the first red dot indicates around zero, and no significant Dst changes are expected for 24 h. As time goes on, the colored dots and lines show the magnetic storm proceeding.
Fig. 5 Example of predicting the geomagnetic storm that occurred by CME on 20 November 2003. From top: (a) the solar wind density, (b) IMF total field, (c) IMF Bz, (d) solar wind speed, V × Bs, and (e) Dst index measured and predicted. In the bottom panel, the gray line shows the measured Dst index, the color dots indicate the value predicted 1 h at each hour, and the colored lines show predicted Dst values for 24 h with a onehour interval. 
As the IMF B_{z} component rapidly turned southward at 10:45 UT on 20 November, the KDP model started to predict a minor storm event, where a Dst minimum of −140 nT would be reached around 13:00 UT. As the IMF enhanced southward and the solar wind speed increased, the KDP model produced new predictions over time and provided results similar to actual Dst values. At 12:00 UT, the KDP model predicted that a Dst minimum of −430 nT would be reached around 20:00 UT. This value was similar to the measured value of −422 nT at 21:00 UT. At 21:00 UT on 20 November, the KDP model predicted that the storm would recover to −100 nT after one day. In this manner, space weather operators can deliver information about space weather conditions to users several hours before the minimum Dst is reached.
Figure 6 shows another example of KDP model prediction where the magnetic storm occurred by CIR on 5 August 2019. Generally, CIR could be identified by the corona holes on the solar UV images and produces minor magnetic storms. From 03:00 UT on 5 August, the solar wind speed increased slowly to about 700 km/s, similar speed to the CME storm shown in Figure 5. However, the IMF B_{z} was larger than −10 nT during the storm was progressing and caused different scale storms from the CMEdriven storm described in Figure 5. At the initial stage of the storm, the KDP model predicted the Dst index well. However, the solar wind speed increased extraordinary, and the model overestimates the minimum Dst with −80 nT while the measurement was larger than −50 nT. Nevertheless, the model successfully predicted the minor storm caused by CIR.
Fig. 6 Example of predicting the geomagnetic storm that occurred by CIR on 5 August 2019. Same format with Figure 5. 
Regarding the KDP model performance, Figure 7 shows the correlation coefficient and RMSE derived from equations (5) and (6) for the Dst index predicted using the empirical model, ANN, and combination model. Here, the statistical results are calculated only during the storm events listed in Tables A.1 and A.2 (Appendix) because we tried to develop the KDP model for space weather operation. In other literature, the space weather model’s performance has been shown for both storm and quiet periods. These algorithm performances would be governed by the quiet period because it is considerably longer than the storm period. However, from a space weather operator’s perspective, the important thing is how storm events are accurately predicted. We believe a better operational model should have good performance in predicting storm events. Thus, these results are obtained for the storm events in the test period. Figure 8 shows the KDP model’s correlation coefficients and RMSEs for training, validation, and test group to illustrate the model’s prediction capability. The higher correlation coefficient and smaller RMSE show that the Dst index is predicted more accurately using the combination model than the empirical and ANN models.
Fig. 7 Correlation coefficient and RMSE for the empirical model, ANN model, and combination (KDP) model. The performances were calculated only for the storm events in test group. 
Fig. 8 KDP model’s performance for training, validation, and test period. 
6 Discussion and conclusion
This paper introduces the KDP model that combined an ANN and an empirical model. The combination model showed better performance than each model in predicting the Dst index from 1 h to 24 h ahead, as shown in Figure 7. Therefore, we conclude that better space weather prediction can be achieved by using the combined models.
While this study aims to show an ANN model’s improvement by combining an empirical model, as shown in Figure 7, there have been demands for comparing the KDP model’s performance with others. While the ANN in the KDP model is trained only during the storm periods, we can find the input parameters even in quiet time. Thus, the KDP model can produce nonstorm predictions. Figure 9 shows the storm and quiet time results, implying the combination model has superior prediction performance than other developed models. However, such a comparison does not tell us which model is better or worse because the test periods and conditions are different. The persistence model shown in Figure 9 assumes the prediction values are the same as the current Dst. Figure 9 clearly shows the KDP model’s performance is better than the persistence model, and this model might be useful in space weather operation.
Fig. 9 Comparison of KDP model performance with other models: correlation coefficient (up) and RMSE (down). The red circles show prediction results from the KDP model for both quiet and storm time, and the red triangles indicate the model performance for only storm time. 
In addition to the persistence model, prediction efficiency is also used to validate the space weather model. Figure 10 shows the prediction efficiency of the KDP model calculated as below:
where Dst_{s}, , and stand for the measured, predicted, and mean Dst values respectively, and M is the length of the record. The prediction efficiency tests the ability of the model to predict the variation of Dst around the mean. A prediction efficiency of 1 is perfect agreement at all times. Prediction efficiencies less than or equal to zero do not provide useful predictions of the observations’ time variation. Figure 10 shows the prediction efficiency decrease as prediction time increases, while the model is still useful in predicting the 24hour Dst index.
Fig. 10 Prediction efficiency of KDP model for storm time only (red) and all test period (black). 
It should be noted that the ANN and empirical models used in this study are developed with a conventional method. The ANN is a commonly used feedforward network, and roughly estimated equations implement the empirical models. As shown in Figure 2, the ANN model is not superior to other existing models. As shown in Figure 3, the minimum Dst values are spread widely, and the empirical equations are not fitted well to the data. Nevertheless, the combined model yielded better results, which means the combination model could be significantly improved by adopting modernized ANN and sophisticate empirical models. Further research will improve the empirical equations for predicting the Dst index. If two improved models are combined, the prediction accuracy would be more increase. Thus, this paper encourages researchers to improve their neural network models by adopting empirical models.
Besides, by categorizing geomagnetic storms into CMEdriven and CIRdriven storms, we improved the model performance. Through such categorization, we can increase the correlation between the Dst index and input parameters. In a future study, the solar cycle and seasonal effects could also be considered to make an improved empirical model. Space weather operators can easily recognize whether CME or CIR will cause magnetic storms from solar images. However, sometimes CMEs could be embedded in a CIR and resulting in a compound storm event (AlShakarchi & Morgan, 2018). In this case, the decision for selecting CME or CIR mode remains to the operators. Even if they fail in identifying CME or CIRdriven storms, the KDP model predicts the Dst index anyhow by solar wind and current Dst.
The KDP model has been implemented to predict the Dst index 24 h ahead, and the prediction output is updated every hour for space weather operations. In general, space weather information is delivered to a professional group of users, such as radio communication companies, satellite operators, and flight dispatchers. Space weather users state that they can cope with severe space weather conditions in reasonable prediction time. For example, flight dispatchers (Privet communication with Korea air dispatchers) say they can change routes 3 h before a flight. It is well known that satellite launches can be delayed just a few minutes before launch. GEO satellite operators monitor their satellites for 24 h, and they can act quickly for emergencies. As shown in Figure 7, the KDP model can predict the Dst storms 6 h ahead with a correlation coefficient of 0.8 and an RMSE of 24 nT or less. Figure 11 shows the correlations between measurements and prediction values for 3hour, 6hour, 12hour, and 24hour ahead. The KDP model shows good correlations up to 6hour ahead. We emphasize these are acceptable characteristics for space weather operation. Thus, this model can be used as an operational model for delivering warning signals to space weather users.
Fig. 11 Correlation between measured and predicted Dst index for (a) 3hour (b) 6hour, (c) 12hour, and (d) 24hour ahead. 
We have noted that space weather users do not want to know only the minimum Dst index but the recovery time of storms. Users like to know when they can return to their daily lives. Even though the 24hour prediction correlation coefficient of the KDP model is not satisfactory at 0.38, we attempted to develop a model to track the storm’s progress using the 24hour prediction model and predicted the recovery phase. While the Dst index’s recovery does not mean that a space storm event has ended completely, it would be helpful information for space weather alerts.
Acknowledgments
The Dst index is obtained from NASA (https://omniweb.gsfc.nasa.gov/ow.html), and the hourly average solar wind data from ACE and DSCOVR are acquired from CDAWeb (https://cdaweb.gsfc.nasa.gov/index.html). For the model’s operation, current Dst and solar wind data are downloaded from World Data Center for Geomagnetism, Kyoto (http://wdc.kugi.kyotou.ac.jp/dst_realtime/presentmonth/index.html) and NOAA (https://services.swpc.noaa.gov/products/solarwind/).
This work was supported by the National Meteorological Satellite Centre (NMSC) of the Korea Meteorological Administration (KMA) through the “Geostationary Meteorological Satellite Ground Segment Development” research project. The editor thanks two anonymous reviewers for their assistance in evaluating this paper.
Appendix
CMEdriven geomagnetic storm events.
CIRdriven geomagnetic storm events.
References
 AlShakarchi DA, Morgan H. 2018. Properties of the HPSICMECIR interaction event of 9–10 September 2011. J Geophys Res (Space Phys) 123: 2535–2556. https://doi.org/10.1002/2017JA024849. [Google Scholar]
 Bhaskar A, Vichare G. 2019. Forecasting of SYMH and ASYH indices for geomagnetic storms of solar cycle 24 including St. Patrick’s day, 2015 storm using NARX neural network. J Space Weather Space Clim 9: A12. https://doi.org/10.1051/swsc/2019007. [Google Scholar]
 Bingham ST, Mouikis CG, Kistler LM, Boyd AJ, Paulson K, Farrugia CJ, Huang CL, Spence HE, Claudepierre SG, Kletzing C. 2018. The outer radiation belt response to the storm time development of seed electrons and chorus wave activity during CME and CIR driven storms. J Geophys Res 123: 10139–10157. https://doi.org/10.1029/2018JA025963. [Google Scholar]
 Birn J, Drake JF, Shay MA, Rogers BN, Denton RE, Hesse M, Kuznetsova M, Ma ZW, Bhattacharjee A, Otto A, Pritchett PL. 2001. Geospace environmental modeling (GEM) magnetic reconnection challenge. J Geophys Res 106(A3): 3715–3719. https://doi.org/10.1029/1999JA900449. [Google Scholar]
 Boyle CB, Reiff PH, Hairston MR. 1997. Empirical polar cap potentials. J Geophys Res 102(A1): 111–125. https://doi.org/10.1029/96JA01742. [Google Scholar]
 Burton RK, McPherron RL, Russell CT. 1975. An empirical relationship between interplanetary conditions and Dst. J Geophys Res 80: 4204–4214. https://doi.org/10.1029/JA080i031p04204. [Google Scholar]
 Daglis IA, Thorne RM, Baumjohann W, Orsini S. 1999. The terrestrial ring current: Origin, formation, and decay. Rev Geophys 37(4): 407–438. https://doi.org/10.1029/1999RG900009. [CrossRef] [Google Scholar]
 Eastwood J, Nakamura R, Turc L, Mejnertsen L, Hesse M. 2017. The scientific foundations of forecasting magnetospheric space weather. Space Sci Rev 212(3–4): 1221–1252. https://doi.org/10.1007/s1121401703998. [CrossRef] [Google Scholar]
 Echer E, Gonzalez WD, Guarnieri FL, Dal Lago A, Vieira LEA. 2005. Introduction to space weather. Adv Space Res 35: 855–865. https://doi.org/10.1016/j.asr.2005.02.098. [CrossRef] [Google Scholar]
 Freeman J, Nagai A. 1993. The Magnetospheric Specification and Forecast Model: Moving from real time to prediction. In: SolarTerrestrial Predictions IV, Proceeding of a Workshop at Ottawa, Canada, May 18–22, 1992, Vol 2, Hruska J, Shea MA, Smart DF, Heckman G, (Eds.), NOAA, Boulder. pp. 524–539. [Google Scholar]
 Feldstein YI. 1992. Modelling of the magnetic field of magnetospheric ring current as a function of interplanetary medium parameters. Space Sci Rev 59(1–2): 83–165. https://doi.org/10.1007/BF01262538. [Google Scholar]
 Gonzalez WD, Joselyn JA, Kamide Y, Kroehl HW, Rostoker G, Tsurutani BT, Vasyliunas VM. 1994. What is a geomagnetic storm? J Geophys Res 99(A4): 5771–5792. https://doi.org/10.1029/93JA02867. [NASA ADS] [CrossRef] [Google Scholar]
 Gardner M, Dorling S. 1998. Artificial neural networks (the multilayer perceptron) a review of applications in the atmospheric sciences. Atmos Environ 32(14): 2627–2636. https://doi.org/10.1016/S13522310(97)004470. [Google Scholar]
 Gruet MA, Chandorkar M, Sicard A, Camporeale E. 2018. Multiplehourahead forecast of the Dst index using a combination of long shortterm memory neural network and gaussian process. Space Weather 16: 1882–1896. https://doi.org/10.1029/2018SW001898. [Google Scholar]
 Haykin S. 1998. Neural Networks: A Comprehensive Foundation. Prentice Hall, Upper Saddle River, NJ. ISBN 0132733501. [Google Scholar]
 Kugblenu S, Taguchi S, Okuzawa T. 1999. Prediction of the geomagnetic storm associated Dst index using an artificial neural network algorithm. Earth Planet Space 51: 307–313. https://doi.org/10.1186/BF03352234. [Google Scholar]
 Lazzús JA, Vega P, Rojas P, Salfate I. 2017. Forecasting the Dst index using a swarmoptimized neural network. Space Weather 15: 1068–1089. https://doi.org/10.1002/2017SW001608. [Google Scholar]
 Lyons L. 1998. The geospace modeling program grand challenge. J Geophys Res 103(A7): 14781–14785. https://doi.org/10.1029/98JA00015. [Google Scholar]
 Lundstedt H, Wintoft P. 1994. Prediction of geomagnetic storms from solar wind data with the use of a neural network. Ann Geophys 12: 19. https://doi.org/10.1007/s0058599400192. [Google Scholar]
 Lundstedt H, Gleisner H, Wintoft P. 2002. Operational forecasts of the geomagnetic Dst index. Geophys Res Lett 29: 2181. https://doi.org/10.1029/2002GL016151. [Google Scholar]
 Liemohn MW, Jazowski M, Kozyra JU, Ganushkina N, Thomsen MF, Borovsky JE. 2010. CIR versus CME drivers of the ring current during intense magnetic storms. Proc R Soc London A 466(2123): 3305–3328. https://doi.org/10.1098/rspa.2010.0075. [Google Scholar]
 Matamba TM, Habarulema JB. 2018. Ionospheric responses to CME and CIRdriven geomagnetic storms along 30 E–40 E over the African sector from 2001 to 2015. Space Weather 16: 538–556. https://doi.org/10.1029/2017SW001754. [Google Scholar]
 Makarov GA. 2018. Heliolatitude regularities of magnetically disturbed days with daily average geomagnetic index Dst < −100 nT. Sol Terr Phys 4(3): 20–23. https://doi.org/10.12737/stp43201803. [Google Scholar]
 McPherron RL, O’Brien TP. 2001. Predicting geomagnetic activity: The Dst index. In: Space Weather Geophys Monogr Ser, vol. 125. Song P, Singer HJ, Siscoe GL (Eds.), AGU, Washington, DC. pp. 339–345. https://doi.org/10.1029/GM125p0339. [Google Scholar]
 Murayama T. 1982. Coupling function between solar wind parameters and geomagnetic indices. Reviews of Geophysics 20(3): 623–629. https://doi.org/10.1029/RG020i003p00623. [Google Scholar]
 O’Brien TP, McPherron RL. 2000. Forecasting the ring current index Dst in real time. J Atmos Sol Terr Phys 62: 1295–1299. https://doi.org/10.1016/S13646826(00)000729. [Google Scholar]
 Ohtani SI, Fujii R, Hesse M, Lysak RL. 2000. Magnetospheric current systems, vol 118. American Geophysical Union, Washington, DC. ISBN 9781118669006. [Google Scholar]
 Raeder J, Maynard N. 2001. Foreword (to Special Section on Proton Precipitation Into the Atmosphere). J Geophys Res 106(A1): 345–348. https://doi.org/10.1029/2000JA000600. [Google Scholar]
 Rangarajan G. 1989. Indices of geomagnetic activity. Geomagnetism 3: 323–384. [Google Scholar]
 Rastätter L, Kuznetsova MM, Glocer A, Welling D, Meng X, Raeder J, et al. 2013. Geospace environment modeling 2008–2009 challenge: Dst index. Space Weather 11(4): 187–205. https://doi.org/10.1002/swe.20036. [Google Scholar]
 Revallo M, Valach F, Hejda P, Bochníček J. 2014. A neural network Dst index model driven by input time histories of the solar windmagnetosphere interaction. J Atmos Sol Terr Phys 110–111: 9–14. https://doi.org/10.1016/j.jastp.2014.01.011. [Google Scholar]
 Revallo M, Valach F, Hejda P, Bochnicek J. 2015. Modeling of CME and CIR driven geomagnetic storms by means of artificial neural networks. Contrib Geophys Geodesy 45(1): 53–65. https://doi.org/10.1515/congeo20150013. [Google Scholar]
 Romashets EP, Poedts S, Vandas M. 2008. Modeling of the magnetic field in the magnetosheath region. J Geophys Res (Space Phys) 113: A02203. https://doi.org/10.1029/2006JA012072. [Google Scholar]
 Richardson IG, Cane HV. 2010. Nearearth interplanetary coronal mass ejections during solar cycle 23 (1996–2009): Catalog and summary of properties. Solar Phys 264: 189–237. https://doi.org/10.1007/s112070109568. [Google Scholar]
 Shen XC, Hudson MK, Jaynes A, Shi Q, Tian A, Claudepierre S, et al. 2017. Statistical study of the storm time radiation belt evolution during Van Allen Probes era: CME versus CIRdriven storms. J Geophys Res (Space Phys) 122: 8327–8339. https://doi.org/10.1002/2017JA024100. [Google Scholar]
 Stepanova M, Antonova E, Troshichev O. 2005. Prediction of Dst variations from polar cap indices using timedelay neural network. J Atmos Sol Terr Phys 67: 1658–1664. https://doi.org/10.1016/j.jastp.2005.02.027. [Google Scholar]
 Sugiura M. 1964. Hourly values of equatorial Dst for the IGY. Ann Int Geophys Year 35: 7–45. [Google Scholar]
 Temerin M, Li X. 2002. A new model for the prediction of Dst on the basis of the solar wind. J Geophys Res 107(A12): 1472. https://doi.org/10.1029/2001JA007532. [Google Scholar]
 Temerin M, Li X. 2006. Dst model for 1995–2002. J Geophys Res 111: A04221. https://doi.org/10.1029/2005JA011257. [Google Scholar]
 Wang CB, Chao JK, Lin CH. 2003. Influence of the solar wind dynamic pressure on the decay and injection of the ring current. J Geophys Res 108: 1341. https://doi.org/10.1029/2003JA009851. [Google Scholar]
 Wanliss JA, Showalter KM. 2006. Highresolution global storm index: Dst versus SYMH. J Geophys Res 111(A2). https://doi.org/10.1029/2005JA011034. [Google Scholar]
 Watanabe S, Sagawa E, Ohtaka K, Shimazu H. 2002. Prediction of the Dst index from solar wind parameters by a neural network method. Earth Planet Space 54: 1263–1275. https://doi.org/10.1186/BF03352454. [Google Scholar]
 Watari S. 2017. Geomagnetic storms of cycle 24 and their solar sources. Earth Planet Space 69(1): 70. https://doi.org/10.1186/s406230170653z. [Google Scholar]
 Weigel RS. 2010. Solar wind density influence on geomagnetic storm intensity. J Geophys Res 115(A9). https://doi.org/10.1029/2009JA015062. [Google Scholar]
 Wu JG, Lundstedt H. 1996. Prediction of geomagnetic storms from solar wind data using Elman recurrent neural networks. Geophys Res Lett 23: 319–322. https://doi.org/10.1029/96GL00259. [Google Scholar]
 Wu JG, Lundstedt H. 1997. Geomagnetic storm predictions from solar wind data with the use of dynamic neural networks. J Geophys Res 102: 14255–14268. https://doi.org/10.1029/97JA00975. [Google Scholar]
Cite this article as: Park W, Lee J, Kim KC Lee J, Park K, et al. 2021. Operational Dst index prediction model based on combination of artificial neural network and empirical model. J. Space Weather Space Clim. 11, 38. https://doi.org/10.1051/swsc/2021021.
All Tables
All Figures
Fig. 1 Artificial neural network architecture consisting of input, hidden, and output layers. 

In the text 
Fig. 2 Comparison of the simple ANN model performance with other models: correlation coefficient (up) and RMSE (down) derived for test period. 

In the text 
Fig. 3 Predicted minimum Dst (blue) and corresponding measured minimum Dst (black) with respect to V × Bs for CMEdriven (left) and CIRdriven (right) storms. The red lines show the values calculated from equations (7) and (8) with Bt = Bs, V = 500 km/s, and N = 30/cm^{3}. 

In the text 
Fig. 4 The trend of the recovery phase of geomagnetic storms and results obtained from empirical equations. The Dst index is normalized by the absolute value of the minimum Dst index. Black solid lines show the normalized Dst index of the 184 events listed in Tables A.1 and A.2 in Appendix, and the red line is the result obtained from the recovery phase equation (12). 

In the text 
Fig. 5 Example of predicting the geomagnetic storm that occurred by CME on 20 November 2003. From top: (a) the solar wind density, (b) IMF total field, (c) IMF Bz, (d) solar wind speed, V × Bs, and (e) Dst index measured and predicted. In the bottom panel, the gray line shows the measured Dst index, the color dots indicate the value predicted 1 h at each hour, and the colored lines show predicted Dst values for 24 h with a onehour interval. 

In the text 
Fig. 6 Example of predicting the geomagnetic storm that occurred by CIR on 5 August 2019. Same format with Figure 5. 

In the text 
Fig. 7 Correlation coefficient and RMSE for the empirical model, ANN model, and combination (KDP) model. The performances were calculated only for the storm events in test group. 

In the text 
Fig. 8 KDP model’s performance for training, validation, and test period. 

In the text 
Fig. 9 Comparison of KDP model performance with other models: correlation coefficient (up) and RMSE (down). The red circles show prediction results from the KDP model for both quiet and storm time, and the red triangles indicate the model performance for only storm time. 

In the text 
Fig. 10 Prediction efficiency of KDP model for storm time only (red) and all test period (black). 

In the text 
Fig. 11 Correlation between measured and predicted Dst index for (a) 3hour (b) 6hour, (c) 12hour, and (d) 24hour ahead. 

In the text 
Current usage metrics show cumulative count of Article Views (fulltext article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 4896 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.