Issue
J. Space Weather Space Clim.
Volume 5, 2015
Solar variability, solar forcing, and coupling mechanisms in the terrestrial atmosphere
Article Number A15
Number of page(s) 15
DOI https://doi.org/10.1051/swsc/2015013
Published online 19 June 2015

© O. Ashamari et al., Published by EDP Sciences 2015

Licence Creative CommonsThis is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. Introduction

Solar radiation is the Earth’s most important energy source. It is not constant, but varies over all observed timescales. As such the incoming solar irradiance is a very significant natural driver for the Earth’s climate. For recent reviews see Solanki et al. (2013), Ermolli et al. (2013) and Yeo et al. (2014a). Depending on its wavelength, the incoming radiation is absorbed in different layers of the Earth’s atmosphere and deposits its energy accordingly. For a detailed study of the influence of solar irradiance, including the Solar Spectral Irradiance (SSI) and the Total Solar Irradiance (TSI), on the Earth’s climate system it is however crucial to know accurately its variations over different timescales. Various methods of reconstruction of both the SSI and the TSI have been presented in the literature. These methods can be categorised into proxy-based and semi-empirical models.

The proxy-based models study to what extent specific solar proxies, such as sunspot number, Mg II index, F10.7 radio flux, abundance of cosmogenic isotopes, spectral lines or specific wavelength bands in the observed solar spectra or solar intensitygrams, can be used to describe solar irradiance variability. The NRLSSI model (see e.g., Lean et al. 2011) is based on the sunspot number, Mg II and F10.7 indices, and covers the full solar spectrum from the EUV to the IR. Thuillier et al. (2012) employ the Mg II index along with neutron monitoring observations, with the model named MGNM to model variations in the UV spectral range, while Morrill et al. (2011) use the Mg II index and the Ca II K activity derived from ground-based observations along with high-resolution spectra taken by the Solar Ultraviolet Spectral Irradiance Monitor (SUSIM; Brueckner et al. 1993) on the Upper Atmosphere Research Satellite (UARS) to model the variations in the UV part of the spectrum. Dudok de Wit et al. (2014) employ a set of radio observations to empirically model the SSI from the EUV to the visible part of the spectral range. Preminger et al. (2011) analyse the effect of the photometric sum of the solar features identified from images obtained at the San Fernando Observatory (SFO; Chapman et al. 1997) in the red and blue continuum to model the variation in TSI. For the understanding of the UV spectral range in particular, Lean et al. (1998) and Worden et al. (1998, 2001) analysed Ca II K spectroheliograms to derive information on the evolution of solar activity features used in their empirical models.

The semi-empirical models are based on the fact that various distinct activity features on the solar disk emit characteristic spectra. This involves determining the time-dependent area coverage of features such as sunspots, faculae and network for any given point in time. The contrast of the various features varies with wavelength and distance from disk centre. The area coverage of the activity features is determined from solar images and/or magnetograms. Finally, weighting the intensity spectra with the position-dependent relative areas covered by the features yields the temporal variation of the SSI (for an overview see e.g., Ermolli et al. 2013). Different groups use different spectral synthesis codes, solar images and magnetogram data.

Full-disk observations carried out with the Precision Solar Photometric Telescope in Rome (PSPT; Ermolli et al. 2010) are employed in the Osservatorio Astronomico di Roma (OAR) model of solar irradiance variations (Domingo et al. 2009; Ermolli et al. 2011). More recently, Ermolli et al. (2013) used, along with the PSPT images, spectra calculated with the RH radiative transfer code (Uitenbroek 2002) to reconstruct TSI and the SSI from the UV to the IR wavelength range. Moreover, Fontenla et al. (2011) reconstruct TSI and the SSI from the EUV to the IR using the Solar Radiation Physical Modeling tools (SRPM; Fontenla et al. 2009, 2011) along with data from PSPT (Coulter et al. 1996), also operating at the Mauna Loa Solar Observatory. Haberreiter et al. (2014) use the Solar Modeling code (SOLMOD; Haberreiter 2011, 2012) and images obtained with SOHO/EIT for the identification of activity features with the SPoCA tool (Verbeeck et al. 2014) to reconstruct the solar EUV spectral range.

Synthetic spectra calculated in non-Local Thermodynamic Equilibrium (non-LTE) with the COde for Solar Irradiance (COSI; Haberreiter et al. 2008; Shapiro et al. 2010) have also been used for semi-empirical modelling of solar irradiance (Haberreiter et al. 2005; Shapiro et al. 2011). With the analysis of MDI magnetograms presented here we aim at improving their UV reconstruction. Recently Yeo et al. (2014b) used a series of full-disk observations from the Kitt Peak Vacuum Telescope (KPVT; Jones et al. 1992), SOHO/MDI and the Helioseismic and Magnetic Imager on board the Solar Dynamics Observatory (SDO/HMI; Schou et al. 2012) to reconstruct TSI and SSI from the FUV to the IR, with the SATIRE-S model for the period of 1974–2013 (see e.g., Krivova et al. 2003; Unruh et al. 2008; Ball et al. 2011; Yeo et al. 2013, and references therein). In the following we summarise the concept of the SATIRE reconstruction model in more detail as it is in some respect similar to the new approach presented here.

Sunspots are identified as groups of magnetic pixels that have continuum intensities below a certain threshold value. All other magnetic pixels on the solar disk are identified as faculae and network. In the irradiance reconstruction these pixels are assigned a specific weighting, i.e. filling factor. It has a value between 0 and 1, depending on the pixel’s magnetic field strength and is determined via linear interpolation between 0 G and the saturation value (Bsat) of the faculae and network pixels. Finally, the intensity spectra for the different features and positions on the disk are calculated with the LTE spectral synthesis code ATLAS9 (Kurucz 1991). For the faculae and network elements the same intensity spectra are used.

It is clear that each of the groups that identify the area coverage of solar activity features from various instruments uses slightly different definitions and identifications for these features. We emphasise that there are different definitions for some photospheric solar structures, and there is no universally accepted unique definition for these structures (Harvey & White 1999). Sunspots, for example, are typically straightforward to identify as they appear dark in photospheric intensitygrams. The differences arise for faculae and network definitions and identification techniques. For instance, using PSPT images, faculae and network are identified through the position-dependent contrast of the narrow-band Ca II K intensities. On the number of features identified, several discrete thresholds might be applied. Therefore, unless the same feature identification and consequently threshold technique is applied by different authors, a direct comparison of the segmentation results is difficult.

Here we present yet another definition for the detected solar photospheric structures taking into account the locations of magnetic pixels with respect to active regions. We also use continuum intensitygrams and magnetograms, however the idea is that, besides the information of the contrast and the photospheric magnetic field strength of the features, their proximity to an active region is also taken into account. In particular, we investigate to what extent faculae and network can be distinguished through their proximity to a sunspot group.

The present work is the first of a series of papers that aims towards creating a new approach for the identification of solar features from SOHO/MDI intensitygrams and magnetograms, for the purpose of semi-empirical irradiance reconstruction. The motivation for this is that in the near future we aim to incorporate it into an automated system that can operate in a real-time mode to process low-latency data from space missions. As a proof of concept we start our investigations on SOHO/MDI data, and in the near future we plan to investigate SDO/HMI data and adapt our methods accordingly. One particular outcome of the current work is the provision of area coverages of different solar features, which will be used at a later stage for irradiance reconstruction, and would also be available to the community to carry out any kind of analysis for a better understanding of these features. In this paper we discuss and present the detection methodology of solar features from SOHO/MDI intensitygrams and magnetograms and their area coverage. For this purpose, we use an extended version of the Automated Solar Activity Prediction tool (ASAP; Colak & Qahwaji 2008, 2009), which was used in previous research on the detection of sunspots and active regions. We detect magnetically active solar features in magnetograms and locally dark solar features in intensitygrams and provide active regions, sunspot umbra and penumbra, faculae, and network detections as well as the quiet Sun at daily intervals for the available MDI intensitygrams and magnetograms.

We provide binary segmentation maps along with the solar area coverages of the features as a function of position on the solar disk. In an upcoming paper we will assign intensity spectra to these activity features and reconstruct the SSI for the full MDI time series.

The following parts of the paper are organised as follows: the SOHO/MDI data used in this study and the pre-processing applied are briefly described in Section 2. The statistics of the intensity and magnetogram signals from the full set of the MDI data are discussed in Section 3. The methodologies adopted for the detection of features in solar images are discussed in Section 4, the results obtained and their analysis, including a simple model of TSI based on the results obtained here, are presented in Section 5, while conclusions are drawn in Section 6.

2. MDI data

The Solar Oscillations Investigation/Michelson Doppler Imager (SOI/MDI; Scherrer et al. 1995) on board the Solar and Heliospheric Observatory (SOHO) satellite took sets of high-quality full-disk images. The images were taken using five narrow-band filters through the Ni I 676.8 nm absorption spectral line, and were used to produce, amongst others, the intensitygrams and magnetograms.1

In this work, we use level 2 intensitygrams, which have been corrected for limb-darkening, observed with a 6-hour cadence and level 1.8 5-min magnetograms, observed with a 96-min cadence. A total of 90,463 FITS data files were downloaded, which included 17,347 intensitygrams and 73,116 magnetograms. The downloaded data cover a period from April 21, 1996, when the first magnetogram was recorded, to April 11, 2011, when the MDI observations were terminated. This period covers solar cycle 23 and the beginning of solar cycle 24, and includes data from two solar minima and one solar maxima periods. The downloaded data set, however, includes a significant number of corrupted intensitygrams and magnetograms, which if left included, would cause misdetection and classification of solar features. Therefore, care was taken to identify and exclude corrupted files, which have poor quality and missing values.

For the purpose of the work described here, one pair of intensitygram and magnetogram is selected for each day over the complete MDI data set. These were determined, by selecting the intensitygram observed closest to 12:00 UT each day. Then the magnetogram closest in time to the intensitygram was chosen. As part of the selection process, the time difference between the selected intensitygram and magnetogram pair was not allowed to exceed 3 h. Each intensitygram was then re-mapped taking account of differential solar rotation, using the approach discussed in Tang (1981), to correspond to the observing time of the associated magnetogram. Through this process, 2,490 days with joint intensitygram-magnetogram pairs remained and were used for the subsequent analysis that is discussed here.

3. Data analysis

In this work, we take advantage of the ASAP tool for initial photospheric feature detection and grouping. ASAP was designed to operate in near real-time and so used SOHO/MDI intensitygrams and magnetograms, provided in Graphics Interchange Format (GIF) 8-bit format. However, the downloaded data files used in the current work are in the Flexible Image Transport System (FITS) format, containing data stored as signed 32-bit format as well as detailed header information. Therefore, the conversion of the FITS data from 32-bit signed to unsigned 8-bit greyscale images is necessary to enable the data visualisation and form images that are similar in nature to the GIF ones provided in SOHO archives. To implement the data type conversion, the solar disk minimum and maximum values of the investigated intensitygrams and magnetograms were determined. We found that the full range of values are between 0 and 1.5 for the intensitygrams, and between −3,500 and 3,500 Gauss for the magnetograms. These values were therefore used in linear operations to scale the signed 32-bit data from the FITS files to unsigned 8-bit data, which were then stored as images.

Subsequently, we found and recorded the solar disk mean, standard deviation, minimum and maximum values for each of the investigated intensitygrams and magnetograms. This is a very useful step for analysing and understanding the nature of the investigated data set. The collected values are therefore analysed for the purpose of finding parameters that enable the efficient and constant detection and enhanced visualisation of the photospheric features. The collected values and their analysis are presented and discussed in the following subsections.

3.1. Intensitygram analysis

The mean (〈I〉), standard deviation (σI), minimum (Imin) and maximum (Imax) values of the on-disk pixels for the intensitygrams are recorded and analysed in order to find the most appropriate parameters and values that enable the detection and visualisation of sunspots, pores and quiet Sun. These values are summarised in Table 1 and discussed in the following subsections. We report the values of these parameters as relative 8-bit pixel values, therefore we do not assign units to them.

Table 1.

Summary of the statistical analysis of the intensitygrams.

3.1.1. Intensitygram statistics

We collected 〈I〉, Imin and Imax values within the solar disk for the full set of the analysed intensitygrams. Figure 1 shows plots of 〈I〉 (green), Imin (blue) and Imax (red) for the on-disk pixels of the intensitygrams over the full MDI time series. We found 〈I〉 to be fairly steady with average about 169.5. We found the Imin values to be higher on average during the solar minimum period in comparison with during the solar maximum period. The collected Imin values can represent the intensities of sunspots, pores or the quiet Sun. Therefore we see the Imin values drop to its lowest levels during the solar maximum, when bigger sunspots with lower intensities are common. We found Imax values to increase slightly during the solar maximum, when compared to during the solar minimum period. The Imax values can represent the intensities of the quiet Sun or bright regions. It is also worth noting that extreme values are often found to correspond to single erroneous pixels.

thumbnail Fig. 1.

I〉 (green), I min (blue) and I max (red) of the solar disk for the full set of 8-bit intensitygrams.

Moreover, we determined the standard deviation σI for on-disk pixels of individual intensitygrams for the full set. The collected σI are shown plotted in Figure 2. As expected, we found σI to be lowest during the solar minimum periods and to increase gradually towards the solar maximum period or with the presence of solar activity. Thus, we find σI to be a good indicator for selecting intensitygrams at times with low solar activity. Figure 3 shows the statistics of the intensitygrams selected during quiet time.

thumbnail Fig. 2.

Standard deviation, σ I, of the solar disk for the full set of 8-bit intensitygrams.

thumbnail Fig. 3.

I min (blue) and 〈I SMin〉 (red) of 350 intensitygrams, with the lowest σ I, collected from the solar minimum periods.

3.1.2. Sunspot intensities

We collected and analysed 350 intensitygrams from the solar minimum periods for the purpose of obtaining an initial sunspot detection threshold Thss. For each image, we analysed the difference between Imin and 〈I〉. The collected data are shown plotted in Figure 3. As expected intensitygrams with the smallest differences between Imin and 〈I〉 have no sunspots or pores. In cases where the difference between Imin and 〈I〉 is larger, it turns out that the corresponding intensitygrams contain pores or small sunspot groups. We experimented with the collected data to empirically define the sunspot threshold, by linearly increasing the factor fss subtracted from 〈I〉 of the investigated images, and found that fss = 15 in the 8-bit range of values provides a reasonable, by visual inspection, threshold for the sunspot detection. Thus we adopt the following equation for calculating the initial Thss from the intensitygrams: Th ss = I - f ss . $$ {{Th}}_{\mathrm{ss}}=\left\langle I\right\rangle-{f}_{\mathrm{ss}}. $$(1)

For comparison purposes, we found that the corresponding value of this threshold to the original intensitygrams in FITS format is equal to 0.91. An example of the initial sunspot detection of an intensitygram using Eq. (1) is shown in Figure 4.

thumbnail Fig. 4.

(a) MDI intensitygram obtained May 17, 2000 at 11:11 UT, (b) sunspot segmentation map, which was achieved by applying Eq. (1). This is the result of the first stage of the initial sunspot segmentation. (c) sunspot segmentation map, which was achieved by applying Eq. (2). This is the result of the second stage of the initial sunspot segmentation. The segmentation map from the second stage shows further detection of sunspots.

3.1.3. Pore intensities

We carried out further analysis in order to enhance the detection of sunspots achieved from the first stage, as discussed in Section 3.1.2, to detect smaller parts of sunspots and pores in each of the detected sunspot groups, by calculating ThPore using Eq. (2). One hundred and forty samples with the lowest activity during the solar minimum periods were collected and analysed at this stage in order to establish the values of the parameters in Eq. (2). The σI values of the collected data samples were analysed in order to determine an estimate of the standard deviation of the solar disk with the lowest activity (σI,SMin). The collected data are plotted and shown in Figure 5. When these samples were examined, care was taken to check that the solar disk was clear of sunspots and pores. The average and the median of σI,SMin, 〈σI,SMin〉 and 〈σI,SMinmed, were found to be equal to 1.5. We consider this value to reflect the standard deviation value of the quiet Sun at the solar minimum. Thus, σI,SMin, 〈σI,SMin〉 = 1.5, is adopted for the ThPore calculation. We also carried out an empirical analysis on a number of sunspot detections in a 100 × 100-pixel window, which is equivalent to a field-of-view of about 200 × 200 arcsec. The aim was to find a threshold value, ThPore, that can enhance the detection of sunspots, as defined in Eq (2). The analysis was carried out by linearly increasing the value of fp to find the most appropriate value for the detection of pores, which was found to be seven. Further discussion about enhancing the sunspot detection is given in Section 4. An example of this stage of sunspot detection in comparison with the results from the first stage is shown in Figure 4.

Th Pore = I region - ( f p × σ I , SMin ) , $$ {{Th}}_{\mathrm{Pore}}=\left\langle I\right\rangle_{\mathrm{region}}-({f}_{\mathrm{p}}\times {\sigma }_{\mathrm{I},\mathrm{SMin}}), $$(2)where fp = 7 and σI,SMin = 1.5.

thumbnail Fig. 5.

The standard deviation, σ I,SMin, of 140 intensitygrams collected from the solar minimum periods.

3.2. Magnetogram analysis

Having converted the magnetograms to unsigned 8-bit greyscale images, we record the mean 〈B〉, standard deviation σB, minimum Bmin, and maximum Bmax, of the scaled on-disk magnetic field values for the whole data set summarised in Table 2 and the following subsections.

Table 2.

Summary of the statistical analysis of the magnetograms.

3.2.1. Magnetogram statistics

We determined 〈B〉, Bmin, and Bmax for the on-disk pixels of the full set of the analysed magnetograms. The collected data are plotted in Figure 6.

thumbnail Fig. 6.

B〉 (green), B min (blue) and B max (red) of the full disk for the full set of 8-bit magnetograms.

We find the mean magnetic field strength 〈B〉 to be fairly constant over the full time series, and to be on average 127. The Bmin and Bmax values were also collected as they can give an indication of the intensities of the negative and positive polarity regions. As expected, Bmin and Bmax values vary in line with the solar cycle. As a result of the 8-bit rescaling of negative and positive magnetic field strength values onto one positive scale, the Bmin values are higher during the solar minimum period and gradually decreases towards the solar maximum period, while Bmax values are lower during the solar minimum period and gradually increase with solar activity.

Furthermore, the standard deviation of the magnetic field strength, σB, has also been determined for the full set of magnetograms and plotted in Figure 7. As expected, we find σB to be lower during the solar minimum period and to increase with the presence of solar activity. In the following, we use σB as an indicator for finding magnetograms with the lowest solar activity in the data set.

thumbnail Fig. 7.

The standard deviation of the magnetic field strength, σ B, of the full disk for the full set of 8-bit magnetograms.

3.2.2. Magnetic region analysis

During the solar minimum, the Sun has lower magnetic activity and flux when compared to the solar maximum period. In this context, we collected 100 samples of magnetograms from the solar minimum periods when solar activity was lowest and analysed the 〈B〉, Bmin, and Bmax values, in order to get an initial estimation of the pixel values that represent magnetic regions. The collected data are shown plotted in Figure 8. At this stage, we are unable to define the threshold values for detecting magnetic regions. However, we consider the average of the minimum, min(Bmin), and the maximum, max(Bmax), values of the collected samples to indicate magnetic values. We found the differences between these values and 〈B〉, |〈B〉 − min(Bmin)|, and |〈B〉 − max(Bmax)|, to be approximately 30 and −30. These values are adopted to enable the enhanced visualisation of the magnetic regions, which is achieved by applying contrast stretching, so that values between 〈B〉 − 30 and 〈B〉 + 30 will be transformed to values between 0 and 255, respectively. Equation (3) is applied to perform the contrast stretching, where in this case the interval of the magnetic field values to be stretched is $ {B}_{\mathrm{M}\mathrm{in}}^{\mathrm{*}}=\left\langle B\right\rangle-30$ and $ {B}_{\mathrm{M}\mathrm{ax}}^{\mathrm{*}}=\left\langle B\right\rangle+30$, respectively. B Output = B Input - B Min * 255 B Max * - B Min * , $$ {B}_{\mathrm{Output}}={B}_{\mathrm{Input}}-{B}_{\mathrm{Min}}^{\mathrm{*}}\frac{255}{{B}_{\mathrm{Max}}^{\mathrm{*}}-{B}_{\mathrm{Min}}^{\mathrm{*}}}, $$(3)

thumbnail Fig. 8.

B〉 (green), B min (blue), and B max (red) of 100 magnetograms, with the lowest σ B, collected from the solar minimum periods.

After performing the contrast stretching process, we carried out investigations to find the most appropriate approach for detecting magnetic regions within the solar disk, using the investigated samples. However, as reported in Ortiz et al. (2002) and Liu et al. (2012), we find that the magnetograms have non-uniform noise, and the application of a constant threshold results in the detection of significant numbers of isolated pixels particularly in the lower right quadrant of solar disks, where the actual noise is slightly above the average used in the thresholding. This segmented noise has been eliminated by applying a 3 × 3 median filter to the magnetogram, with little effect on the segmented signal. An example of an enhanced magnetogram is shown in Figure 9.

thumbnail Fig. 9.

(a) MDI magnetogram obtained May 17, 2000 at 01:39 UT. Values in the range of −3,500 to +3,500 Gauss are represented in grey-scale, (b) the contrast stretched version of the magnetogram. Values with the equivalence of −823 to +823 Gauss are represented in grey-scale, (c) median filtered version of the contrast stretched magnetogram, (d) magnetic region segmentation map. The grey and white regions on the segmented map represent the negative and positive regions on the magnetogram, respectively.

After enhancing the investigated images, we carried out an investigation on the collected data in order to determine the threshold values for the magnetic regions detection (Thmr). We did so by investigating the most appropriate value for the variable fmr in Eq (4) that can be used to detect the magnetic regions. The value of σB,SMin in Eq (4) is a predetermined value, fixed at the value of σB of the solar disk at the solar minimum, which is approximately equal to 2.4. This value was determined by calculating the σB of the contrast stretched magnetogram samples which were collected from the solar minimum periods, when magnetic activities were the lowest. Thus σB,SMin is set to 2.4 for the magnetic region detection and with this we found, by visual inspection, that using the value 2 for fmr gave the most appropriate results. For purpose of comparison, we found the physical equivalent of Thmr to be about 30 Gauss. This is equivalent to about three times the noise level of the MDI magnetograms, as found by other studies (e.g., Liu et al. 2012). Figure 9 shows an example of a magnetic region segmentation map, which was achieved by applying Eq (4): Th mr = B ± ( f mr × σ B , SMin ) , $$ {{Th}}_{\mathrm{mr}}=\left\langle B\right\rangle\pm ({f}_{\mathrm{mr}}\times {\sigma }_{\mathrm{B},\mathrm{SMin}}), $$(4)where fmr = 2, and σB,SMin = 2.4 in the 8-bit scale of the magnetograms.

4. MDI segmentation

ASAP is the collective name for a set of algorithms used to process solar images. These cover algorithms for detecting sunspots and active regions (Colak & Qahwaji 2008), solar-flare prediction (Colak & Qahwaji 2009) and sunspot detection and tracking (Verbeeck et al. 2014). For the purposes of the work described here, new segmentation algorithms were developed and integrated into ASAP to enable the detection and identification of faculae and network thought to be important for irradiance reconstruction. In summary, we apply our segmentation algorithms as follows. First the initial feature detection processes, described in Section 4.1, are applied to determine sunspot and magnetic region candidates. Second, a Neural Network (NN) method is applied, described in Section 4.2.2, to group sunspots and identify active regions based on magnetic region and sunspot properties. This step determines active region and their boundaries, which is a crucial intermediate process needed to determine whether features are located either inside or outside active regions. Finally, in Section 4.2.3, the maps generated are analysed to identify faculae and network, based on their contrasts and proximity to active regions. In Section 4.3, we summarise the process and provide flowcharts of the overall segmentation process.

4.1. Initial feature detection

We process MDI intensitygrams and magnetograms for the initial detection of sunspots and magnetic regions. This is carried out in a similar manner to that described in Colak & Qahwaji (2008) using automatically calculated threshold values, based on image type and the solar features that need to be detected. In this work, we adopted a number of predefined parameters, which are derived from analysing the MDI solar cycle data, as described in Section 3, and use them systematically in the segmentation approach, in order to achieve consistent segmentations. The initial detection of sunspots and magnetic regions is explained further below.

4.1.1. Initial sunspot detection

The initial detection of sunspots includes two stages. First, in the MDI intensitygram, we detect pixels with values below (〈I〉 − fss), which detects the majority of sunspot pixels, but occasionally misses smaller parts within the sunspots. It has been found that some of these missing small parts can be detected during the second stage, where a window of size 100 × 100 pixels is drawn around each group of the detected pixels, and the mean is calculated for each window, which we denote here as (〈IRegion). Any intensitygram pixel within the window that has value lower than (〈IRegion − (fp × σI,SMin)) is detected. Examples of sunspots segmentation maps are shown in Figure 10.

thumbnail Fig. 10.

(a) Intensitygrams from the solar maximum and (b) solar minimum. (c) Initial sunspot segmentation maps from solar maximum and (d) solar minimum.

4.1.2. Initial magnetic region detection

For the purpose of this paper, magnetic regions are defined as regions of the magnetograms that have magnetic field significantly higher than their surroundings. Prior to the detection of magnetic regions, we apply contrast stretching in order to enhance the visualisation of the magnetic regions, followed by 3 × 3 window median filtering, to reduce the non-uniform noise within the magnetogram, as described in Section 3.2.2. Magnetic regions with positive and negative polarity are identified and segmented separately. For the segmentation threshold calculation, pixels that have values below (〈B〉 − (fm × σB,SMin)) represent negative polarity regions, while pixels that have values above (〈B〉 + (fm × σB,SMin)) represent positive polarity regions. In both cases, the σB,SMin value is set to 2.4, which is the average of σB in the solar minimum period when no activity was present, as discussed in Section 3.2. Two examples of magnetic region detection from magnetograms are shown in Figure 11.

thumbnail Fig. 11.

(a) Magnetograms from the solar maximum and (b) solar minimum. (c) Magnetic region segmentation maps from solar maximum and (d) solar minimum. The negative and positive regions on the magnetogram images are represented as grey and white regions, respectively, on the segmentation map, for clarity purpose.

4.2. Final feature detection

After detecting sunspot candidates and magnetic regions, the results are processed to cluster sunspots into groups, and to identify groups of magnetic regions that overlap with sunspots, which we define as active regions.

In this section, we summarise the determination of active region candidates using image processing, which is described in Section 4.2.1, and the validation and final identification of active region and sunspot groups using neural network machine learning, which is described in Section 4.2.2. Full details of the sunspot and active region grouping can be found in Colak & Qahwaji (2008). We provide a summary of this process below.

4.2.1. Active region candidates

In order to identify sunspot and active region groups, the magnetic region and sunspot candidate maps are combined as follows. First a pixel marked as a sunspot candidate (Pspotcan) on the sunspot candidate map is found. If the magnetic region map has a magnetic pixel (Pmag), at the same location as Pspotcan, a new map is created for the active region group candidate (Pactcan) and marked with the pixel value of the Pmag. Then, all adjacent pixels to Pactcan that are marked as magnetic in the Pmag, are marked as Pactcan.

4.2.2. Neural network for active region and sunspot group identification

For the active region detection, we employ the machine learning approach that was introduced in Colak & Qahwaji (2008). A neural network (NN) was trained to enable the detection and grouping of active regions, using sunspot and active region candidates’ properties. A NN is a machine learning algorithm that is commonly used for tasks such as data classification, clustering and prediction. The adopted NN has been trained on nearly 100 examples of sunspots and active regions’ properties. For the NN application, the properties of overlapping sunspot and active region candidates are extracted and fed to the NN as an input. These properties include geometrical and location information of the active region candidates. The NN then provides an output decision, which represents whether the active region candidate is considered to be an active region or not. The identified active regions are then used for the final determination of sunspot groups and their boundaries.

The umbra and penumbra parts of a sunspot are then detected, in a similar manner to the approach introduced in Colak & Qahwaji (2008). This approach classifies the sunspot pixels that have values below (〈ISS 〉 − σSS) as umbra, and the remaining sunspot pixels as penumbra. Here 〈ISS〉 and σSS are the mean and the standard deviation, respectively, of the intensitygram pixels that are detected as sunspots. The final output from the processes discussed in this subsection are the sunspot groups, including umbra and penumbra, and active region groups, examples of which are shown in Figure 12.

thumbnail Fig. 12.

(a and b) The input intensitygram and magnetogram, (c) map of sunspot umbra and penumbra, (d) active region groups. Each group is given a different shade of grey.

4.2.3. Final detection of faculae and network

In this work, faculae are considered to be magnetic pixels that lie within the boundary of an active region, while network is defined as magnetic pixels located outside active region boundaries. For the distinction between faculae and network, ASAPs active region detection is used as an intermediate step. The starting point to distinguish between these features is to employ image processing to the corresponding sunspot, magnetic region and active region maps. An example set of these maps is shown in Figure 13.

thumbnail Fig. 13.

(a) Example of the segmentation maps for sunspots, (b) magnetic regions, (c) active regions. These maps are further employed for the identification of faculae and network.

We identify the faculae and network map by subtracting the sunspot map from the magnetic region map. Then, the faculae and network pixels are sub-classified to faculae if they are located within active region, or to network if they are located outside an active region. Examples of detection results are shown in Figures 14 and 15.

thumbnail Fig. 14.

Segmentation maps are shown in (a), from left to right, for faculae and network, faculae, and network with their corresponding boundaries shown in red in (b) for May 17, 2000 at 01:39 UT.

thumbnail Fig. 15.

Segmentation maps are shown in (a), from left to right, for faculae and network, faculae, and network with their corresponding boundaries shown in red in (b) for April 22, 1997, 00:04 UT. For this input, no faculae are detected, therefore no indication for faculae is given in the central segmentations.

Ortiz et al. (2002) study the position-dependent contrast of magnetic elements in specific magnetic field strength intervals from the threshold value up to 600 G (see their Fig. 3). From this study it can be concluded that faculae at disk centre with field strengths between 300 and 500 G typically have a contrast If/Iqs ≈ 0.8 (or −0.2 in their representation), and a slightly lower contrast If/Iqs ≈ 0.75 for the interval of 500–600 G. These contrast values are below the threshold of 0.91 used for sunspot identification in the present work. In order to ensure that faculae are not erroneously identified as sunspots, we included a criterion that sunspot pixels that have a magnetic field strength lower than 800 G are always identified as facular pixels. Using this value we follow the approach by Ball et al. (2012).

4.3. Summary of detection algorithm

This section described the proposed approach for detecting a number of solar features, present in MDI intensitygrams and magnetograms. The feature detection criteria are summarised in Table 3, while the overall detection process is summarised as a flowchart and shown in Figure 16. The key elements of the feature identification scheme are as follows:

  • the detection of solar features present in MDI intensitygrams and magnetograms, in line with ASAP feature detection,

  • the detection of faculae and network is based on the location of magnetic regions in relation to active regions,

  • the automated detection and classification of solar features and the calculation of their area coverage for the whole period of MDI data.

thumbnail Fig. 16.

Flowchart of the individual steps of the feature identification scheme of ASAP.

Table 3.

The solar features identification criteria that have been adopted in this work, and the parameters employed to identify them, from the respective data set. Intensitygrams and magnetograms are shortened to Int and Mag, respectively.

A summary of the solar features detection processes is given below:

  • detect initial sunspot candidates from intensitygrams, and magnetic regions from magnetograms. Unmagnetised regions on the magnetogram define the quiet Sun,

  • group sunspots and magnetic regions for the detection and verification of active regions and sunspots. Active regions are detected based on the localisation of magnetic regions and sunspots using NNs,

  • identify faculae and network by subtracting sunspot map from the magnetic region map. The remaining magnetic regions pixels are classified as faculae if they are located within active regions, otherwise, if they are located outside active regions, then they are classified as network.

5. Results

The main aim of this research is the detection of solar features for the provision of useful proxies for reconstructing TSI and SSI. For the detection of photospheric features, we carried out a series of processes to identify feature of interest on MDI intensitygrams and magnetograms. Figure 17 shows colour maps of features extracted from intensitygram and magnetogram pairs taken from solar maxima and the solar minima on the days indicated in the figure. Further to the application of the identified solar features, we have also computed the area filling factors for the detected features. These will be used in our future work for the SSI reconstruction. The calculated filling factors for a feature reflect the fraction of the solar disk covered by the feature and assign reference synthetic spectra to it. The filling factor is specified as a function of radial position on the solar disk, by relative radius values that determine concentric rings around the disk centre, as indicated in Tables 4 and 5. In the case of MDI the last interval would be between 0.95 and the limb. To reduce uncertainty in the calculation of area filling factors, the areas are calculated to sub-pixel accuracy. Tables 4 and 5 give the sets of filling factors calculated for the intensitygram-magnetogram pairs for May 17, 2000 11:11 UT and April 22, 1997 at 00:04 UT, respectively. The bottom row in each table is provided as a check and shows the number of pixels for each type of feature and should equal the sum of the numbers in the rows above.

thumbnail Fig. 17.

The left panels show the original MDI intensitygram (a) and magnetogram (b). The right panels show a colour map of the detected solar features: umbra (red), penumbra (orange), faculae (yellow) and network (blue).

Table 4.

Pixel coverage of the rings with inner and outer radii, R 1 and R 2, respectively, expressed relative to the solar radius, of the features penumbra, umbra, faculae, and network, as detected on the intensitygram and the magnetogram observed May 17, 2000, 01:39 UT.

Table 5.

Same as Table 4 but for the intensitygram and the magnetogram observed April 22, 1997, 00:04 UT.

The area coverage of the identified features (umbra, penumbra, network and faculae), which will be employed in upcoming reconstructions, is given in Figure 18a. The solar cycle modulation of the area coverage of all the features is clearly visible. To further test the segmentation result, we carried out a multi-regression fit to the SOHO/VIRGO data (Fröhlich et al. 1997). Following Fehlmann et al. (2012) we first corrected (divided) the VIRGO data by the correction factor 1.0034 to account for the difference between the World Radiometric Reference (WRR) maintained at PMOD/WRC to which VIRGO has been compared to, and the SI scale, the latter being realised at the TSI Radiometer Facility (TRF; Kopp et al. 2012). The Picard/PREMOS instrument PREMOS, (Schmutz et al. 2009, 2013) has been compared to both scales and independently confirmed the difference of the WRR and SI scale. We then carried out the multi-linear regression fit from which we obtained correlation coefficients between the observed TSI time series and the disk-integrated relative area coverage of umbra, penumbra, network and faculae of 0.29, 0.29, 0.82 and 0.57, respectively. Figure 18b shows the comparison of the model (red) with the VIRGO data (black). It is clear that the TSI variability is very well pre-produced. Some strong decrease in TSI due to the appearance of large sunspots is slightly underestimated. Figure 19 shows the scatter plot between the VIRGO data and the fitted data, the correlation coefficient between the two being r2 = 0.88.

thumbnail Fig. 18.

(a) Gives the relative area of the full solar disk of network (green), faculae (red), umbra (blue) and penumbra (black), (b) compares the fitted TSI data (red) with the VIRGO data corrected to the SI-scale (black).

thumbnail Fig. 19.

Scatter plot between the VIRGO TSI data and the fitted data based on our regression analysis. The squared correlation coefficient of the fit versus the VIRGO observation is r2 = 0.88. The dashed line indicates equality between the two data sets.

Figure 19 shows the scatter plot of the VIRGO data compared to the simple regression model. The analysis of the time series shows that the applied method to identify solar activity features from SOHO/MDI intensitygrams and magnetograms gives realistic results.

6. Conclusions

In this paper, a new image segmentation approach is introduced to enable the efficient detection of sunspot umbra and penumbra, faculae, network and quiet Sun. Other techniques were also developed for the visualisation and the calculation of area coverages of the detected features. The imaging methods and tools introduced in this work enabled the extraction of area coverages for the different solar features for the duration of the entire MDI mission. These area coverages will be used in the near future for the reconstruction of SSI. We extended the ASAP system for the purpose of identifying a number of solar features and calculating their area coverage, which is necessary for the SSI reconstruction. As part of this process, we carried out investigations to determine a set of key values that enables thresholding of solar features on MDI intensitygrams and magnetograms. ASAP is an automated system that provides near real-time detection of solar features and prediction of solar flares, and is currently operated using SDO HMI intensitygrams and magnetograms. Taking this into account, the approaches described in this paper can be applied to calibrated SDO FITS files in order to enable the systematic detection of solar features, and subsequently enable the provision of near real-time solar irradiance using SDO HMI images.

The relative contribution of the quiet Sun and various solar activity features to the total solar irradiance strongly depends on wavelength. Radiometers, e.g., PICARD/PREMOS (Schmutz et al. 2009) and PROBA2/LYRA LYRA (Hochedez et al. 2006), were dedicated to observe the solar spectrum in specific passbands from the EUV to the visible and near infrared, which allows a detailed validation of our approach. The situation regarding SSI measurements is however rather bleak. The only upcoming SSI instrument currently planned for the future is the TSIS instrument, which will be integrated on the International Space Station in 2017. Therefore, it is crucial to have reliable models available that allow modelling of the expected SSI in a robust way. Using the results presented here along with the synthetic spectra calculated from semi-empirical models will allow us to determine the contributions of the various features to the variation in SSI. As the intensity spectrum emitted by the different solar components can be pre-calculated, the solar spectrum can be predicted practically in real-time. The reconstruction of the SSI over a broad wavelength range will be the focus of a follow-up paper.

Acknowledgments

SOHO is a project of international cooperation between ESA and NASA. The authors acknowledge that the research leading to these results has received funding from the European Community’s Seventh Framework Programme (FP7 2012) under Grant Agreement No. 313,188 (SOLID). The authors thank the SOI and MDI team for their guidance to acquire MDI data, and thank Dr. Mohammad Alomari for his assistance in retrieving MDI data. The authors thank the editor and the anonymous reviewers for their careful revision and valuable comments. The editor thanks two anonymous referees for their assistance in evaluating this paper.


1

Full-disk 1,024 × 1,024 pixel intensitygrams and line-of-sight magnetograms data sets are both available to download from the SOHO and JSOC archive: http://soho.nascom.nasa.gov; http://jsoc.stanford.edu

References

Cite this article as: Ashamari O, Qahwaji R, Ipson S, Schöll M, Nibouche O, et al. Identification of photospheric activity features from SOHO/MDI data using the ASAP tool. J. Space Weather Space Clim., 5, A15, 2015, DOI: 10.1051/swsc/2015013.

All Tables

Table 1.

Summary of the statistical analysis of the intensitygrams.

Table 2.

Summary of the statistical analysis of the magnetograms.

Table 3.

The solar features identification criteria that have been adopted in this work, and the parameters employed to identify them, from the respective data set. Intensitygrams and magnetograms are shortened to Int and Mag, respectively.

Table 4.

Pixel coverage of the rings with inner and outer radii, R 1 and R 2, respectively, expressed relative to the solar radius, of the features penumbra, umbra, faculae, and network, as detected on the intensitygram and the magnetogram observed May 17, 2000, 01:39 UT.

Table 5.

Same as Table 4 but for the intensitygram and the magnetogram observed April 22, 1997, 00:04 UT.

All Figures

thumbnail Fig. 1.

I〉 (green), I min (blue) and I max (red) of the solar disk for the full set of 8-bit intensitygrams.

In the text
thumbnail Fig. 2.

Standard deviation, σ I, of the solar disk for the full set of 8-bit intensitygrams.

In the text
thumbnail Fig. 3.

I min (blue) and 〈I SMin〉 (red) of 350 intensitygrams, with the lowest σ I, collected from the solar minimum periods.

In the text
thumbnail Fig. 4.

(a) MDI intensitygram obtained May 17, 2000 at 11:11 UT, (b) sunspot segmentation map, which was achieved by applying Eq. (1). This is the result of the first stage of the initial sunspot segmentation. (c) sunspot segmentation map, which was achieved by applying Eq. (2). This is the result of the second stage of the initial sunspot segmentation. The segmentation map from the second stage shows further detection of sunspots.

In the text
thumbnail Fig. 5.

The standard deviation, σ I,SMin, of 140 intensitygrams collected from the solar minimum periods.

In the text
thumbnail Fig. 6.

B〉 (green), B min (blue) and B max (red) of the full disk for the full set of 8-bit magnetograms.

In the text
thumbnail Fig. 7.

The standard deviation of the magnetic field strength, σ B, of the full disk for the full set of 8-bit magnetograms.

In the text
thumbnail Fig. 8.

B〉 (green), B min (blue), and B max (red) of 100 magnetograms, with the lowest σ B, collected from the solar minimum periods.

In the text
thumbnail Fig. 9.

(a) MDI magnetogram obtained May 17, 2000 at 01:39 UT. Values in the range of −3,500 to +3,500 Gauss are represented in grey-scale, (b) the contrast stretched version of the magnetogram. Values with the equivalence of −823 to +823 Gauss are represented in grey-scale, (c) median filtered version of the contrast stretched magnetogram, (d) magnetic region segmentation map. The grey and white regions on the segmented map represent the negative and positive regions on the magnetogram, respectively.

In the text
thumbnail Fig. 10.

(a) Intensitygrams from the solar maximum and (b) solar minimum. (c) Initial sunspot segmentation maps from solar maximum and (d) solar minimum.

In the text
thumbnail Fig. 11.

(a) Magnetograms from the solar maximum and (b) solar minimum. (c) Magnetic region segmentation maps from solar maximum and (d) solar minimum. The negative and positive regions on the magnetogram images are represented as grey and white regions, respectively, on the segmentation map, for clarity purpose.

In the text
thumbnail Fig. 12.

(a and b) The input intensitygram and magnetogram, (c) map of sunspot umbra and penumbra, (d) active region groups. Each group is given a different shade of grey.

In the text
thumbnail Fig. 13.

(a) Example of the segmentation maps for sunspots, (b) magnetic regions, (c) active regions. These maps are further employed for the identification of faculae and network.

In the text
thumbnail Fig. 14.

Segmentation maps are shown in (a), from left to right, for faculae and network, faculae, and network with their corresponding boundaries shown in red in (b) for May 17, 2000 at 01:39 UT.

In the text
thumbnail Fig. 15.

Segmentation maps are shown in (a), from left to right, for faculae and network, faculae, and network with their corresponding boundaries shown in red in (b) for April 22, 1997, 00:04 UT. For this input, no faculae are detected, therefore no indication for faculae is given in the central segmentations.

In the text
thumbnail Fig. 16.

Flowchart of the individual steps of the feature identification scheme of ASAP.

In the text
thumbnail Fig. 17.

The left panels show the original MDI intensitygram (a) and magnetogram (b). The right panels show a colour map of the detected solar features: umbra (red), penumbra (orange), faculae (yellow) and network (blue).

In the text
thumbnail Fig. 18.

(a) Gives the relative area of the full solar disk of network (green), faculae (red), umbra (blue) and penumbra (black), (b) compares the fitted TSI data (red) with the VIRGO data corrected to the SI-scale (black).

In the text
thumbnail Fig. 19.

Scatter plot between the VIRGO TSI data and the fitted data based on our regression analysis. The squared correlation coefficient of the fit versus the VIRGO observation is r2 = 0.88. The dashed line indicates equality between the two data sets.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.