Deep‐learning–based method for faults classification of PV system
2021; Institution of Engineering and Technology; Volume: 15; Issue: 1 Linguagem: Inglês
10.1049/rpg2.12016
ISSN1752-1424
AutoresSayed A. Zaki, Honglu Zhu, Mohammed Al Fakih, Ahmed Rabee Sayed, Jianxi Yao,
Tópico(s)Energy Load and Power Forecasting
ResumoIET Renewable Power GenerationVolume 15, Issue 1 p. 193-205 ORIGINAL RESEARCH PAPEROpen Access Deep-learning–based method for faults classification of PV system Sayed A. Zaki, Corresponding Author eng_sayed_002010@yahoo.com orcid.org/0000-0001-7545-0039 School of New Energy, North China Electric Power University, Beijing, China Faculty of Engineering, Cairo University, Giza, Egypt Correspondence Sayed A. Zaki, School of Renewable and Clean Energy, North China Electric Power University, 102206, Beijing, China. Email: eng_sayed_002010@yahoo.comSearch for more papers by this authorHonglu Zhu, School of New Energy, North China Electric Power University, Beijing, China The State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, North China Electric Power University, Beijing, ChinaSearch for more papers by this authorMohammed Al Fakih, School of New Energy, North China Electric Power University, Beijing, ChinaSearch for more papers by this authorAhmed Rabee Sayed, orcid.org/0000-0001-5855-7125 School of New Energy, North China Electric Power University, Beijing, China Faculty of Engineering, Cairo University, Giza, EgyptSearch for more papers by this authorJianxi Yao, School of New Energy, North China Electric Power University, Beijing, ChinaSearch for more papers by this author Sayed A. Zaki, Corresponding Author eng_sayed_002010@yahoo.com orcid.org/0000-0001-7545-0039 School of New Energy, North China Electric Power University, Beijing, China Faculty of Engineering, Cairo University, Giza, Egypt Correspondence Sayed A. Zaki, School of Renewable and Clean Energy, North China Electric Power University, 102206, Beijing, China. Email: eng_sayed_002010@yahoo.comSearch for more papers by this authorHonglu Zhu, School of New Energy, North China Electric Power University, Beijing, China The State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, North China Electric Power University, Beijing, ChinaSearch for more papers by this authorMohammed Al Fakih, School of New Energy, North China Electric Power University, Beijing, ChinaSearch for more papers by this authorAhmed Rabee Sayed, orcid.org/0000-0001-5855-7125 School of New Energy, North China Electric Power University, Beijing, China Faculty of Engineering, Cairo University, Giza, EgyptSearch for more papers by this authorJianxi Yao, School of New Energy, North China Electric Power University, Beijing, ChinaSearch for more papers by this author First published: 12 January 2021 https://doi.org/10.1049/rpg2.12016Citations: 1AboutSectionsPDF ToolsRequest permissionExport citationAdd to favoritesTrack citation ShareShare Give accessShare full text accessShare full-text accessPlease review our Terms and Conditions of Use and check box below to share full-text version of article.I have read and accept the Wiley Online Library Terms and Conditions of UseShareable LinkUse the link below to share a full-text version of this article with your friends and colleagues. Learn more.Copy URL Share a linkShare onEmailFacebookTwitterLinked InRedditWechat Abstract The installation of photovoltaic (PV) system, as a renewable energy source, has significantly increased. Therefore, fast and efficient fault detection and diagnosis technique is highly needed to prevent unpredicted power interruptions. This is obtained in this study in the following steps. First, an efficient meta-heuristic algorithm is proposed for extracting the optimal five parameters of the PV model in order to assist the MATLAB simulation model. It is used due to its simplicity and high efficiency in building the PV array simulation. Second, a new PV system deep-learning convolutional neural network (CNN) fault classification method is presented for the advantage of automatic feature extraction, which reduces the computational burden and increases the high classification capability. Finally, for the practical and theoretical validation of the employed CNN model, normal and six fault cases are selected based on different atmospheric conditions. At same time, three electrical indicators are analysed and accordingly chosen as inputs to the proposed classification model. Moreover, the proposed model is compared with other machine-learning models. 1 INTRODUCTION Renewable sources of energy recently have noticeable donation to sustain sources of power generation. This is due to the yearly rise in energy demand, the bad impact of non-sustainable energy sources as well as global degradation of fossil fuel such as coal and oil. Among these renewable sources, solar energy is one of the most common widely utilised power sources due to its friendly environmental impact, zero-running cost, and simple technical issues needed for efficient operation. Currently, photovoltaic (PV) modules, as the main part of solar energy, is growing rapidly all over the world due to the significant progress of the developed technology as it helps to decrease the solar cells material costs [1]. However, the need for efficient fault diagnosis techniques has increased also for better monitoring the outdoor-installed PV systems considering the variation of environmental conditions. For effective fault detection methods, modelling the PV system mathematically plays an important key on the accuracy of the classification technique. This is because it has a remarkable role in obtaining the optimal parameters, design, and assessment of the PV solar system fault diagnosis methods [2, 3]. Although the manufacturers of solar modules provide the electrical characteristics of PV modules at standard test condition (STC) to users to help in building the simulation model and other calculations, these characteristics can change from its nominal values under actual long-term working condition. Moreover, the PV system operation depends on the environmental conditions such as temperature and solar irradiance [4]. Therefore, an efficient PV model using measured current-voltage (I-V) data is needed to accurately and efficiently optimise the PV parameters [5]. There are many models that are formulated to characterise the I-V curve. The most commonly used for PV modelling is the single diode, double diode, and the PV module models [6]. On the other hand, an efficient PV optimisation algorithm is needed to extract the electrical parameters from PV models. Recently, several algorithms have been dedicated that can be classified into three sets: Analytical algorithms [7], deterministic algorithms [8], and meta-heuristic algorithms [9]. Based on the latter set, the improved teaching-learning-based optimisation algorithm (ITLBO) [10] is used in this study for extracting the electrical parameters from PV module to be employed in the simulation model due to its simplicity in execution as well as no special requirements. In general, studying the different faults, in case of low-solar irradiance level, is computationally challengeable for the PV fault diagnosis because disturbances introduced on the different indicators might be unnoticeable. Valuable studies [11-13] offer several algorithms to detect PV faults such as low-mismatch faults, and line-to-line fault (LL) under low-solar irradiance levels. However, such studies provide low accuracy in fault identification and need high-cost measurement sensors. In this work, the low irradiance is taken into consideration when studying the different faults for achieving better performance of the proposed model. Nowadays, the artificial intelligence (AI) techniques are commonly used for the diagnosis of PV fault types due to high classification accuracy. Artificial neural network (ANN), probabilistic neural network (PNN), and fuzzy c-means (FCM) are lately used in a wide range in [14-16]. However, these AI techniques have some demerits such as huge data amount and computational time for training stage, sensitive for small environmental condition variations, and updating the collected data periodically are needed due to degradation ageing problems in solar PV modules. Furthermore, the advantage of monitoring the operation of PV systems gives the user a huge data amount for detecting different known and unknown faults. Consequently, using this kind of data-based models is relevant, and the process of features extraction is effective for PV fault diagnosis issues. For the purpose of extracting the desired features, convolutional neural network (CNN) as a multivariate model is recently used for data image representation and also for faults classification task. In the case of CNN-based classification model, CNN utilises mainly in image recognition for its high accuracy in classification. Moreover, using CNN in monitoring numerous engineering schemes are increased for the advantage of automatic features extraction process. In the field of detecting the motor faults, Lee et al. [17] use signal databases containing a univariate with bivariate data time series, then, they implement CNN as a feature extraction and classification technique to raise the accuracy of bearing fault classification rate. Chen et al. [18] suggest CNN for the classification and extract-features technique. This is obtained by dividing the collected multivariate time-series data into univariate data for each parameter, then, the feature extraction process was performed for each parameter individually. In case of PV solar cells, Li et al. [19] conduct one dimensional CNN to classify the different kinds of PV module defects such as yellowing, dust-shading, and corrosion of gridline using aerial images in large-scale PV plants. However, the equipment used in the work is expensive, and the CNN implemented only on the offline operating condition. In this study, the main innovations are trifold: Based on meta-heuristic techniques, the ITLBO is advised to extract the electrical parameters of PV modules for the simulation model. The CNN fault classification technique is proposed to achieve high performance of the fault diagnosis tasks, considering the advantage of automatic features extraction from input datasets, as softmax layer, to obtain the classification output result. The effectiveness of the proposed technique is validated by simulated and experimental case studies. In addition, it is also compared with three benchmark AI models, namely, two-stage support vector machine (SVM) [13], ANN [14], and PNN [15]. 2 PV SYSTEM MODEL AND DESCRIPTION OF TYPICAL FAULT INDICATORS 2.1 PV modelling The equivalent circuit of the PV module is shown in Figure 1. The PV module can be modelled by connecting some diodes in series representing the series connected solar cells, and some diodes in parallel representing the parallel branches of solar cells (in this study, each PV module has 60 cells connected in series and no parallel branches of solar cells), then the generated current can be written in Equation (1) as [6] I = I p h − I O e V + I N s R s n N s V t h − 1 − V + I N s R s N s R s h (1) FIGURE 1Open in figure viewerPowerPoint The measured curves and simulated ones of the photovoltaic (PV) model obtained by improved teaching-learning–based optimisation (ITLBO) algorithm at 1000 W/m2 irradiance and 25°C temperature: (a) The current-voltage (I–V) curves, (b) The P–V curves where the thermal voltage V t h which can be expressed as in Equation (2) V t h = n K T q (2) To model an accurate PV module, five parameters are required to be mathematically calculated. These parameters are I p h , I O , R s , R s h , and n. The main reason for extracting these parameters is to get the minimum absolute error between the measured with the simulated data of the generated current. The ITLBO algorithm presented in [10] is used in this study as an accurate optimisation technique for parameter extraction of the PV module. Compared with the existing teaching-learning–based algorithms, the proposed algorithm has two main improvements, namely, (i) in the teacher stage, a new teaching plan is proposed where the learners are divided into two different groups based on the learning level, then the best learners are guided from their corresponding teacher as well as themselves; and (ii) in the learner stage, instead of only one learner to exchange knowledge to another learner, the new plan of ITLBO is to make the well-learners learn from the other two learners for accurate search path, while the poorer learners learn from the knowledge of four different learners in order to increase the global search capability and provide the population variety. The simulated data obtained by the ITLBO algorithm at STC operating condition is compared with the measured values in Figure 2. It is observed that the simulated data is almost similar to the measured one which provides accurate modelling for the PV module to get the five parameters The available electrical parameters of the PV module provided by the manufacturer during the optimisation process are listed in Table 1, while the obtained five parameters can be shown in Table 2. FIGURE 2Open in figure viewerPowerPoint The model circuit of the PV module TABLE 1. Electrical parameters of the PV module at STC Parameter Value I m , m o d (A) 8.12 V m , m o d (V) 30.2 P m a x (W) 245 I s c , m o d (A) 8.69 V o c , m o d (V) 37.4 α ( A / K ) 0.0006 Β ( V / K ) –0.0031 TABLE 2. The calculated five parameters of PV module at STC Optimal parameter Value I p h (A) 9.33 I o (A) 10 − 10 R s ( ′ Ω ) 0.24 R s h ( ′ Ω ) (V) 593.24 n ( p . u . ) 1.102 2.2 Indicators selection Selecting the indicators for detecting the faults in PV system must satisfy the following aspects: (i) Ability of identifying and discriminating the different kinds of fault under the variation of solar radiation and module temperature, (ii) suitable for different PV systems scale and configurations, (iii) ability of using optimum number of indicators to help accelerate the classification operation and minimise the used memory, (iv) measuring and building the selected indicators must be simple and easy for implementation. Therefore, in this study, three indicators are implemented from the whole PV array rather than calculating those for each string or module. Voltage, current, and power, which are obtained from the I-V and P-V curves, are the three common indicators used in identifying different faults of the PV array, see examples in [14–16, 20, 21]. Although, these indicators could discriminate between the healthy the faulty conditions, implementing these indicators, in our study, has made an overlapping between the different selected faults for the irradiance variation. More specifically, every two data points have the same working situation and voltage levels attained at some fault cases. Therefore, for efficient fault diagnosis method that has a better sense to environmental condition variation and fault happening, three indicators are chosen in this study for analysing the characteristics of different faults. The fault diagnosis procedure is based on extracting these indicators under the variation of solar irradiance and module temperature. Furthermore, analysing the selected indicators in the PV system represents the key role to detect and accurately classify the different faults. The selected indicators for the PV array are N o r m a l i s e d V m , N o r m a l i s e d I m , and F F , and their expressions can be obtained as in Equations (3) to (5), respectively [11, 22]. N o r m a l i s e d V m = V m V o c , r e f (3) N o r m a l i s e d I m = I m I s c , r e f (4) F F = V m ∗ I m I s c ∗ V o c (5) To calculate these indicators, the electrical parameters of the PV array, namely, V m , I m , I s c and V o c , can be determined analytically at any solar irradiance S and temperature T as in Equations (6) to (9), respectively [23]. V m = N s , m o d ∗ V m , S T C 1 + α T − T S T C + V t h ∗ l n I s h I s h , S T C (6) I m = N P , m o d ∗ I m , S T C S S S T C 1 + α T − T S T C (7) I s h = N P , m o d ∗ I s h , S T C S S S T C 1 + α T − T S T C (8) V o c = N s , m o d ∗ V o c , S T C 1 + β T − T S T C + V t h ∗ l n I s h I s h , S T C (9) 2.3 Typical faults occurring in PV arrays There are various types of faults that may occur in the PV system. Some of faults on the DC side that frequently occur have excessive power losses and reduction in the efficiency of the PV system, including short-circuit (SC) faults, LL faults, open circuit (OPEN), partial shading (Shad), and degradation. It is noted that the most commonly occurring faults in the existing studies are LL, OPEN and Shad faults [13-16]. The LL faults (or bridging faults) are conducted here by generating an SC connection between two points in a string or among different PV strings. Practically, this fault type may have occurred due to either water entering to the conductors, chewing by animals, mechanical damage of insulation, or junction box damage by DC arcs. LL faults can cause fire damage in large-scale PV systems. Moreover, they may generate reverse fault current, with its value depending on the voltage difference among the two faulty points. Low-voltage difference would provide a low fault currents, and this becomes a challenge to detect this LL fault cases using traditional fault detection methods. In addition, in such LL fault cases, the protection devices may fail to sense such currents (see [13, 24] for more details). Therefore, to address these concerns, this study focuses on diagnosing LL1 and LL2 fault cases under the variation of atmospherical conditions, particularly in low-irradiance levels. For implementing the LL fault in this study, two fault scenarios, denoted as LL1 and LL2, represent four and six short-circuited modules mismatch between two strings, respectively. However, such fault has not commonly occurred in the PV arrays, and their effect should be discussed and detection techniques should be utilised to prevent probable power losses or fire damage. As indicated by [11], the resistance of SC can be assumed to equal zero, which is consistent with our setting for faults LL1 and LL2. Shading faults are recently modelled and studied due to its distinct impact on the power output from the PV systems during the day operation. Although, their impacts are not permanent, they should be distinguished from the other fault types for the exact classification in order to serve the protection devices. In this study, two fault scenarios of shading faults, denoted as shad1 and shad2, are selected to be examined. These faults represent four- and six-shaded modules in same string, respectively. Note that the shunt bypass diode is assumed to normally operate for the shaded modules. The shading factor is set at 50%, which means only half of the full solar irradiance is received by the module surface, while the temperature is same as STC. However, the shading fault can be performed at different patterns and shading factors in the PV array. This study considers that the shading is almost fixed over the day, that is, the constant shading factor is applied. More specifically, in small residential PV arrays, the physical area is relatively small and the solar irradiance can be assumed constant on all PV modules over the day [25]. Moreover, in such arrays, the shading faults are assumed to occur due to the nearby buildings or the associated equipment. Therefore, the same numbers of PV-shaded modules may be affected by this small moving shadow along the day, wherever the module location is, inside the examined string. So, the number of shaded modules are considered fixed along the daytime, as similar treatment is found in [15, 16]. Finally, OPEN circuit faults are modelled by disconnecting electric wiring between the PV modules. Two faults are considered, namely, OPEN1 and OPEN2, to denote the OPEN circuit faults in one and two PV strings, respectively. It should be noted that every fault class is simulated separately, that is, the study of multiple faults occurring at the same time is not included in this study. The PV array under study contains three strings, where each string consists of 13 modules connected in series as shown in Figure 3(a). Each module is connected in parallel with shunt bypass diode to avoid hotspots formation and power loss in shading condition as shown in Figure 3(b). Moreover, seven experimental operating tests on the DC side have been simulated to evaluate the fault diagnosis method; one normal condition, and six fault cases can be seen in Figure 4 (LL1, LL2, OPEN1, OPEN2, Shad1, and Shad2). The different typical faults are simulated using MATLAB Simulink in case of STC as shown in Figure 5. According to Figure 5, in the LL1 and LL2 fault cases, V o c and V m are declined, while the I s c and I m are unchanged compared with the normal state. On the other hand, in the OPEN1 and OPEN2 fault cases, the value of I s c and I m are significantly decreased, where the V o c and V m are almost the same. Additionally, in Shad1 and Shad2 fault cases, the I m and V m , are significantly affected, while I s c and V o c are similar with those of the normal condition. FIGURE 3Open in figure viewerPowerPoint (a) Schematic diagram of the PV array with 3×13, and (b) the equivalent circuit model of the PV module FIGURE 4Open in figure viewerPowerPoint The DC side of the PV array with the examined six fault cases FIGURE 5Open in figure viewerPowerPoint The I–V curves of the PV array at one normal and six faults cases at standard test condition (STC) 3 METHODOLOGY First, the analysis of the selected three indicators is performed. Then, a fault detection and diagnosis method is presented based on the deep-machine model of the standard CNN. 3.1 The analyses of selected indicators for typical faults To perform indicators analysis, the typical faults are conducted under different environmental conditions, that is, variable temperature and solar irradiance in three consecutive days in June. A wide range of irradiance from 5 to 1000 W/m2 and temperature data samples from 5 to 55°C are chosen with a time step of 15 min between every two samples with solar irradiance values as shown in Figure 6(a), where all samples with zero irradiance, that is, during the night, are eliminated. FIGURE 6Open in figure viewerPowerPoint The distribution of the selected indicators in normal and different fault conditions in three days. (a) The environmental conditions of module irradiance and temperature, (b) the Normalised Vm, (c) the Normalised Im, and (d) the FF To simulate the shading faults, the solar irradiance is multiplied by gain equals 0.5 to be feed to the shading modules, while the temperature is unchanged as in normal condition. Then, the selected indicators, N o r m a l i s e d V m , N o r m a l i s e d I m , and F F , are calculated using Equations (3) to (5) as shown in Figures 6(b) to (d), respectively. From the presented Figure 6(b), it is observed that the N o r m a l i z e d V m significantly declined in cases LL1 and LL2. For N o r m a l i s e d I m , it decreased in OPEN1 and OPEN2 cases as shown in Figure 6(c). Moreover, the third indicator F F is influenced and declined in cases Shad1 and Shad2 as shown in Figure 6(d); the reason is that these fault types have a direct impact on V m and I m (refer Figure 4). Based on the aforementioned discussions, the selected three indicators are mainly and differently influenced by the fault type, which, in result, indicates the ability of the proposed methodology to detect and classify different kind of faults. Additionally, this would help to decrease overlapping between the collected data samples, and hence, raise the proposed method efficiency. Finally, the conclusion of the three indicators distribution is summarised in Table 3. TABLE 3. The indicators variation of PV array in case of normal and different fault cases Faults Normalised V m Normalised I m Fill factor (FF) Normal 0.68–0.8 0.9–0.95 0.7–0.75 LL1 0.5–0.54 – – LL2 0.38–0.44 – – OPEN1 – 0.6–0.62 – OPEN2 – 0.3–0.31 – Shad1 0.68–0.73 – 0.6–0.63 Shad2 0.69–0.74 – 0.56–0.58 Note: (-) means no change compared with normal case. 3.2 CNN structure CNN is commonly utilised in image recognition and classification. It has been successfully used to identify faces, different objects and so forth. Therefore, CNN is an important machine-learning tool for most practitioners today. The one-dimensional CNN, which is used in this study, consists of two consecutive steps [26] as illustrated in Figure 7: The first step is the feature extraction which contains three stages: The input layer, convolutional, and pooling stages. The convolutional layer is to extract the features across the raw data of the input layer by a filtre and stores them as a convolved feature, which can be conducted using sigmoid activation function. The pooling layer function is to reduce the size of the convolved feature, which progressively reduces the computational time in the network. Max-pooling is the method that usually utilised to complete the process of the pooling layer. The second step is classification, which contains the fully connected and the output stages, where the inputs of fully connected stage are the features attained from the pooling stage, and then classification task using softmax in the output stage is performed. FIGURE 7Open in figure viewerPowerPoint The proposed convolutional neural network (CNN) architecture The samples dataset is collected from the PV simulation model under normal and abnormal conditions. This dataset represents a three-dimensional matrix for data of the three indicator samples calculated in Section 3.1. 3.3 The proposed fault diagnosis method A feature map is conducted based on a one-dimensional structure, where the pooling layer is used to reduce the dimension of the conventional layer at every individual feature map. During the training process of the CNN, weights are updated using the gradient descent method to minimise the loss (classification error) function value. This implies that the larger magnitude of a given weight, to the extracting features significant for classification. Also, zero-padding is applied to prevent shrinking as well as information loss happening at the borders. The flowchart of applying the proposed CNN model can be shown in Figure 8. FIGURE 8Open in figure viewerPowerPoint The proposed CNN-based model for fault classification 4 RESULTS ANALYSES The proposed model is validated in this section theoretically and physically for assessing the classification accuracy. 4.1 Simulation tests In order to validate the proposed method, a case study is performed in this part using MATLAB Simulink software. The data samples distribution between the three indicators are collected in three-dimensional matrix for generating the input of the proposed CNN model. The data is gathered as shown in Figure 9 as described previously in Section 3.1. This data will follow, first, the pre-processed operation for data filtering, elimination of noise and so forth. Then, the dataset are separated for two stages: The training and testing stages for diagnosing the CNN model. The total dataset consists of 1239 = (177*7) including normal and six different fault cases, each sample has the three selected input indicators. FIGURE 9Open in figure viewerPowerPoint The data samples distribution of the three indicators 4.1.1 Training results The dataset at the training stage is assessed by using 80% of the total dataset (118 dataset samples for each normal and fault cases). To perform the training process, the CNN technique was trained using the selected parameters shown in Table 4. Grid search technique is conducted to get these parameters by trying some combinations to the best one suitable for the training dataset. TABLE 4. The parameters of the proposed CNN Parameters Value Input 25 Feature input dimension 16 Batch dimension 128 Learning rate 0.01 RMSprop optimiser variables No. of epochs 50 Input layer nodes 32 Fully connected layer nodes 40 Softmax layer nodes 6 One of the key points for preventing the early convergence process for better model performance is to choose the suitable optimisation algorithm, which helps to increase the training computational speed and the accuracy of fault classification in the CNN model. As stated in [27], there are three types of the commonly used algorithms, namely, RMSprop, Adam and Adadelta. In this study, the RMSprop algorithm is implemented to obtain the learning rate, batch size, and number of epochs related to the technical implementation. The optimisation implementation is obtained based on calculating the mean data of the nearest amount of the grade weight. It is useful to process the dynamic data variables such as the current, and voltage data at normal and faulty conditions. For better monitoring and analysis of the training process, the training accuracy and loss values between CNN model output and labelled ground truth are measured during the number of batches. Furthermore, the results in Table 5 which show the confusion matrix of the training samples evaluate that a few data samples are misclassified in the training
Referência(s)