Automated epileptic seizures detection using multi-features and multilayer perceptron neural network

Detection of epileptic seizure activities from long-term multi-channel electroencephalogram (EEG) signals plays a significant role in the timely treatment of the patients with epilepsy. Visual identification of epileptic seizure in long-term EEG is cumbersome and tedious for neurologists, which might also lead to human error. Therefore, an automated tool for accurate detection of seizures in a long-term multi-channel EEG is essential for the clinical diagnosis. This study proposes an algorithm using multi-features and multilayer perceptron neural network (MLPNN) classifier. After appropriate approval from the ethical committee, recordings of EEG data were collected from the Institute of Neurosciences, Ramaiah Memorial College and Hospital, Bengaluru. Initially, preprocessing was performed to remove the power-line noise and motion artifacts. Four features, namely power spectral density (Yule–Walker), entropy (Shannon and Renyi), and Teager energy, were extracted. The Wilcoxon rank-sum test and descriptive analysis ensure the suitability of the proposed features for pattern classification. Single and multi-features were fed to the MLPNN classifier to evaluate the performance of the study. The simulation results showed sensitivity, specificity, and false detection rate of 97.1%, 97.8%, and 1 h−1, respectively, using multi-features. Further, the results indicate the proposed study is suitable for real-time seizure recognition from multi-channel EEG recording. The graphical user interface was developed in MATLAB to provide an automated biomarker for normal and epileptic EEG signals.


Introduction
EEG is a clinical procedure carried out for monitoring, diagnosing, and determining neurological disorders related to epilepsy [1]. Epilepsy is a neurological disorder caused due to abnormal electrical discharges in the brain that are characterized by seizures and sudden changes in the electrical activity of the brain. An epileptic seizure is commonly identified as a slow-spike waveform. The unpredicted nature of these seizures makes the daily life immobile with temporary impairments of perception, speech, memory, consciousness and may lead to an increased risk of injury or death [2,3]. Nearly 4% of world population experience seizure at some stage of their life out of which 1% are epileptic. In interictal recordings, epileptic seizures are usually activated with photostimulation, hyperventilation, and other methods. However, the drawback is that the behavior of provoked epileptic seizures is not necessarily the same as natural ones [4].
The long-term video-EEG recording is a significant milestone to not only capture and analyze ictal events but also help in the contribution of valuable clinical information. Traditional methods of analyzing EEG are time-consuming and a tedious job done by neurologists. Visual interpretation of these long-term EEG recordings can lead to human error and is inefficient [5]. Moreover, the EEG recordings of epileptic seizure are similar to the waves that are a part of background noise and artifacts. For these reasons, automated detection of epileptic

Open Access
Brain Informatics *Correspondence: sriraam@msrit.edu 1 Centre for Medical Electronics and Computing, Ramaiah Institute of Technology (Affiliated to VTU Belgaum), Bengaluru, India Full list of author information is available at the end of the article seizures is needed to reduce the analyzing time and help the neurologists. The brain is a nonlinear and complex dynamic system, so detecting seizures by a single-channel EEG is not sufficient. Thus, the processing of multi-channel EEG plays a vital role in seizure detection across the brain. However, multi-channel EEG signals impose the challenge of efficiently extracting useful information, and hence, only a few studies have focused on them [6,7]. An ample number of studies have been proposed for seizure detection. Such technique involves preprocessing, feature extraction, and classification. Selecting significant features is essential to distinguish between normal and epileptic EEG signals. Our focus is on making the job of the neurological experts easy by making the abnormality visually understandable by using the multi-features extraction methods.
Multi-channel EEG recording plays a crucial role in recognizing the epileptic seizure activities from the brain lobes. Automated computed aided screening tool to help neurologist in saving their investigation period and enhance the required clinical diagnosis. Therefore, this study proposes the automated detection of epileptic seizures from multi-channel EEG recordings using multi-features. It also helps neurological experts have a complete picture of the epileptic EEG recordings preventing them from false alarms and leading to decision support with increased accuracy. Figure 1 shows the flow of the proposed automated seizure detection system. The database was obtained after taking consent from the ethical committee. The raw data that were obtained had other noises such as power-line noise and motion artifacts other than EEG recording. Suitable filtering techniques were implemented to obtain clean EEG. The 50-Hz power-line noise was removed by using a notch filter, a bandpass filter had been implemented to get the signals in the range of 0.5-40 Hz, and independent component analysis (ICA) was applied to remove the motion artifacts.
The EEG data consisting of both normal and epileptic data annotated by the clinician were segmented separately for offline analysis. The features of interest to evaluate the epileptic EEG, namely PSD, entropy, and TE, were extracted, and descriptive analysis was carried out. The extracted features were given as input to the MLPNN binary classifier. Finally, a graphical user interface (GUI) has been developed to label the signals as normal or epileptic.
So far, several automated epileptic seizure detection methods have been proposed. In the early 1980s, the automated seizure detection procedure for a long duration of EEG recordings was initiated [8]. Guo et al. [9] proposed a line length of EEG as a feature and artificial neural networks classifier-based automated detection of epileptic seizure. The database considered was subjected to preprocessing, visual inspection, and artifact removal. EEG was decomposed into different sub-bands using discrete wavelet transform (DWT), and line length feature was extracted. The classification was done using a threelayer MLPNN, and a classification rate of more than 95% was achieved. Back-propagation neural network classifier with periodogram and autoregressive features was proposed [10]. Orhan et al. [11] used DWT-based features with MLPNN model for automated detection of epileptic seizures. Kamath [2,3] proposed Teager energy as a quantitative feature for EEG signals. The study used the University of Bonn database to extract Teager energy and compared the classification outcome with Higuchi's fractal dimension and sample entropy. It has been proved that TE provided an accuracy rate of 97.8%, and it can be used in real-time automated applications.
Gurwinder et al. [12] proposed a study to detect epileptic seizures using wavelet transformation and spikebased features. The work used University of Bonn database and wavelet transformation as its preprocessing technique. Spike-based parameters were extracted from both normal and interictal data. MLPNN was used for classification which gave an accuracy of 98.6%. Epileptic seizure detection method was developed using autoregressive modeling [13] and that showed the classification accuracy of 84.2% using MLPNN. Hierarchical EEG classification system using best basis-based wavelet packet entropy method was proposed [14]. Abbasi and Esmaeilpour [4] proposed a study to choose statistical characteristics of brain signals for detection of epileptic seizures using DWT and perceptron neural network. Their study used University of Bonn database and DWT as a feature extraction method. Statistical characteristics are derived, and a multi-perceptron The features such as mean, standard deviation, skewness, kurtosis, and the median in the first and second derivative of EEG signals were extracted for mobilebased automated epileptic seizure detection using k-means clustering technique [15]. Bogaarts et al. [16] extracted features such as curve length, root mean square, band power, zero crossing, Hjorth parameters, and Teager energy to classify epileptic EEG from normal using the support vector machine (SVM) classifier. Empirical mode decomposition (EMD) followed by DWT was applied on EEG signals to compute log energy entropy. The obtained features were classified using K-NN classifier, which yields the accuracy of 89.4% [17]. In the recent study [18], significant features were selected from neighborhood component analysis for the classification of focal and non-focal EEG signals. The highest classification accuracy of 96.1% was obtained using SVM classifier.
It was inferred from various studies that automated seizure detection was based on using single feature extraction. However, using multi-features would help in better classification of normal and epileptic data and classification accuracy.

EEG data acquisition
The EEG recordings used in this study were obtained from Ramaiah Memorial College and Hospital, Bengaluru, after getting consent from the ethical committee. Unipolar multi-channel (19 channels) EEG recordings from 20 patients (11 male and nine female), each of 20-min duration, were considered for the study. International 10-20 system was used for the electrode placement, and data were recorded at a sampling rate of 128 Hz. The data, consisting of both normal and epileptic seizures annotated by clinician, were segmented separately for offline analysis. The 19 channels include the recordings from the following placement of the electrodes: Fp1, Fp2, F7, F3, Fz, F4, F8, T3, C3, Cz, C4, T4, P3, Pz, T6, O1, and O2. Table 1 shows each patient information used in our study.

Preprocessing
Suitable filtering techniques were introduced to eliminate noise and artifacts. An infinite impulse response (IIR) notch filter of order 2 was implemented to remove the 50-Hz power-line noise. A bandpass filter of order 5 with a higher cutoff frequency of 40 Hz and a lower cutoff frequency of 0.5 Hz was implemented to retain the EEG rhythms of interest in the data. The filter design specifications are: the passband ripple and attenuation in the stop band were set to 3 dB and 40 dB, respectively. Artifacts were removed from the filtered EEG using joint approximation diagonalization of eigenmatrices-based ICA technique [28][29][30].

Feature extraction
Selecting significant features is essential for the proper classification of epileptic seizures. The number of extracted features should be less and easy to compute with reduced computational time. The significant characteristic of an epileptic EEG is a slow wave followed by a spike. The epileptic EEG varies significantly from that of a normal EEG in frequency, period, complexity, etc. Considering all these parameters, the following features were selected for our research work: power spectral density, entropy (Shannon and Renyi entropy), and Teager energy [19][20][21][22][23]. In this paper, PSD was used as the power of the EEG signal increased during epileptic activity. Entropy is a measure of the complexity or uncertainty of a signal, higher during epileptic activity, and gives a clear distinction between normal and epileptic. Teager energy depends on the amplitude of the epileptic data that is higher than that of the normal signal. From the preliminary study, it was identified that PSD using Yule-Walker method showed better results as compared to the other methods of PSD like Welch method, Burg's method, and Thomson's method.

Yule-Walker method
Yule-Walker method is an autoregressive (AR) method that estimates spectra with narrow peaks by placing the poles of the polynomial close to unity. Narrowly banded spectra are quite common in practice. Hence, this has been chosen as the best method for feature extraction for the study. The AR parameters are represented as θ by forming a biased estimate of the signal's autocorrelation function and a minimization of a prediction error [31].
For this study, a fourth-order autoregressive model was used to produce the PSD estimates. The preprocessed signal was segmented at a length of 0.5 s, followed by obtaining the PSD estimates, and then, the maximum PSD of each segment was determined. This process was carried out in the complete study.

Shannon entropy
It is a measure of the randomness or disorder in physical systems or the amount of average information gained by observations of disordered systems. It is the best possible lossless compression and gives low entropy values for varied distribution and high entropy values when outcomes are uniformly distributed. Shannon's entropy is given by the equation [32]: where pi is the probability of occurrence of the signal.

Renyi entropy
It is a generalized form of Shannon's entropy when the Renyi estimation factor α = 1 . It is also called quadratic entropy as α = 2 . The value of α is estimated to be taken as 2 as peak accuracy is achieved with specificity higher than for Shannon's case. Renyi's entropy equation is given as [33]: where α ≥ 0 and α � = 1.
It can also be added that when α has larger positive value, it is sensitive to events that occur often and when α has larger negative value, and it is sensitive to events that occur seldom.
Since entropy is a function of probability, in this study, the probability was estimated using the histogram method by setting the bins with a uniform width.

Teager energy
Teager energy is a nonlinear operator, which can be used for energy estimation of a non-stationary signal. This feature is extremely sensitive to amplitude and frequency changes of a signal. The method is computationally very efficient, as it requires only three samples at any given instance to calculate the physical energy. Since the EEG signal is non-stationary, Teager energy operator can be used as a discriminating feature for normal and epileptic data set.
As per the Teager algorithm, the Teager energy (TE) is estimated from the signal x(n) through the formation of time-delayed state-spaced vectors x(n) = [x 1 , x 2 , x 3 ,…, x n−1 , x n ] where n is the data points as follows [34]: where N is taken to be 64 (segmentation length of 0.5 s).
From the equation, it is clear that Teager energy takes into account the amplitude and the corresponding frequency to determine the physical energy.

Descriptive analysis
Descriptive analysis was performed on the extracted feature samples obtained from epileptic and normal data. The mean, standard deviation (SD), minimum, maximum, interquartile range (IQR), first quartile (Q1), median (Q2), third quartile (Q3), and semi-interquartile deviation (SID) were estimated for extracted features using box plot. The p and z values were found for individual patients for normal and epileptic feature values. The p value should be less than 0.05 which gives a confidence level of greater than 95%, and the z value should be less n − x n−1 * x n+1 than 1.96 and greater than − 1.96 [34]. The descriptive analysis of extracted features showed that the obtained features are significant for further analysis.

Classifier
MLPNN is a feed-forward neural network, which was used for binary classification of the EEG signal. It contains three consecutive layers, namely input, hidden, and output layer [35][36][37][38]. In this study, we have used the MLPNN model with a single hidden layer of 10 neurons. Hyperbolic tangent and tangent sigmoid were used as input to hidden and hidden to output activation function, respectively. A scaled conjugate gradient back-propagation was used as a training function. The classification target was set to 0 for normal and 1 for epileptic [27].

Performance evaluation
The performance of the proposed method was evaluated based on the sensitivity (S + ), specificity (S − ), and false detection rate (FDR) for individual patients as follows [11,19,25]:

Results
This study takes into account 20 patients' multi-channel EEG recordings. Notch filter and bandpass filters with appropriate cutoff frequencies were used to remove line noise of 50 Hz and other background noises. ICA was used to remove motion artifacts, and artifact-removed multi-channel EEG is shown in Fig. 2. To maintain the uniformity of the signal, the EEG was segmented at 0.5 s duration.   A descriptive analysis of the obtained feature from epileptic and normal data was performed. Table 2 shows that the statistical parameters for both epileptic and normal EEG samples obtained from patients were highly distinguishable. Results show that PSD, entropy, and Teager energy in epileptic EEG were more compared to that of normal EEG. A p value was found between normal and epileptic extracted feature samples using a two-sided Wilcoxon rank-sum test. For all the features, p value was found to be less than 0.05 and z value greater or lesser than the prescribed limits, which indicates that all the features were suitable for classification. Table 2 shows the obtained p and z values ( Table 3).
The classifier was trained using holdout cross-validation method with the ratio of 70 − 30 used for training and testing. Highest sensitivity and specificity of 86.2% and 95.2% were obtained using Renyi entropy for individual features, respectively. Further, sensitivity, specificity, and FDR of 97.8%, 96.4%, and 0.15 h −1 were recorded using multi-features which were highest than all other combinations. Table 4 shows the classification results of the proposed system for all individual features and multifeature combination. Figure 3 shows the ROC curve obtained from the classification results of PSD, Shannon entropy, Renyi entropy, Teager energy, and multi-features. Maximum AUC of 0.97 was obtained for multi-features, whereas a minimum of 0.83 attained for Shannon entropy. Classification results revealed that the highest performance measures were achieved using multi-features than the single features with the betterment of sensitivity, specificity, and FDR.

Discussion
The foremost objective of this study was to introduce an automated detection of epileptic seizures using multichannel EEG. Four features, namely PSD, variants of entropy, and Teager energy, were utilized followed by MLPNN classifier. These features were selected for the study based on previous performance on other databases. Experimental results show that multi-features perform better as compared to single features. Figure 4 shows the best validation performance of MLPNN classifiers for multi-features. It can be seen that the best validation performance of 0.08 was obtained at epoch 54. Further, Fig. 5 shows the error histogram of training, validation,  A GUI was built using MATLAB for automated classification of epileptic seizures using the trained model developed. The name assigned to the GUI developed was ' Aepitect' , which stands for automated epileptic seizure detection. The GUI was designed in such a way that it displays the 20 s of EEG every page. Features were extracted at a segmentation length of 0.5 s, and the same were used to classify using the trained model. Figure 6 shows the screenshot of ' Aepitect' , and it was cross-validated with the neurologist and found 98.5% matching.
The button 'Select File' allows the user to select the patient file, and the button 'Biomark' performs preprocessing, feature extraction, classification, and biomarking.
The performance of the proposed approach was compared with the other existing studies reported earlier. Table 5 shows the comparison results between different studies. As it is seen from Table 5, most of the studies have used single-channel EEG data from the University of Bonn and achieved better results. One should take the attention while comparing the performance of different methods since different EEG databases were used in their respective studies. University of Bonn database was found to be clean EEG, and it works well for all the methods. However, the challenge arises while dealing with long-term multi-channel EEG. Therefore, we have used our database for the study to overcome the existing issues such as less sensitivity, specificity, and FDR. The results of seizure detection algorithms are usually evaluated based on the sensitivity of the raised alarms (number of detected seizures/total number of seizures) and false detection rate; it is not evaluated by the sensitivity and specificity of epochs/segments. It was noticed that studies using University of Bonn database had classified epileptic seizures as epochs/segments instead of detecting them as a complete seizure. When comparing with other methods, our method follows the evaluation criteria of sensitivity and FDR to evaluate the performance of the algorithm. As compared to other methods listed in Table 5, the proposed method matches the results of other studies without using any DWT on EEG signal.  The significant contributions of the proposed study were: 1. EEG data were recorded at Ramaiah memorial hospital, Bengaluru, and were used for the study. 2. Artifacts were removed automatically using ICA technique and experts validated same at Ramaiah memorial hospital, Bengaluru. 3. From the preliminary study, the best PSD method (Yule-Walker) was selected for the feature extraction. 4. Three features, namely PSD, variants of entropy, and Teager energy, were used for the feature extraction. 5. The descriptive analysis shows the noticeable band difference between normal and epileptic EEG activities. 6. Wilcoxon rank-sum test shows the evidence to reject the null hypothesis at the 5% significance level. 7. Classification results show the better performance using multi-features as compared to the single features. 8. A MATLAB GUI called ' Aepitect' was developed for automated detection.
The above findings suggest that the proposed method is suitable for automated detection of epileptic seizures in real time. The complete study was implemented in MATLAB 2016b using 8 GB RAM, CPU 2 GHz with Intel i5 processor. As a future step, more features will be included to increase the sensitivity and decrease the FDR. Further, deep learning concept will be explored for the classification of epileptic seizures.

Conclusion
This study provides a multi-channel EEG analysis for the detection of epileptic seizures using PSD, entropy, Teager energy, and MLPNN classifier. Initially, EEG signals were preprocessed to remove noise and artifacts, and features were extracted. Descriptive analysis and Wilcoxon rank-sum test proved the suitability of the extracted features for classification with noticeable band difference between normal and epileptic EEG. The simulation results showed sensitivity, specificity, and false detection rate of 97.8%, 96.4%, and 1 h −1 , respectively, using multi-features. Results indicate that the proposed study is suitable for real-time seizure recognition from multi-channel EEG recording. The graphical user interface referred as ' Aepitect' was developed in MATLAB to provide an automated biomarker for normal and epileptic EEG signals. It is anticipated that the proposed algorithm will offer a faster and accurate diagnosis and also reduce the time spent on detecting seizures from long-term multi-channel EEG recordings and can be extended to more patients for long-term EEG.