FFT Power Relationships Applied to EEG Signal Analysis: A Meeting between Visual Analysis of EEG and Its Quantification
=======================================================================================================================

* Juan M. Díaz López
* Jose Curetti
* Vanesa B. Meinardi
* Hugo Díaz Farjreldines
* Carina Boyallian

## Abstract

**Objective** This study presents a novel computational approach for analyzing electroencephalogram (EEG) signals, focusing on the distribution and variability of energy in different frequency bands. The proposed method, FFT Weed Plot, systematically encodes EEG spectral information into structured metrics that facilitate quantitative analysis.

**Methods** The methodology employs Fast Fourier Transform (FFT) to compute the Power Spectral Density (PSD) of EEG signals. A novel encoding technique transforms frequency band distributions into six-entry vectors, referred to as “words,” which serve as the basis for three key metrics: a scalar value ![Graphic][1]</img> a vector ![Graphic][2]</img>, and a matrix *H*. These metrics are evaluated using a dataset comprising EEG recordings from 30 healthy individuals and 15 patients with epilepsy. Machine learning classifiers are then applied to assess the discriminatory power of the proposed features.

**Results** The classification models achieved a 95.55% accuracy, 93.33% sensitivity, and 96.67% specificity, demonstrating the robustness of the proposed metrics in distinguishing between control and epileptic EEGs.

**Conclusions** The FFT Weed Plot method provides a novel approach for EEG signal quantification, improving the systematization of spectral analysis in neurophysiological studies. The metrics developed could serve as quantitative descriptors for automated EEG interpretation, offering potential applications in clinical and research settings.

**Highlights**

*   From frequency domain analysis to information and probability theory, new ways of encoding information.

*   A step towards the systematization and automation of medical EEG reading.

*   New global metrics for the description of the energy of an EEG recording and their applications in machine learning.

*   The FFT Weed Plot method, We present a new, reproducible, robust and clinically designed method to improve the objectivity of medical practice and research in neurophysiology.

![Figure1](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F1.medium.gif)

[Figure1](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F1)

Keywords
*   EEG signal processing
*   Fast Fourier Transform (FFT)
*   Power Spectral Density (PSD)
*   feature extraction
*   machine learning
*   biomedical signal analysis
*   FFT Weed Plot
*   medical EEG reading
*   clinical neurophysiology

## 1. Introduction

An electroencephalogram (EEG) is a graphical representation of neural activity that is registered using electrodes placed on different parts of the scalp. Each electrode mainly samples the synaptic activity that occurs in the superficial layers of the brain cortex. In clinical neurophysiology, the EEG is an instrument for observing and monitoring the electrophysiological activity of the brain for evolutionary and prognostic diagnosis.

Usually, clinical interpretations of an EEG are achieved by associating pathological features with certain pattern recognition during visual inspection Ebersole and Pedley [15],Tong and Thankor [52]. Although this traditional analysis is quite useful, it is subjective and does not easily allow for any systematization Ré et al. [39], Nascimento et al. [32].

The interpretation of an EEG has two stages: descriptive and inferential. The first stage includes all the descriptive aspects of the signals, namely analyzing the representative graphoelements, the inter-hemispheric symmetry, the synchronicity between signals, and the distribution and variability of the energy of the signal Tsuchida et al. [53], Hirsch et al. [20]. On the other hand, in the second stage, contextual information, such as age, consciousness state, physiological condition, duration, and psychopharmacological prescription before EEG recording, is considered Niedermeyer et al. [33]. At this point, inference occurs and physicians can provide a report on neurophysiology diagnosis. Sometimes this report contains both interpretations, but a clear inferential interpretation is not always possible. This is because the signs present in the EEG are non-specific to a unique pathology and also these signs can be considered as normal or abnormal waves depending on the stages of neurodevelopment or consciousness state. This complexity underlies EEG interpretation [15, 58]. For all of the above reasons, EEG analysis has traditionally relied on visual inspection by trained neurophysiologists, a process which, although effective, requires many years of medical training, lacks standardization and is inherently subjective.

In recent years, quantitative EEG (qEEG) analysis has gained attention, the quantitative analysis of an EEG recording was mainly based on the use of classical techniques of signal processing Harmony [19], Tong and Thankor [52], Oppenheim et al. [35], Blanco et al. [7], Sanei and Chambers [46]. Methods related to wavelets Rosso et al. [44], Bethany Gosala [6], Yang et al. [60], [2, 43], which can be thought of as a generalization of Fourier Transform Anuragi et al. [4], Mehla et al. [31], Al-Salman et al. [3], and others related to information theory, using Lempel-Ziv complexity [55, 13, 23, 12, 30], entropy Cataldo et al. [11], Guisande et al. [18], Gancio et al. [16], and Jensen-Shanon divergence Pereyra et al. [36], Lamberti and Majtey [24], Ré et al. [40], Mateos et al. [29], are proposed to find information in EEG signals Vicente et al. [54], Bossomaier et al. [10], Restrepo et al. [41].

In addition, the rise of artificial intelligence has led to the development of new methods for classifying EEG recordings Tjepkema-Cloostermans et al. [51], White et al. [56]. These methods extract information from the signals by dividing them into EEG bands Huang and Ling [21] and applying neural networks to classify the extracted characteristics [49, 26, 59].

However, applying these methods to daily physicians’ practice is not always easy. In some cases, physicians may find it challenging to use these methods due to their focus on mathematical and physical tools that are difficult to introduce. In other cases, the methods may not capture the unique features and singularities of physicians’ EEG interpretation. These abilities are often hidden in the expertise and experience of clinical physicians.

In this work, we have developed and applied a new method for quantifying, analyzing, and comparing EEG recordings. Our approach approximates the quantitative analysis (qEEG) to the visual analysis (vEEG) of EEG by formalizing the way physicians visually inspect power proportions across bands. Our goal is to understand the dynamics of energy variability in EEG recordings over time and among all electroencephalographic bands, not just the band with the highest power. The method developed in this article focuses on the process of medical EEG reading and its descriptive interpretation for diagnostic purposes. Here we introduce a novel computational method, the FFT Weed Plot, which encodes EEG spectral information of Power Spectral Density (PSD) analysis into structured representations that enhance the interpretation of EEG power distributions. The method systematically quantifies the relative power proportions across EEG bands using lexicographic ordering, yielding three primary metrics: ![Graphic][3]</img> (scalar): Average vocabulary complexity of EEG spectral patterns, a vector ![Graphic][4]</img> (vector): Normalized prevalence of spectral words across EEG channels., and *H* (matrix): A representation of energy distribution dynamics across EEG bands.

To evaluate the effectiveness of these metrics, we applied the method to a dataset consisting of 30 control subjects and 15 patients with epilepsy. Machine learning classifiers were employed to assess their discriminative power, achieving an accuracy of 95.55% in distinguishing between normal and pathological EEGs.

In addition, this method could provide a population-based frame of reference to contrast the dynamics of a single EEG recording in a single case study. This is part of our ongoing work The structure of the article is the following: Section 2: describes the proposed methodology, including the FFT-based feature extraction and coding process and the design of the experimental setup. Section 3 analyses the results of the study, based on the different machine learning methods applied and the results of the individual and joint behavior of the three metrics proposed to discriminate between the two groups. In Section 4, our results are compared with those obtained by similar studies, followed by a discussion of clinical and research implications. Section 5 summarizes the study’s key findings, as well as suggests further research using this methodology.

Summarizing, this study proposes a novel EEG analysis method that formalizes the visual inspection process by encoding power spectral distributions into structured quantifiers. Our hypothesis is that these FFT-based power relations can effectively differentiate normal EEG recordings from those with pathological features, bridging the gap between subjective clinical interpretation and quantitative signal analysis. This approach aims to enhance standardization in neurophysiological diagnostics while maintaining interpretability for clinicians.

## 2. Materials and Methods

### 2.1. Preliminaries

The electrical brain activity produces a continuous time analog signal *X**a* (t). Nevertheless, studies of brain activity using digital electroencephalography transform the analog signal to a digital one with discrete values for discrete time. This sampling produces a related signal which is a set of discrete points. The interval of time between samples has to be regular. We name it *T**s*. Thus we have ![Formula][5]</img>  with n ∈ ℤ≥0 and where x(n) is obtained by taking a sample value of the analog signal *X**a* every *T**s* seconds. The time interval *T**s* between successive samples is called *the sampling period* or *the sampling interval*. The *sampling rate or sampling frecuency* is *f**s* = 1/*T**s* samples per second or Hz.

The sequence of N numbers x(0), …, x(*N*− 1) obtained sampling our analog signal is transformed to a sequence of complex numbers X(0), …, X(*N* − 1) through the Discrete Fourier Transform (DFT), Bloomfield [8] as follows: ![Formula][6]</img>  The Discrete Fourier Transform changes from the time domain to the frequency domain. The Parseval theorem states ![Formula][7]</img>  so, the energy of the signal is the same as the energy of its Discrete Fourier Transform Wiener [57]. We denote this energy by *E*.

For each frequency k, we call *Power Spectral Density* (PSD) to: ![Formula][8]</img>  and so the ![Formula][9]</img>  that is, the Energy is the sum of all Power Spectral Densities.

In Clinical Neurophysiology, electrical waves are studied in frequency bands, each with its own range of frequencies. However, only frequencies that comply with the Nyquist criterion can be studied. To sample an analog signal with a frequency *f**s* and avoid aliasing [35], the highest frequency that can be considered should be equal to or less than *f**s*/2. This condition is known as the *Nyquist criterion*, which states that the sample frequency *f**s* must be at least twice the bandwidth of the signal [34, 48, 25].

Each hardware has its own sampling rate to transform analog signals into digital ones. In other words, not all hardware samplings allow for the study of all EEG bands. Therefore, it is necessary to take the Nyquist criterion into account when choosing the bands to be studied. Since we used equipment with a sampling rate of 65 Hz in this work, the electroencephalographic bands considered are: ![Formula][10]</img>  For each value of b ranging from 1 to 6, we define the “Accumulated Power per Band” as follows: ![Formula][11]</img>  This definition produces a six-entry vector called the “Accumulated Power per Band Vector,” denoted by ![Graphic][12]</img>. It stores in each entry b the percentage of the total energy contained in the corresponding Ib band. A practical way to present clinically relevant information about the accumulated power per band is to use the “Relative Power Histogram” (RPH), this is a step function that assigns to each interval Ib the percentage of the total energy contained in its band. Figure 1 shows an example of an RPH.

![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F2.medium.gif)

[Figure 1:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F2)

Figure 1: 
Each electroencephalographic band corresponds to a percentage value that represents its relative power.

In neurophysiological clinical practice, physicians refer to ‘the dominant band’ as the electroencephalographic band with the highest percentage in the RPH. Namely, it is the one with the highest energy and the most accumulated power.

Our method considers the physical meaning of the RPH and how physicians use this information in their clinical practice.

### 2.2. Method: The FFT Weed Plot

#### 2.2.1. The Words or Power Relations

In this part, we will introduce a novel quantification method for EEG that systematizes the way physicians read them.

As a key element of our method, we define a word ![Graphic][13]</img> to be a vector with six inputs, each of which corresponds to a different frequency band in the EEG signal. Specifically, ϵ1 corresponds to the Delta band, ϵ2 corresponds to the Theta band, and so on, up to ϵ6, which corresponds to the Beta 3 band.

Given an ![Graphic][14]</img> or its corresponding RPH, we define the *Power Relation* as the word ![Graphic][15]</img>, where each entry encodes the relative decreasing order of the attached band according to its energy in the RPH. To illustrate how to calculate the Power Relation, consider Figure 1: the decreasing order of the bands according to their energy is Alpha, Delta, T heta, Beta1, Beta3, and Beta2. Thus, 1 corresponds to Alpha or ϵ3, 2 to Delta or ϵ1, 3 to T heta or ϵ2, 4 to Beta1 or ϵ4, 5 to Beta3 or ϵ6, and 6 to Beta2 or ϵ5, yielding ![Graphic][16]</img>.

An important point to note is that many different ![Graphic][17]</img> may have the same power relation s![Graphic][18]</img>. This means that our method can capture similarities between EEG signals that may not be immediately apparent from their raw data, since different signals can have the same underlying power relation.

We define the *vocabulary* as the set of all 6! = 720 possible power relations. In this vocabulary, we define the following lexicographic order: Given two words ![Graphic][19]</img> and ![Graphic][20]</img>, we say that ![Graphic][21]</img> if ϵ1 < σ1 or if ϵi = σi for i = 1, …, k with k < 6 and ϵk+1 < σk+1.

Let h be the ordinal of a given word in the vocabulary. Since we have a complete order, we name the word by its h-word, h-power relation, or simply write ![Graphic][22]</img>.

The dominant band is the band attached to the entry where 1 is placed in ![Graphic][23]</img>. In our example, ![Graphic][24]</img>, and 1 is placed in the entry ϵ3 attached to the Alpha-band. Thus, we say that the dominant band is Alpha. Let’s check another example. Take ![Graphic][25]</img>. Here, the dominant band is the T heta-band since the value 1 is located in the second input of ϵ.

This way of encoding the information obtained from the RPH differs from physicians’ usual readings. They consider only the two bands with more energy, but ours is a new perspective, where the energy of all electrophysiological bands are simultaneously taken into account.

#### 2.2.2. Processing the raw signal

At this point, it is necessary to know the data structure, the number of EEG recording channels, the sample rate, and the duration of the recording, among other things.

Similar to Permutation Entropy from Information Theory or Spectogram Multitapper, our method uses a sliding rectangular window through the signal [28, 37]. In our case, we tested the window length (Δ) and step size (τ) for Δ = 4 and Δ = 2 seconds with τ = 0.25 and τ = 0.50 seconds. However, we ultimately chose a window length of Δ = 2 seconds and a step size of τ = 0.25 seconds to capture brief events in the EEG activity.

For a given EEG signal and each channel, we proceed as follows:

1.  Place the rectangular sliding window of length Δ at the beginning of our raw signal. In our case, Δ = 2 seconds, so we have the first 130 data points of our signal. Generally, we set the position of the window as the place of its last data point. For this first window, it is at 130. Then,
    
    *   Apply the FFT to this window and compute the Accumulated Power APb for each band Ib with b = 1, *· · ·*, 6.
    
    *   Finally, encode that information in a word named ![Graphic][26]</img>, as described at the beginning of the section, and let h1 be its corresponding order in the vocabulary. We also set a counter f1 = 1, which will count new words as they appear while we slide the window through the signal.

2.  Next, we slide the window forward, in our case by τ = 0.25 seconds, which means we move forward 17 data points. So, the position of our window is now at 146. Note that the new window includes data points 16 to 146. As in our first window, we apply FFT, compute the PSD for each band, ![Graphic][27]</img>, RPH2, and finally, we have the corresponding word ![Graphic][28]</img> and its order h2 for this window. At this point, we set *f*2 = 1 if *h*1 = *h*2 or *f*2 = 2 otherwise (see Figure 3).

3.  Then, we keep moving our sliding window forward by τ = 0.25 seconds, attaching a ![Graphic][29]</img>, a word ![Graphic][30]</img>, and its order *h*k to each k-window until the sliding window position reaches the last data point of our signal. Note that the subscript k refers to the corresponding step in the procedure, and *f*k gives the number of different new words that appeared until the *k*-step.

In the end, for each EEG channel, we have three sets: the set of all ![Graphic][31]</img> or their graphical representations, the RPHk, the set of all words ![Graphic][32]</img>, and their corresponding orders *h**k* in the vocabulary, which encode our original signal, and the counter f, which gives us the number of different words that appeared (see Figure 2).

![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F3.medium.gif)

[Figure 2:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F3)

Figure 2: 
Each histogram shows its ϵ and the corresponding h.

![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F4.medium.gif)

[Figure 3:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F4)

Figure 3: 
For each window of width Δ, the data is transformed from the time domain to the frequency domain using the FFT. The accumulated power per band is then calculated. This information is visualized in its Relative Power Histogram (RPH). The data is encoded in a vector ![Graphic][33]</img>, and the corresponding Power Relation and lexicographic order h are determined. The process is repeated by advancing one step τ, iterating until the end of the time series.

We call the *word pool of the channel* c, denoted by Wc, the set of words obtained for this given EEG channel, and *Z*c the set of all ![Graphic][34]</img>. We suggest that the number of words in *W*c or vectors in *Z*c which corresponds to the total number of windows, related to the length of the signal, should be close to an order of magnitude greater than the size of the vocabulary.

#### 2.2.3. The output: Construction of the metrics

To construct the metrics we will need to give an order of the EEG channels. In our case we are dealing with 20 channels ordered from 1 to 20 in the following way, (see Figure 4):

![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F5.medium.gif)

[Figure 4:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F5)

Figure 4: 
In our case, we are working with 20 channels, ordered from 1 to 20 based on how the monopolar channels were arranged in the hardware used. The equipment used was a Neutronic 24-channel polysomnograph, model ME-2400, with bimastoid references [47].

![Figure 5:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F6.medium.gif)

[Figure 5:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F6)

Figure 5: 
The weed plot of the FFT is a graphical representation of the matrix M. The colored segments below the X-axis indicate changes in the highest energy or dominant power among the ordinal ranks of the words in our vocabulary.

![Formula][35]</img>  Our recordings for the control group were made with a Neutronic brand, 24channel polysomnograph, model ME-2400 with bimastoid references. This hardware ordered the channels in this way. Let’s introduce the following metrics.

1.  ![Graphic][36]</img> is the average number of different words used, and it can be calculated by counting the number of different words that appeared in each vocabulary Wc, namely the value of the counter *f**c*, which corresponds to the value *f**k* of the last window slid through the signal. The average of these values over all channels c gives us the average number of different words used during the whole recording.

2.  The Frequency of Occurrence Matrix M and the abacus of PRfft, ![Graphic][37]</img>.

We are also interested in the number of times each word appeared in Wc. To reorganize the encoding data, a vector Vc will be created for each channel with 720 entries, one for each word in our vocabulary ordered according to the lexicographic order given above. The first entry corresponds to the word ![Graphic][38]</img>, the second to ![Graphic][39]</img>, the third to ![Graphic][40]</img>, and so on. The frequency of occurrence of the corresponding word in Wc is assigned to each entry, and the vector is then normalized by its Euclidean norm. We call this vector the (normalized) *prevalence vector for channel* c.

To manage all EEG channels, we rearrange the prevalence vectors into the “frequency of occurrence matrix” *M*, which is a 720 × 20 matrix with columns that are vectors of normalized prevalence *V**c*. Note that in this way, we can check the normalized frequency of occurrence of a given word in each channel by reading each row.

To plot each *V**c*, we select a different color for each channel and set the numbers 1 to 720 on the x-axis to represent the ordinals of the words in our vocabulary. We plot the normalized frequency of occurrence of the corresponding word in this channel on the y-axis (see Figure 1). We call this the “Weed plot of FFT”.

To obtain global information for each EEG recording, we compute the mean occurrence of each word in the whole EEG by taking the mean of each row of the frequency matrix M. This yields a new 720-entry vector that we call ![Graphic][41]</img>. As with each *V**c*, we plot ![Graphic][42]</img> and call it the “Abacus of PRfft” (see Figure 6).

![Figure 6:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F7.medium.gif)

[Figure 6:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F7)

Figure 6: 
The abacus of PRfft is the graphic representation of ![Graphic][43]</img>

*   The average cumulative percentage power per band matrix *H*

We go back to our matrix M, and we choose the three highest values of each column. Based on their position in the column, we can recover the three most prevalent words in the corresponding channel. We denote such words by Ac, Bc, and Cc, where A stands for the word with the highest frequency of occurrence, B for the following one, and C for the word with the lowest among the three most prevalent ones in a given channel c.

Recall that *Z**c* is the set of all ![Graphic][44]</img> of channel *c*, and for a given word, you may have many ![Graphic][45]</img> that correspond to it.

We recover from ![Graphic][46]</img> the sequence ![Graphic][47]</img> of all ![Graphic][48]</img> that share the same power relation with the word Ac. We do the same for Bc and Cc, recovering the corresponding sequences ![Graphic][49]</img> and ![Graphic][50]</img>.

Let *X* denote *A, B*, or *C*. Given a sequence ![Graphic][51]</img>, we arrange its elements in a six-column array ![Graphic][52]</img> so that its columns correspond to the values of cumulative percentage power of the corresponding band.

Then, we compute the mean of each column of ![Graphic][53]</img>, and we rearrange this information in a new six-entry vector called the mean percentage vector of cumulative power per band, denoted by ![Graphic][54]</img> corresponding to *X**c* (see Figure 2). Thus, we have three such vectors, one for *A**c*, *B**c*, and *C**c*. Since we are dealing with 20 channels c, at the end, we will have 60 such vectors that we rearrange in a new 60 *×* 6 matrix *H* as follows: ![Formula][55]</img>  We represent each ![Graphic][56]</img>, i.e., each row of H, as a step function that associates each band interval Ib, see equation 7, in the frequency domain with the value of its corresponding entry. This function is called the prevalent mean histogram (PMH) (see Figure 2). Therefore, we have 60 PMHs, three for each EEG channel.

Note that each ![Graphic][57]</img> and its graphic representation PMH recover the distribution of the size of the percentage energy per band of the corresponding three most prevalent words of the corresponding EEG channel. The H matrix is the third relevant metric of the method.(see Figure 7)

![Figure 7:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F8.medium.gif)

[Figure 7:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F8)

Figure 7: 
The graphic representation of matrix H

##### Summarizing, for each EEG, we have the following metrics

*   The average vocabulary used by the EEG, ![Graphic][58]</img>.

*   ![Graphic][59]</img> whose values are the average frequencies of occurrences of each word in all channels, and its plot, the Abacus, which allows us to see in a single image the behavior of the total energy of the EEG recording.

*   The matrix *H*, which recovers the distribution of the sizes of the percentage energy per band of the corresponding three most prevalent words of the corresponding EEG channel.

##### Synopsis of the Weed Plot method. See Figure 8

These are the metrics used in this work. However, more metrics can be defined with this method. For instance, the dynamic abacus, which is the 20-column matrix whose columns are given by the vectors of the ordinals of the words corresponding to the channel c, or the growth curve, where we arrange the vectors Fc with entries fk for each channel c as columns of our matrix. We are not dealing with them in this article, and they are part of the work in progress.

![Figure 8:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F9.medium.gif)

[Figure 8:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F9)

Figure 8: 
Synopsis: The medical interpretation of an EEG involves at least three key metacognitive processes: a) Direction of reading: Vertical and horizontal scanning, represented in the recording image by purple and green arrows. b) Visuoperceptive field: Represented by purple arcs for the macrotemporal scale (minutes or hours) and a green arc for the microtemporal scale (seconds). c) Focus of observation: Depicted by dashedline squares. A light blue rectangle indicates focal observation, while an orange rectangle represents global observation. In this synopsis of the Weed Plot method, we demonstrate how the three quantifiers are constructed to incorporate these metacognitive processes in both their creation and interpretation. The graph illustrates the process, beginning with the transformation of the raw EEG signal in the time domain using a sliding window and applying the FFT. This produces Relative Power Histograms (RPHs) for each EEG channel. The RPH data is encoded into sequences of words for each channel, which are subsequently represented by their lexicographic order, h. The complete EEG recording is then reorganized into the Weed Plot, from which the three quantifiers are derived. The symbols associated with the quantifiers—ϕ (number),![Graphic][60]</img> (vector), and H (matrix)—relate to the metacognitive aspects of EEG interpretation outlined above. This framework integrates the direction of reading, the visuoperceptive field, and the focus of observation, offering a comprehensive approach to EEG analysis.

### 2.3. Application of the Weed Plot method. A small experimental design

To create a proof of concept for the method, we designed a small experiment based on the contrast between two groups of EEG recordings.

The control group, referred to as group A, consisted of 30 individuals with a normal EEG report confirmed by a neurophysiologist. The inclusion criteria were: a) women and men aged between 20-50 years old; b) individuals with at least 7 years of formal education; c) individuals without any neurological or psychiatric medical history and normal values on the Beck Depression Inventory [9, 17] and State-Trait Anxiety Inventory, forms X and Y [50]; d) individuals with a normal EEG report based on St. Louis EK and Frey LC’s criteria of normality [27]; and, e) individuals who had accepted and signed the informed consent.

The exclusion criteria included: a) adults over 50 or under 20 years of age; b) individuals who have not been literate; c) individuals with a clinical history record indicating any issue; d) individuals who consume psychotropic drugs or drugs with effects on the Central Nervous System; e) individuals who have an abusive consumption of psychoactive substances such as cocaine, marijuana, alcohol, or others; f) individuals who consumed moderate or minimal alcohol in the 48 hours before the complete research evaluation; g) individuals who scored above 9 points on the administered BDI scale; h) individuals who obtained values above the 70th percentile for anxiety symptoms on the STAI X scale for anxiety; i) individuals whose electroencephalographic studies showed paroxysmal activity or any other detected functional abnormality; and, j) individuals who did not agree to sign the Informed Consent.

The research protocol was approved by the ethics committee of Hospital Córdoba, Córdoba, Argentina and registered in the Ministry of Health of the Córdoba Province on 20-03-2017 under number 3170.

Each of these EEGs corresponds to the basal electrophysiological condition, i.e., awake with closed eyes. The recordings were made using a Neutronic brand 24-channel polysomnography, model ME-2400 with bimastoid references. The average length of each recording is 30 minutes, and they were sampled at 65Hz.

The second group, referred to as our “single case study” group (SCS), comprises n=15 recordings of patients with epilepsy of different causes, who presented generalized tonic-clonic seizures. This data was obtained from “The TUH EEG Epilepsy Corpus (TUEP),” which is a subset of TUEG containing 100 subjects with epilepsy and 100 subjects without epilepsy, as determined by a certified neurologist. The data was developed in collaboration with several partners, including NIH.

All the pathological recordings we considered for this work correspond to intercritical EEGs in epileptic patients [5], and they were down sampled to 62Hz after decimating the raw signal, which had an original sampling rate of 250Hz. They have an average duration of 30 minutes each.

All EEG’s were processed identically applying our method so we have a set of ![Formula][61]</img>  for control group *A*, and ![Formula][62]</img>  for the single case study group.

We arrange the ![Graphic][63]</img> as columns in a 30 × 720 matrix, and then compute the mean of each row. This gives us a new vector ![Graphic][64]</img> with 720 entries, which represents the average occurrence of each word in the EEGs from our control group A.

Next, we plot the vector ![Graphic][65]</img> and use it as a watermark to compare against each ![Graphic][66]</img> with n = 1, …, 15, which corresponds to our single case study group. For example, we plot the watermark ![Graphic][67]</img> and one of the single case study vectors ![Graphic][68]</img> as shown in Figure 9.

![Figure 9:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F10.medium.gif)

[Figure 9:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F10)

Figure 9: 
The single case study is represented in blue, while the control group watermark is displayed in gray. The colored segments below the X-axis indicate changes in the highest energy or dominant power among the ordinal ranks of the words in our vocabulary. Additionally, the vertical dashed lines represent shifts in the second-highest energy or subdominant power in the ordinal ranks of the words. This visualization mirrors the process of visual perception employed by medical experts when interpreting and reporting EEG findings.

In this way, it is possible to contrast two pieces of information and to reproduce the heuristic knowledge of descriptive statistics-epidemiological comparison done by clinical physicians supported by their accumulated expertise in reading EEGs for years. The development of this tool is left for future work.

### 2.4. Result analyses

#### 2.4.1. Machine learning algorithms

Machine learning algorithms play a crucial role in uncovering intricate patterns within data, enabling the creation of predictive models with increasing accuracy as more data is available for training. In our pursuit of developing machine learning models for disease classification, we have explored both traditional and ensemble learning algorithms following the work in [6]. In total, we have created six ML classifiers: Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), AdaBoost (AB), Gradient Boosting (GB). LR and SVM are Traditional ML models, DT and RF are Bagging models, finally AB and GB are Boosting models [38].

1.  Logistic Regression (LR)
    
    LR is a statistical and machine learning technique used for binary classifications task, where the goal is to predict a binary outcome (CONTROL or SCS) based on values of input variable. Logistic Regression has the following advantages: simplicity of implementation, computational efficiency, efficiency from training perpective, ease of regularization. No scaling is required for input features.

2.  Support Vector Machine (SVM)
    
    SVM can handle both classification and regression problems. In this method hyperplane needs to be defined which is the decision boundary. When there are a set of objects belonging to different classes then decision plane is needed to separate them. The objects may or may not be linearly separable in which case complex mathematical functions called kernels are needed to separate the objects which are members of different classes. SVM aims at correctly classifying the objects based on examples in the training data set. Following are the advantages of SVM : it can handle both semi structured and structured data, it can handle complex function if the appropriate kernel function can be derived. As generalization is adopted in SVM so there is less probability of over fitting. It can scale up with high dimensional data. It does not get stuck in local optima.

3.  Decision Tree (DT)
    
    DT is a Supervised Machine Learning approach to solve classification and regression problems by continuously splitting data based on a certain parameter. The decisions are in the leaves and the data is split in the nodes. Decision Tree might encounter the problem of over-fitting for which Random Forest is the solution which is based on ensemble modeling approach.

4.  Random Forest (RF)
    
    RF is a tree-based ensemble with each tree depending on a collection of random variables. RF can be used for both categorical response variables (Classification) and continuous response (Regression). Random forests have many advantages: they can handle Regression and multiclass classification, are faster in training and predictions, uses very few (one or two) parameters to tune, easy to implement, and can be directly used for high-dimensional problems.

5.  Adaptative boosting (AB)
    
    AB, is a machine learning ensemble technique used primarily for classification problems. AB begins by fitting a classifier on the original dataset and then fits additional copies of the classifier on the same dataset but where the weights of incorrectly classified instances are adjusted such that subsequent classifiers focus more on difficult cases.

6.  Gradient Boosting (GB)
    
    Gradient Boosting is another powerful ensemble machine learning technique that is used primarily for regression and classification problems. It is a generalization of AdaBoost and builds a strong predictive model by combining the predictions of multiple weaker models (typically decision trees) sequentially. Gradient Boosting, in particular, focuses on reducing the errors made by the previous models in the ensemble.

#### 2.4.2. Principal component analysis (PCA)

Principal component analysis (PCA) is a well-established method for feature extraction and dimensionality reduction. In PCA, we seek to represent the d-dimensional data in a lower-dimensional space. This will reduce the degrees of freedom; reduce the space and time complexities. The objective is to represent data in a space that best expresses the variation in a sum-squared error sense. The basic approach in principal components is theoretically rather simple. First, the d-dimensional mean vector *µ* and *d*× *d* covariance matrix *σ* are computed for the full data set. Next, the eigenvectors and eigenvalues are computed, and sorted according to decreasing eigenvalue. Call these eigenvectors e1 with eigenvalue λ1, e2 with eigenvalue λ2, and so on. Sub-sequently, the largest k such eigenvectors are chosen. In practice, this is done by looking at a spectrum of eigenvectors. It can be show that this representation minimizes a squared error criterion [42].

#### 2.4.3. Proposed methodology

We have three sets of data, the average frequencies of occurrences of each word in all channels, ![Graphic][69]</img> the average vocabulary used by the EEG, ![Graphic][70]</img>, and the matrix H, which recovers the distribution of the sizes of the percentage energy per band of the corresponding three most prevalent words of the corresponding EEG channel. The statistical analysis carried out for the proposed method consists of (see Figure 10):

![Figure 10:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F11.medium.gif)

[Figure 10:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F11)

Figure 10: 
Experimental designed of the proposed model

*   Application of the method to predict SCS groups from a healthy control group (CONTROL).

*   A comparative study between the ![Graphic][71]</img> H quantifiers on six ML classifiers.

*   A construction of a classifier by assembling the best-performing classifier for each quantifier through a majority vote.

#### Preprocessing data

For the matrix H we reorder the data in a new dataset so that for each individual there is a single row in the new base. The columns will be represented by the combination of the channels with each word that appears in H and will be completed with its prevalence. In addition, there will be a column for each energy band and channel whose value will be the average of the energy in the three words. more prevalent. This gives us a new base of size 45 x 921. We will refer to this new base as H as well.

We standardize the data on H with **StandarScaler()** library of python. After that we apply PCA to H and to the data frame with the observations of quantifier ![Graphic][72]</img> to reduce the dimensionality of the data and increase its interpretability, using **PCA()** function of python (see Figures A.11a-A.11b) **Cross-validation**.

Generalization of ML models is very important in learning-based models and to make a model that we developed to be independent of the subjects’ data we have implemented the k-fold cross-validation method by taking the k value to be 5. The 5-fold cross-validation helps to validate the ML models that we developed and helps to generalize them for the other new data as well.

#### Model building

This section describe the six machine learning models we built for the classification, with the various parameters on each model and for each of the three quantifiers ![Graphic][73]</img>. We also introduced the final model, which was constructed by assembling the best-performing classifier for each quantifier through a majority vote. These models are executed with the help of the Scikit-learn library. We have used five metrics to evaluate our developed classifier: accuracy, sensitivity, specificity, F1-Score and the “Area Under the Curve” (AUC). Each of them is discussed below.

True Positive (TP): it refers to a situation where a model correctly identifies or classifies a positive condition when the condition is actually present. True Negative (TN): it refers to a situation where a model correctly identifies or classifies a negative condition when the condition is actually absent. False Positive (FP): It refers to a situation where a test or model incorrectly identifies or classifies a negative condition as positive, when in reality, the condition is not present

False Negative (FN): It refers to a situation where a test or model incorrectly identifies or classifies a positive condition as negative, when in reality, the condition is present.

Accuracy: The overall accuracy of the model is given by a formula ![Formula][74]</img>  Sensitivity (true positive rate): It measures the ability of the model or test to correctly identify positive instances or cases when they are truly positive. ![Formula][75]</img>  Specificity: It measures the ability of the model or test to correctly identify negative instances or cases when they are truly negative. ![Formula][76]</img>  F1-Score: is a single metric that combines both Precision and Recall (Sensitivity) into a single value. It is calculated as the harmonic mean of Precision ![Graphic][77]</img> and Recall.

The formula for the F1-score is: ![Formula][78]</img>  The “Area Under the Curve” (AUC) quantifies the performance of a model in distinguishing between positive and negative classes across different decision thresholds. In practical terms, a higher AUC indicates a betterperforming model, particularly in terms of its ability to discriminate between positive and negative cases The details of the six classifiers, for which we conducted parameter tuning within a specified range, are provided in tables 1-3, showcasing the highest recall, accuracy, and AUC achieved.

View this table:
[Table 1:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T1)

Table 1: 1st Quatifier: ![Graphic][79]</img>

View this table:
[Table 2:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T2)

Table 2: 2nd Quatifier: ![Graphic][80]</img>

View this table:
[Table 3:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T3)

Table 3: 3rd Quatifier: *H*

## 3. Results

We studied the quantifiers ![Graphic][81]</img> *H* on six AI models. We have recorded various performance metrics like accuracy, sensitivity, specificity, F1-Score and AUC, which are presented in table 4. The performance parameters of six models are illustrated in the figures A.12-A.16. We refers as 1st quantifier, 2nd quantifier and 3rd quantifier on the graphics to the quantifiers ![Graphic][82]</img> and *H* respectively. Based on the results obtained from the various metrics, we selected SVM models for the first and third quantifiers, while we opted for a Random Forest model for the second quantifier. Details of the parameters used are detailed in the tables 1-3. We combined the three models through a majority voting process. The metrics resulting from this assembly model are presented in the table 5. We have calculated the confusion matrix for the SVM algorithm apply to the the first and third quantifiers, the Random Forest algorithm for the second quantifier, and the assembly model.

View this table:
[Table 4:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T4)

Table 4: Results of the metrics obtained on the six Machine learning algoritms

View this table:
[Table 5:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T5)

Table 5: Results of the metrics obtained on the Assembly Model and the best models for each quantifier

## 4. Discussion

Generally, signal processing methods separate the information by bands, [4]. We suggests an alternative method for analyzing EEG signals by examining the behavior of the accumulated power of all bands simultaneously, which could be more efficient than separating the information by bands.

Artifacts in EEG signals during prolonged studies produce different types of noise, including brief, intermittent, and single-channel artifacts, among others. These artifacts pose a challenge for neurophysiological diagnosis. In [45], a method was developed to detect discharges even in high noise environments. Our method describes the behavior of recording energy by using averaged values in the calculation of various metrics. This averaging process helps to smooth out some of the artifacts that interfere with the descriptive neurophysiological diagnosis of cumulative power.

On the other hand, much of the older equipment used in medical practice, such as in our case, has a low sampling rate. This poses a problem for the detection of spikes in intercritical recordings for focal epilepsies [5]. In [14], colleagues present a method for automatically detecting epileptic patterns in EEG using deep learning. The multi-head self-attention mechanism effectively identifies long-term dependencies as well as dynamic temporal correlations between short-term temporal pattern features and sequential relationships. Contextual representations are introduced into a bidirectional short-term memory network (bilstm), achieving a sensitivity of 96.5 %, specificity of 97.04%, F1-score of 96.6%, and precision of 96.2%.

While these results surpass ours, our method focuses on the study of basal activity (wakefulness with closed eyes) in interictal recordings, providing insights for both diagnosis and monitoring the progression of patients already diagnosed with epilepsy and/or other clinical conditions through quantifiable metrics. The method we propose demonstrates how the behavior of energy ratios in the background activity can be quantified and used to supplement evolutionary neurophysiological diagnosis in epilepsies in general and in other clinical situations.

Overall, the proposed method could be a valuable addition to the existing tools and techniques for analyzing EEG signals, and it could potentially lead to more accurate and efficient diagnosis and treatment of neurological and neuropsychiatric conditions [22].

## 5. Conclusions

The functional neurophysiological diagnosis of an EEG is a multidimensional process that involves evaluating various characteristics to determine whether the recording is normal [58, 20]. One critical aspect is the energy distribution, which we quantify using specific metrics to provide a structured, objective analysis.

Our method successfully classifies 96.6% of control group patients as normal and detects abnormalities even for individuals without specialized training in clinical neurophysiology. A key strength of this approach is its reliance on a rigorously defined control group, ensuring diagnostic consistency even when the underlying clinical pathology is unknown. The proposed metrics demonstrate stable performance, enabling different laboratories to establish their own control groups and seamlessly integrate the method into daily clinical practice.

The methodology developed in this study is robust in both information encoding and systematic EEG interpretation. By standardizing the assessment of energy distribution, these metrics improve the objectivity of EEG evaluation across different experimental designs and hardware configurations. Given the substantial variability in medical EEG reporting, as highlighted by a recent study analyzing seizure onset patterns in 1799 EEG recordings, our method addresses the need for quantitative, standardized diagnostic descriptors.

Beyond classification accuracy, the proposed quantification approach enhances inter-rater reliability and lays the foundation for automated diagnostic tools. Importantly, it distinguishes neurophysiological descriptive diagnosis from psychiatric or neurological diagnoses, ensuring that EEG interpretation remains within its intended clinical scope as a complementary diagnostic tool. The method’s versatility allows EEG analysis to be independent of hardware constraints, supporting various channel configurations. Additionally, its computational simplicity makes it suitable for real-world clinical applications, though further refinements may facilitate broader adoption.

In conclusion, this FFT-based quantification framework represents a significant step toward standardizing EEG diagnostics. By improving objectivity and reproducibility while maintaining clinical relevance, it contributes to a more accessible and structured approach to neurophysiological assessment.

## Author Contributions

Conceptualization, Juan Manuel Diaz López and Carina Boyallian; methodology, Juan Manuel Díaz López; software, Juan Manuel Díaz López and José Curetti; validation, Juan Manuel Díaz López and Carina Boyallian and Vanesa Meinardi.; formal analysis, Juan Manuel Díaz López and Carina Boyallian; investigation, Juan Manuel Díaz López and Carina Boyallian; resources, Juan Manuel Díaz López; EEG control group data collection, Juan Manuel Díaz López; data curation, Juan Manuel Díaz López and Vanesa Meinardi.; writing—original draft preparation, Juan Manuel Díaz López, Carina Boyallian and Vanesa Meinardi; writing—review and editing, Juan Manuel Díaz López and Carina Boyallian; visualization, Juan Manuel Díaz López and Vanesa Meinardi.; supervision, Carina Boyallian and Hugo Díaz Farjreldines. All authors have read and agreed to the published version of the manuscript.

## Funding

This research received no external funding.

### Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Hospital Córdoba: Comite Institucional de Etica de la investigación en salud del adulto. Ministerio de Salud de la Provincia de Córdoba. (protocol code CO000152 and 20-03-2017).

### Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

## Data Availability Statement

The data presented in this study are available on request from the corresponding author.

## Conflicts of Interest

The authors declare no conflicts of interest.

## Acknowledgments

The first author thanks Dr. Graciela Lucatelli and Dr. Atilio Bollo, for their valuable support. The last author thanks Dr. Lamberti, Dr. Buonanote and Dr. Rosso Osvaldo A. for useful discussions.

## Appendix A

![Figure A.11:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F12.medium.gif)

[Figure A.11:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F12)

Figure A.11: 
Principal Component Analysis

![Figure A.12:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F13.medium.gif)

[Figure A.12:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F13)

Figure A.12: 
Accuracy (%) of six classifiers under three quantifiers

![Figure A.13:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F14.medium.gif)

[Figure A.13:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F14)

Figure A.13: 
Sensitivity (%) of six classifiers under three quantifiers

![Figure A.14:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F15.medium.gif)

[Figure A.14:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F15)

Figure A.14: 
Specificity (%) of six classifiers under three quantifiers

![Figure A.15:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F16.medium.gif)

[Figure A.15:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F16)

Figure A.15: 
F1-Score (%) of six classifiers under three quantifiers

![Figure A.16:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2025/03/15/2025.03.14.25323563/F17.medium.gif)

[Figure A.16:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/F17)

Figure A.16: 
AUC (%) of six classifiers under three quantifiers

View this table:
[Table A.6:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T6)

Table A.6: Confusion Matrices for the SVM model applied to the 1st quantifier

View this table:
[Table A.7:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T7)

Table A.7: Confusion Matrices for the Random Forest model applied to the 2nd quantifier

View this table:
[Table A.8:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T8)

Table A.8: Confusion Matrices for the SVM model on the 3rd quantifier

View this table:
[Table A.9:](http://medrxiv.org/content/early/2025/03/15/2025.03.14.25323563/T9)

Table A.9: Confusion Matrices for the assembly of SVM and Random Forest models

## Footnotes

*   *Email addresses:* juan.manuel.diaz{at}unc.edu.ar (Juan M. Díaz López), jose.curetti{at}unc.edu.ar (Jose Curetti), vmeinardi{at}unvm.edu.ar (Vanesa B. Meinardi), hugo.diaz.fajreldines{at}unc.edu.ar (Hugo Díaz Farjreldines), cboyallian{at}unc.edu.ar (Carina Boyallian)

*   Received March 14, 2025.
*   Revision received March 14, 2025.
*   Accepted March 15, 2025.


*   © 2025, Posted by Cold Spring Harbor Laboratory

The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission.

## References

1.  [1]. C. Abdallah,  D. Mansilla,  E. Minato,  C. Grova,  S. Beniczky, and  B. Frauscher. Systematic review of seizure-onset patterns in stereoelectroencephalography: Current state and future directions. Clinical Neurophysiology, 163:112–123, 2024.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.clinph.2024.04.016&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=38733701&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

2.  [2]. N. Ahmadi,  Y. Pei,  E. Carrette,  A. P. Aldenkamp, and  M. Pechenizkiy. Eeg-based classification of epilepsy and pnes: Eeg microstate and functional brain network features. Brain informatics, 7(1):1–22, 2020.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32064541&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

3.  [3]. W. Al-Salman,  Y. Li,  P. Wen,  F. S. Miften,  A. Y. Oudah, and  H. R. Al Ghayab. Extracting epileptic features in eegs using a dual-tree complex wavelet transform coupled with a classification algorithm. Brain Research, 1779:147777, 2022.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34999060&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

4.  [4]. A. Anuragi,  D. S. Sisodia, and  R. B. Pachori. Epileptic-seizure classification using phase-space representation of fbse-ewt based eeg sub-band signals and ensemble learners. Biomedical Signal Processing and Control, 71:103138, 2022.
    
    

5.  [5]. M. O. Baud,  K. Schindler, and  V. R. Rao. Under-sampling in epilepsy: limitations of conventional eeg. Clinical neurophysiology practice, 6:41–49, 2021.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33532669&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

6.  [6]. P. D. K. P. J. R. N. C. M. G. Bethany Gosala. Wavelet transforms for feature engineering in eeg data processing: An application on schizophrenia. Biomedical Signal Processing and Control, 85(3):1143–1151, 2023.
    
    

7.  [7]. S. Blanco,  R. Q. Quiroga,  O. Rosso, and  S. Kochen. Time-frequency analysis of electroencephalogram series. Physical review E, 51(3):2624, 1995.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9962925&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

8.  [8]. P. Bloomfield. Fourier analysis of time series: an introduction. John Wiley & Sons, 2004.
    
    

9.  [9]. S. Bonicatto,  A. M. Dew, and  J. J. Soria. Analysis of the psychometric properties of the spanish version of the beck depression inventory in argentina. Psychiatry Research, 79(3):277–285, 1998.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0165-1781(98)00047-X&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9704874&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000074980300010&link_type=ISI) 

10. [10]. T. Bossomaier,  L. Barnett,  M. Harré,  J. T. Lizier,  T. Bossomaier,  L. Barnett,  M. Harré, and  J. T. Lizier. Transfer entropy. Springer, 2016.
    
    

11. [11]. A. Cataldo,  S. Criscuolo,  E. De Benedetto,  A. Masciullo,  M. Pesola,  J. Picone, and  R. Schiavoni. Eeg complexity-based algorithm using multiscale fuzzy entropy: Towards a detection of alzheimer’s disease. Measurement, 225:114040, 2024.
    
    

12. [12]. í. R. S. C. Da Paz,  P. F. Silva,  H. B. de Lucas,  S. H. Lira,  O. A. Rosso, and  F. S. Matias. Symbolic information approach applied to human intracranial data to characterize and distinguish different congnitive processes. Physical Review E, 110(2):024403, 2024.
    
    

13. [13]. J. M. Diaz,  D. M. Mateos, and  C. Boyallian. Complexity-entropy maps as a tool for the characterization of the clinical electrophysiological evolution of patients under pharmacological treatment with psychotropic drugs. Entropy, 19(10):540, 2017.
    
    

14. [14]. A. K. Dutta,  M. Raparthi,  M. Alsaadi,  M. W. Bhatt,  S. B. Dodda,  M. Sandhu, and  J. C. Patni. Deep learning-based multi-head self-attention model for human epilepsy identification from eeg signal for biomedical traits. Multimedia Tools and Applications, pages 1–23, 2024.
    
    

15. [15]. J. S. Ebersole and  T. A. Pedley. Current practice of clinical electroencephalography. Lippincott Williams & Wilkins, 2003.
    
    

16. [16]. J. Gancio,  C. Masoller, and  G. Tirabassi. Permutation entropy analysis of eeg signals for distinguishing eyes-open and eyes-closed brain states: Comparison of different approaches. Chaos: An Interdisciplinary Journal of Nonlinear Science, 34(4), 2024.
    
    

17. [17]. H. Ghassemzadeh,  R. Mojtabai,  N. Karamghadiri, and  N. Ebrahimkhani. Psychometric properties of a persian-language version of the beck depression inventory-second edition: Bdi-ii-persian. Depression and anxiety, 21(4):185–192, 2005.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/da.20070&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16075452&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000231180000006&link_type=ISI) 

18. [18]. N. Guisande,  O. A. Rosso, and  F. Montani. Neural dynamics associated with biological variation in normal human brain regions. Entropy, 26 (10):828, 2024.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=39451905&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

19. [19]. T. Harmony. Neurometric assessment of brain dysfunction in neurological patients. Taylor & Francis, 2021.
    
    

20. [20]. L. J. Hirsch,  M. W. Fong,  M. Leitinger,  S. M. LaRoche,  S. Beniczky,  N. S. Abend,  J. W. Lee,  C. J. Wusthoff,  C. D. Hahn,  M. B. Westover, et al. American clinical neurophysiology society’s standardized critical care eeg terminology: 2021 version. Journal of clinical neurophysiology: official publication of the American Electroencephalographic Society, 38 (1):1, 2021.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33475321&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

21. [21]. Z. Huang and  B. W.-K. Ling. Sleeping stage classification based on joint quaternion valued singular spectrum analysis and ensemble empirical mode decomposition. Biomedical Signal Processing and Control, 71: 103086, 2022.
    
    

22. [22]. J. W. Kam,  A. R. Bolbecker,  B. F. O’Donnell,  W. P. Hetrick, and  C. A. Brenner. Resting state eeg power and coherence abnormalities in bipolar disorder and schizophrenia. Journal of psychiatric research, 47(12): 1893–1901, 2013.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jpsychires.2013.09.009&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24090715&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

23. [23]. X. Kang,  X. Liu,  S. Chen,  W. Zhang,  S. Liu, and  D. Ming. Major depressive disorder recognition by quantifying eeg signal complexity using proposed aplzc and awplzc. Journal of Affective Disorders, 356:105–114, 2024.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=38580036&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

24. [24]. P. W. Lamberti and  A. P. Majtey. Distances measures for probability distributions. Concepts and Recent Advances in Generalized Information Measures and Statistics, page 130, 2013.
    
    

25. [25]. H. Landau. Sampling, data transmission, and the nyquist rate. Proceedings of the IEEE, 55(10):1701–1706, 1967.
    
    

26. [26]. M. Lo Giudice,  G. Varone,  C. Ieracitano,  N. Mammone,  G. G. Tripodi,  E. Ferlazzo,  S. Gasparini,  U. Aguglia, and  F. C. Morabito. Permutation entropy-based interpretability of convolutional neural network models for interictal eeg discrimination of subjects with epileptic seizures vs. psychogenic non-epileptic seizures. Entropy, 24(1):102, 2022.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35052128&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

27. [27]. E. K. S. Louis,  L. C. Frey,  J. W. Britton,  J. L. Hopp,  P. Korb,  M. Z. Koubeissi,  W. E. Lievens, and  E. M. Pestana-Knight. The normal eeg. In Electroencephalography (EEG): An Introductory Text and Atlas of Normal and Abnormal Findings in Adults, Children, and Infants [Internet]. American Epilepsy Society, 2016.
    
    

28. [28]. D. Mateos,  J. M. Diaz, and  P. W. Lamberti. Permutation entropy applied to the characterization of the clinical evolution of epileptic patients under pharmacological treatment. Entropy, 16(11):5668–5676, 2014.
    
    

29. [29]. D. M. Mateos,  L. E. Riveaud, and  P. W. Lamberti. Detecting dynamical changes in time series by using the jensen shannon divergence. Chaos: An Interdisciplinary Journal of Nonlinear Science, 27(8), 2017.
    
    

30. [30]. D. M. Mateos,  J. Gómez-Ramírez, and  O. A. Rosso. Using time causal quantifiers to characterize sleep stages. Chaos, Solitons & Fractals, 146: 110798, 2021.
    
    

31. [31]. V. K. Mehla,  A. Singhal,  P. Singh, and  R. B. Pachori. An efficient method for identification of epileptic seizures from eeg signals using fourier analysis. Physical and Engineering Sciences in Medicine, 44(2): 443–456, 2021.
    
    

32. [32]. F. A. Nascimento,  J. Jing,  R. Strowd,  I. S. Sheikh,  D. Weber,  J. R. Gavvala,  A. Maheshwari,  A. Tanner,  M. Ng,  K. Vinayan, et al. Competency-based eeg education: a list of “must-know” eeg findings for adult and child neurology residents. Epileptic Disorders, 1(1):1–4, 2022.
    
    

33. [33]. E. Niedermeyer et al. The normal eeg of the waking adult. Electroencephalography: Basic principles, clinical applications, and related fields, 167:155–164, 2005.
    
    

34. [34]. H. Nyquist. Certain topics in telegraph transmission theory. Transactions of the American Institute of Electrical Engineers, 47(2):617–644, 1928.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/T-AIEE.1928.5055024&link_type=DOI) 

35. [35]. A. V. Oppenheim,  J. R. Buck, and  R. W. Schafer. Discrete-time signal processing. Vol. 2. Upper Saddle River, NJ: Prentice Hall, 2001.
    
    

36. [36]. M. Pereyra,  P. Lamberti, and  O. Rosso. Wavelet jensen–shannon divergence as a tool for studying the dynamics of frequency band components in eeg epileptic seizures. Physica A: Statistical Mechanics and its Applications, 379(1):122–132, 2007.
    
    

37. [37]. P. L. Purdon,  A. Sampson,  K. J. Pavone, and  E. N. Brown. Clinical electroencephalography for anesthesiologists: part i: background and basic signatures. Anesthesiology, 123(4):937–960, 2015.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/ALN.0000000000000841&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26275092&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

38. [38]. S. Ray. A quick review of machine learning algorithms. International Conference on Machine Learning, Big Data, Cloud and Parallel Computing, 85(3):1143–1151, 2019.
    
    

39. [39]. M. A. Ré,  J. M. Díaz, and  G. G. Aguirre Varela. Comparación de registros electroencefalográficos por divergencia de jensen shannon. In XXIII Congreso de Métodos Numéricos y sus Aplicaciones (ENIEF)(La Plata, noviembre 2017), 2017.
    
    

40. [40]. M. Á. Ré,  J. M. Díaz, and  G. G. A. Varela. A measure of distance for eeg analysis. Revista Argentina de Bioingeniería, 22(3):54–59, 2018.
    
    

41. [41]. J. F. Restrepo,  D. M. Mateos, and  J. M. D. López. A transfer entropy-based methodology to analyze information flow under eyes-open and eyes-closed conditions with a clinical perspective. Biomedical Signal Processing and Control, 86:105181, 2023.
    
    

42. [42]. D. S. R.O. Duda,  P.E. Hart. Pattern classification. (2nd ed.), John Wiley & Sons, 2001.
    
    

43. [43]. O. Rosso,  M. Martin,  A. Figliola,  K. Keller, and  A. Plastino. Eeg analysis using wavelet-based information tools. Journal of neuroscience methods, 153(2):163–182, 2006.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jneumeth.2005.10.009&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16675027&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000238101700001&link_type=ISI) 

44. [44]. O. A. Rosso,  S. Blanco,  J. Yordanova,  V. Kolev,  A. Figliola,  M. Schürmann, and  E. Başar. Wavelet entropy: a new tool for analysis of short duration brain electrical signals. Journal of neuroscience methods, 105 (1):65–75, 2001.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0165-0270(00)00356-3&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11166367&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000166873400007&link_type=ISI) 

45. [45]. Y. Ru,  J. Li,  H. Chen, and  J. Li. Epilepsy detection based on variational mode decomposition and improved sample entropy. Computational Intelligence and Neuroscience, 2022, 2022.
    
    

46. [46]. S. Sanei and  J. A. Chambers. EEG signal processing. John Wiley & Sons, 2013.
    
    

47. [47]. M. Seeck,  L. Koessler,  T. Bast,  F. Leijten,  C. Michel,  C. Baumgartner,  B. He, and  S. Beniczky. The standardized eeg electrode array of the ifcn. Clinical Neurophysiology, 128(10):2070–2077, 2017.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.clinph.2017.06.254&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28778476&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

48. [48]. C. E. Shannon. Communication in the presence of noise. Proceedings of the IRE, 37(1):10–21, 1949.
    
    

49. [49]. A. Shoeibi,  N. Ghassemi,  M. Khodatars,  P. Moridian,  R. Alizadehsani,  A. Zare,  A. Khosravi,  A. Subasi,  U. R. Acharya, and  J. M. Gorriz. Detection of epileptic seizures on eeg signals using anfis classifier, autoencoders and fuzzy entropies. Biomedical Signal Processing and Control, 73:103417, 2022.
    
    

50. [50]. C. D. Spielberger and  P. R. Vagg. Test anxiety: Theory, assessment, and treatment. Taylor & Francis, 1995.
    
    

51. [51]. M. C. Tjepkema-Cloostermans,  R. C. de Carvalho, and  M. J. van Putten. Deep learning for detection of focal epileptiform discharges from scalp eeg recordings. Clinical neurophysiology, 129(10):2191–2196, 2018.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.clinph.2018.06.024&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30025804&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

52. [52]. S. Tong and  N. V. Thankor. Quantitative EEG analysis methods and clinical applications. Artech House, 2009.
    
    

53. [53]. T. Tsuchida,  C. J. Wusthoff,  R. A. Shellhaas,  N. S. Abend,  C. D. Hahn,  J. E. Sullivan,  S. Nguyen,  S. Weinstein,  M. S. Scher,  J. J. Riviello, et al. Acns standardized eeg terminology and categorization for the description of continuous eeg monitoring in neonates. J. Clin, 2013.
    
    

54. [54]. R. Vicente,  M. Wibral,  M. Lindner, and  G. Pipa. Transfer entropy—a model-free measure of effective connectivity for the neurosciences. Journal of computational neuroscience, 30(1):45–67, 2011.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10827-010-0262-3&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20706781&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

55. [55]. Q. Wang,  Z. Ren,  M. Yue,  Y. Zhao,  B. Wang,  Z. Zhao,  B. Wen,  Y. Hong,  Y. Chen,  T. Zhao, et al. A model for the diagnosis of anxiety in patients with epilepsy based on phase locking value and lempel-ziv complexity features of the electroencephalogram. Brain Research, 1824:148662, 2024.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37924926&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

56. [56]. A. M. White,  P. A. Williams,  D. J. Ferraro,  S. Clark,  S. D. Kadam,  F. E. Dudek, and  K. J. Staley. Efficient unsupervised algorithms for the detection of seizures in continuous eeg recordings from rats after brain injury. Journal of neuroscience methods, 152(1-2):255–266, 2006.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jneumeth.2005.09.014&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16337006&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000236485700029&link_type=ISI) 

57. [57]. N. Wiener. Generalized harmonic analysis. Acta mathematica, 55(1): 117–258, 1930.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02546511&link_type=DOI) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000187873600004&link_type=ISI) 

58. [58]. S. Wüstenhagen,  D. Terney,  E. Gardella,  P. M. Larsen,  C. Rømer,  H. Aurlien, and  S. Beniczky. Eeg normal variants: A prospective study using the score system. Clinical Neurophysiology Practice, 7:183–200, 2022.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35865124&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

59. [59]. L. Yang,  C. Xu,  Y. Qin,  K. Chen,  Y. Xie,  X. Zhou,  T. Liu,  S. Tan,  J. Liu, and  D. Yao. Exploring resting-state eeg oscillations in patients with neuromyelitis optica spectrum disorder. Brain Research Bulletin, 208:110900, 2024.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=38364986&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F15%2F2025.03.14.25323563.atom) 

60. [60]. Q. Yang,  L. Liu,  J. Wang,  Y. Zhang,  N. Jiang, and  M. Zhang. Wavelet entropy analysis of electroencephalogram signals during wake and different sleep stages in patients with insomnia disorder. Nature and Science of Sleep, pages 347–358, 2024.

 [1]: /embed/inline-graphic-1.gif
 [2]: /embed/inline-graphic-2.gif
 [3]: /embed/inline-graphic-3.gif
 [4]: /embed/inline-graphic-4.gif
 [5]: /embed/graphic-2.gif
 [6]: /embed/graphic-3.gif
 [7]: /embed/graphic-4.gif
 [8]: /embed/graphic-5.gif
 [9]: /embed/graphic-6.gif
 [10]: /embed/graphic-7.gif
 [11]: /embed/graphic-8.gif
 [12]: /embed/inline-graphic-5.gif
 [13]: /embed/inline-graphic-6.gif
 [14]: /embed/inline-graphic-7.gif
 [15]: /embed/inline-graphic-8.gif
 [16]: /embed/inline-graphic-9.gif
 [17]: /embed/inline-graphic-10.gif
 [18]: /embed/inline-graphic-11.gif
 [19]: /embed/inline-graphic-12.gif
 [20]: /embed/inline-graphic-13.gif
 [21]: /embed/inline-graphic-14.gif
 [22]: /embed/inline-graphic-15.gif
 [23]: /embed/inline-graphic-16.gif
 [24]: /embed/inline-graphic-17.gif
 [25]: /embed/inline-graphic-18.gif
 [26]: /embed/inline-graphic-19.gif
 [27]: /embed/inline-graphic-20.gif
 [28]: /embed/inline-graphic-21.gif
 [29]: /embed/inline-graphic-22.gif
 [30]: /embed/inline-graphic-23.gif
 [31]: /embed/inline-graphic-24.gif
 [32]: /embed/inline-graphic-25.gif
 [33]: F4/embed/inline-graphic-26.gif
 [34]: /embed/inline-graphic-27.gif
 [35]: /embed/graphic-14.gif
 [36]: /embed/inline-graphic-28.gif
 [37]: /embed/inline-graphic-29.gif
 [38]: /embed/inline-graphic-30.gif
 [39]: /embed/inline-graphic-31.gif
 [40]: /embed/inline-graphic-32.gif
 [41]: /embed/inline-graphic-33.gif
 [42]: /embed/inline-graphic-34.gif
 [43]: F7/embed/inline-graphic-35.gif
 [44]: /embed/inline-graphic-36.gif
 [45]: /embed/inline-graphic-37.gif
 [46]: /embed/inline-graphic-38.gif
 [47]: /embed/inline-graphic-39.gif
 [48]: /embed/inline-graphic-40.gif
 [49]: /embed/inline-graphic-41.gif
 [50]: /embed/inline-graphic-42.gif
 [51]: /embed/inline-graphic-43.gif
 [52]: /embed/inline-graphic-44.gif
 [53]: /embed/inline-graphic-45.gif
 [54]: /embed/inline-graphic-46.gif
 [55]: /embed/graphic-16.gif
 [56]: /embed/inline-graphic-47.gif
 [57]: /embed/inline-graphic-48.gif
 [58]: /embed/inline-graphic-49.gif
 [59]: /embed/inline-graphic-50.gif
 [60]: F9/embed/inline-graphic-51.gif
 [61]: /embed/graphic-19.gif
 [62]: /embed/graphic-20.gif
 [63]: /embed/inline-graphic-52.gif
 [64]: /embed/inline-graphic-53.gif
 [65]: /embed/inline-graphic-54.gif
 [66]: /embed/inline-graphic-55.gif
 [67]: /embed/inline-graphic-56.gif
 [68]: /embed/inline-graphic-57.gif
 [69]: /embed/inline-graphic-58.gif
 [70]: /embed/inline-graphic-59.gif
 [71]: /embed/inline-graphic-60.gif
 [72]: /embed/inline-graphic-61.gif
 [73]: /embed/inline-graphic-62.gif
 [74]: /embed/graphic-23.gif
 [75]: /embed/graphic-24.gif
 [76]: /embed/graphic-25.gif
 [77]: /embed/inline-graphic-63.gif
 [78]: /embed/graphic-26.gif
 [79]: T1/embed/inline-graphic-64.gif
 [80]: T2/embed/inline-graphic-65.gif
 [81]: /embed/inline-graphic-66.gif
 [82]: /embed/inline-graphic-67.gif