Estimation of direct nonlinear effective connectivity using information theory and multilayer perceptron

doi:10.1016/j.jneumeth.2014.04.008

Journal of Neuroscience Methods

Volume 229, 30 May 2014, Pages 53-67

https://doi.org/10.1016/j.jneumeth.2014.04.008 Get rights and content

Highlights

•
The direct nonlinear effective connectivity of high-dimensional datasets is estimated.
•
A combination of regressor selection, MLP modeling and Granger Causality is proposed.
•
βmRMR-MLP-GC can deal with highly nonlinear, high-dimensional datasets.
•
In simulations, βmRMR-MLP-GC yields both high sensitivity and specificity.
•
βmRMR-MLP-GC detects Back-to-Front alpha information flows in resting brain.

Abstract

Background

Despite the variety of effective connectivity measures, few methods can quantify direct nonlinear causal couplings and most of them are not applicable to high-dimensional datasets.

New method

In this paper, a novel approach (called βmRMR-MLP-GC) is proposed to estimate direct nonlinear effective connectivity of high-dimensional datasets. βmRMR is used to select a suitable subset of candidate regressors for approximating each neural (here EEG) signal. The multilayer perceptron (MLP) is used for multivariate characterization of EEG signals while the optimum MLP structure is selected using an iterative cross-validation scheme. Finally a causality measure is defined based on Granger Causality (GC) concept to quantify the casual relations among EEG channels.

Results

Applying βmRMR-MLP-GC to high-dimensional simulated datasets with different linear and nonlinear structures yields sensitivity and specificity values higher than 95%. Also, applying it to eyes-closed resting state EEG of six normal subjects in the alpha frequency band yields significant net activity propagations from the posterior to anterior brain regions. This is in accordance with the most previous studies in this field.

Comparison with existing method(s)

βmRMR-MLP-GC is compared with Granger Causality Index, Conditional Granger Causality Index, and Transfer Entropy. It outperforms these methods in terms of sensitivity and specificity in simulated datasets. Also, βmRMR-MLP-GC detects the most number of significant and reproducible Back-to-Front net information flows among the specified brain regions and highlights the posterior brain regions as dominant source of alpha activity propagation.

Conclusions

βmRMR-MLP-GC provides a novel tool to estimate the direct nonlinear causal networks of high-dimensional datasets.

Introduction

The human brain is definitely one of the most complex natural systems in the world. Despite lots of guided studies for discovering its functional organization, there are still many unknowns about it. Founded on the “brain specialization concept” most studies aimed at exploring the brain regions specialized for particular brain tasks in the past years (Jirsa and McIntosh, 2007). The emergence of “brain integration concept” directed many researches in recent years toward the brain connectivity (Jirsa and McIntosh, 2007). Brain connectivity is a broad concept that can be generally divided into three categories: Structural, Functional and Effective connectivity. The structural connectivity refers to the structural connections of brain regions via nerve fibers. The functional connectivity deals with the temporal interdependencies among the activity of brain regions. The effective connectivity characterizes the causal (directed) effects among brain regions. Reviews of the most commonly used functional and effective connectivity measures can be found in Greenblatt et al. (2012), Sakkalis (2011), Pereda et al. (2005), and Muskulus et al. (2009).

Exploring the effective connectivity networks helps neurologists to investigate the changes of brain causal networks due to the brain disorders such as autism (Wicker et al., 2008), Alzheimer (Liu et al., 2012), schizophrenia (Diaconescu et al., 2011), epilepsy (Amini et al., 2010a, Amini et al., 2010b). This helps the physicians to find effective treatments for the brain disorders.

Among the functional brain imaging modalities, EEG and MEG are capable of capturing the temporal dynamics of cortical connectivity owing to their high temporal resolution (He et al., 2011). Consequently, they are popular modalities for functional/effective connectivity estimation, despite their limitations in terms of spatial resolution and volume conduction effects (Schoffelen and Gross, 2009).

According to Wiener Causality concept (Wiener, 1956), if adding the past and present information of (system) X to the past and present information of (system) Y improves predicting the future of (system) Y, X is the cause of Y. Granger (1969) limited the very general definition of Wiener Causality (Wiener, 1956) to linear bivariate autoregressive models and proposed a mathematical formulation to infer the Wiener Causality quantitatively (Granger, 1969). The terms Wiener causality (WC) and Granger Causality (GC) are usually used interchangeably. Geweke (1982) proposed the most practical GC-based effective connectivity measure, which is usually known as Granger Causality Index (GCI) (Geweke, 1982). To distinguish between direct and indirect causal links, a variety of multivariate linear GC-based measures have been developed, for example Conditional Granger Causality Index (CGCI) in the time domain (Ding et al., 2007), and DTF, PDC, dDTF and ffDTF in the frequency domain (Kaminski and Liang, 2005, Wu et al., 2011). They are all based on linear Multivariate Auto-Regressive (MVAR) models.

It is widely assumed that the interactions among neuronal populations are nonlinear (Marinazzo et al., 2011, Pereda et al., 2005, Ioannides and Mitsis, 2010, Stam, 2005; Gourévitch et al., 2006). Consequently, using the linear connectivity measures may oversimplify the functional organization of brain and even lead to incorrect estimation of causal relations. A few multivariate and nonlinear effective connectivity measures have been proposed based on GC in the literature. Some of them are parametric and model-based like the kernel-based nonlinear Granger Causality measures (Marinazzo et al., 2008, Guo et al., 2008) and locally linear MVAR models on the reconstructed attractor space (Chen et al., 2004) whereas the others are nonparametric. Some of those nonparametric methods are partial entropy with non-uniform embedding (Faes et al., 2011), and Partial Transfer Entropy (PTE) (Gomez-Herrero, 2010, Vakorin et al., 2009) which is an extension of well-known Transfer Entropy (TE) measure (Schreiber, 2000). Some other nonparametric methods are based on the Correlation Integral (CI) (Zhidong et al., 2010, Gourévitch et al., 2006).

An ideal effective connectivity measure has three characteristics: (1) It uses no restrictive model, therefore it can explore both linear and nonlinear causal connections. (2) It is multivariate in order to distinguish the direct connections from indirect ones. It has been shown that bivariate measures may yield misleading results on multivariate data (Kus et al., 2004). (3) It must be practically applicable to high-dimensional datasets (e.g. EEG/MEG with high number of channels), and to systems with long memory or large coupling delays.

Despite the existence of many effective connectivity measures, very few measures have all three features above. Due to the lack of a systematic dimensionality reduction stage, none of the multivariate nonlinear measures mentioned above are applicable to high-dimensional datasets except the method proposed by Marinazzo et al. (2008) and may be the one proposed by Faes et al. (2011). Both these measures use some kinds of dimensionality reduction techniques. The nonparametric nonlinear measures (in fact, information theoretic-based and CI-based measures) need the estimation of multivariate probability density functions. Consequently, they are not applicable to high-dimensional datasets since their required data samples exponentially grow with the number of variables. As a result, PTE (Gomez-Herrero, 2010, Vakorin et al., 2009) and CI-based methods (Zhidong et al., 2010, Gourévitch et al., 2006) are not applicable to high-dimensional datasets. Even the method proposed by Faes et al., 2011 may produce inaccurate results if the number of selected partial terms in the partial entropy becomes slightly big. Also in the methods proposed by Chen et al. (2004) and Guo et al. (2008) the number of modeling parameters is proportional to the square of number of dimensions so they do not have the third desired feature of an ideal effective connectivity measure, too.

In this paper, we propose a new approach to estimate the total (linear and nonlinear) effective connectivity network of high-dimensional datasets (e.g. EEG/MEG datasets with large number of channels). We use multilayer perceptron (MLP) to explore linear and nonlinear causal dependencies among neural signals. MLP is a member of the artificial neural networks (ANNs) family, which has the “universal approximation” property (Bishop, 1995, Ripley, 1996). MLP has been used successfully for modeling mean profile of resting state EEG signals (Kawano et al., 2003, Nagashino et al., 2002). As a novel work, in this paper we use MLP for effective connectivity estimation. The performance of MLP like any other regression method may be deteriorated if the number of its input regressors (and so the number of its parameters) increases. In such a case, an appropriate input selection method should be used before MLP modeling. May et al. (2011) reviewed the practical input selection approaches for ANNs.

In this paper, to keep the advantage of detecting nonlinear structure of high-dimensional datasets, an information theoretic-based approach is used for input regressor selection. This approach is called βmRMR (β minimal-Redundancy-Maximal-Relevance) (Hejazi and Cai, 2009). βmRMR is a modified version of the well-known mRMR (Peng et al., 2005). It iteratively selects the input regressors that add maximum amount of new information about the output to the previously selected set of inputs. Since βmRMR is based on information theory, it can keep the advantage of MLPs for exploration of highly nonlinear structures. Moreover due to its careful design, it can yield reliable results even for very large set of candidate regressors. Then we combine βmRMR and MLP modeling to approximate time series of channels. Finally we combine βmRMR-MLP with GC concept to construct a new method called βmRMR-MLP-GC for estimating the direct total (linear and nonlinear) effective connectivity of high-dimensional datasets. We also use Time-Shifted surrogate data to evaluate the significance of the effective connectivity estimates. We applied the proposed method on both simulated datasets and experimental EEG data and compared its performance with three of the most widely used effective connectivity measures: GCI (Geweke, 1982); CGCI (Ding et al., 2007); and TE (Schreiber, 2000).

This paper is organized as follows: in Section 2, we review MLP, βmRMR input regressor selection, and Granger Causality Index. Then we combine these three tools to propose our method called βmRMR-MLP-GC. Then the simulation designs for evaluating the performance of our proposed method are described and our EEG data are introduced. In Section 3, we report the results of applying βmRMR-MLP-GC to the simulated datasets and EEG data. In Section 4, we discuss some results, approach some issues related to βmRMR-MLP-GC, and propose some future works. Finally, in Section 5 we conclude the paper.

Section snippets

Materials and methods

In this section, at first the theories behind multilayer perceptron (MLP), βmRMR input selection, and the Granger Causality Index are reviewed. Then by combining these three tools in a unified framework, βmRMR-MLP-GC method is proposed. Afterwards, the simulation designs for evaluating the performance of βmRMR-MLP-GC are described. Finally, our EEG data that are used to explore the activity propagation patterns of resting brain are introduced.

Results

In this section, we illustrate the results of applying βmRMR-MLP-GC to the simulated networks and EEG dataset. We used the Neural Network Toolbox of Matlab R2009b software for MLP modeling. Also, three of the most widely used effective connectivity measures were used in direct comparison with βmRMR-MLP-GC. Those methods were Conditional Granger Causality Index (CGCI) (Ding et al., 2007), Granger Causality Index (GCI) (Geweke, 1982), and Transfer Entropy (TE) (Schreiber, 2000). CGCI, GCI and TE

Discussion

In a novel approach called βmRMR-MLP-GC we analyzed brain effective connectivity by combining βmRMR regressor selection, MLP function approximator, and Granger Causality concept. βmRMR-MLP-GC had excellent simulation results in terms of both sensitivity and specificity values and often outperformed CGCI, GCI, and TE. It is noteworthy that the simulations of Section 2.5 were not designed to model the multichannel real EEG data. In fact, realistic modeling of the multi channel EEG is not a

Conclusion

In this paper we proposed a method called βmRMR-MLP-GC for the estimating direct total (linear and nonlinear) effective connectivity of high-dimensional neural datasets. We combined βmRMR regressor selection, MLP modeling and the concept of GC Index to introduce βmRMR-MLP-GC method. Applying βmRMR-MLP-GC to simulated networks produced sensitivity and specificity values higher than 95% simultaneously and its application to alpha frequency band of eyes-closed resting state EEG showed promising

Acknowledgments

This work was supported by the University of Tehran (Under Grant No. 8101079/1/02). The authors would like to thank Prof. Hossein Esteky, from School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran for letting us use his laboratory for EEG data acquisition. Also, we would like to appreciate Dr. Anahita Khorrami from ICSS for her kind assistance and valuable comments on EEG data acquisition.

References (72)

L.Y. Cao
Practical method for determining the minimum embedding dimension of a scalar time series
Physica D
(1997)
M. Chavez et al.
Statistical assessment of nonlinear causality: application to epileptic EEG signals
J Neurosci Methods
(2003)
Y. Chen et al.
Analyzing multiple nonlinear time series with extended Granger Causality
Phys Lett A
(2004)
O. David et al.
Evaluation of different measures of functional connectivity using a neural mass model
Neuroimage
(2004)
R.E. Greenblatt et al.
Connectivity measures applied to human brain electrophysiological data
J Neurosci Methods
(2012)
S. Guo et al.
Partial Granger Causality—eliminating exogenous inputs and latent variables
J Neurosci Methods
(2008)
M.I. Hejazi et al.
Input variable selection for water resources systems using a modified minimum redundancy maximum relevance (mMRMR) algorithm
Adv Water Resour
(2009)
A.A. Ioannides et al.
Do we need to consider non-linear information flow in corticomuscular interaction
Clin Neurophysiol
(2010)
S.H. Jin et al.
Linear and nonlinear information flow based on time-delayed mutual information method and its application to corticomuscular interaction
Clin Neurophysiol
(2010)
M. Kaminski et al.
Topographic analysis of coherence and propagation of EEG activity during sleep and wakefulness
Electroencephalogr Clin Neurophysiol
(1997)

D. Marinazzo et al.

Nonlinear connectivity by Granger Causality

Neuroimage

(2011)

M. Muskulus et al.

Functional similarities and distance properties

J Neurosci Methods

(2009)

E. Pereda et al.

Nonlinear multivariate analysis of neurophysiological signals

Prog Neurobiol

(2005)

P. Sauseng et al.

What does phase information of oscillatory brain activity tell us about cognitive processes?

Neurosci Biobehav Rev

(2008)

V. Sakkalis

Review of advanced techniques for the estimation of brain connectivity measured with EEG/MEG

Comput Biol Med

(2011)

T. Schreiber et al.

Surrogate time series

Physica D

(2000)

C.J. Stam

Nonlinear dynamical analysis of EEG and MEG: review of an emerging field

Clin Neurophysiol

(2005)

J. Theiler et al.

Testing for nonlinearity in time series: the method of surrogate data

Physica D

(1992)

V.A. Vakorin et al.

Confounding effects of indirect connections on causality estimation

Neurosci Methods

(2009)

V.A. Vakorin et al.

Exploring transient transfer entropy based on a group-wise ICA decomposition of EEG data

Neuroimage

(2010)

M. Wibral et al.

Transfer entropy in magnetoencephalographic data: quantifying information flow in cortical and cerebellar networks

Prog Biophys Mol Biol

(2011)

M. Wu et al.

A comparison of multivariate causality based measures of effective connectivity

Comput Biol Med

(2011)

M. Zervakis et al.

Intertrial coherence and causal interaction among independent EEG components

J Neurosci Methods

(2011)

L. Amini et al.

Comparison of five directed graph measures for identification of leading interictal epileptic regions

Physiol Meas

(2010)

L. Amini et al.

Directed differential connectivity graph of interictal epileptiform discharges

IEEE Trans Biomed Eng

(2010)

N. Ancona et al.

Radial basis function approach to nonlinear Granger Causality of time series

Phys Rev E

(2004)

C. Babiloni et al.

Sources of cortical rhythms in adults during physiological aging: a multicentric EEG study

Hum Brain Mapp

(2006)

D. Battaglia et al.

Dynamic effective connectivity of inter-areal brain circuits

PLoS Comput Biol

(2012)

C.M. Bishop

Neural networks for pattern recognition

(1995)

A.O. Diaconescu et al.

Aberrant effective connectivity in schizophrenia patients during appetitive conditioning

Front Hum Neurosci

(2011)

M. Ding et al.

Granger Causality: basic theory and application to neuroscience

K.T. Dolan et al.

Surrogate for nonlinear time series analysis

Phys Rev E

(2001)

S. Erlaa et al.

Multivariate autoregressive model with instantaneous effects to improve brain connectivity estimation

Inter J Bioelectromagnetism

(2009)

L. Faes et al.

Mutual nonlinear prediction as a tool to evaluate coupling strength and directionality in bivariate time series: comparison among different strategies based on k nearest neighbors

Phys Rev E

(2008)

L. Faes et al.

Extended causal modeling to assess Partial Directed Coherence in multiple time series with significant instantaneous interactions

Biol Cybern

(2010)

L. Faes et al.

Testing frequency-domain causality in multivariate time series

IEEE Trans Biomed Eng

(2010)

Cited by (24)

A self-organized recurrent neural network for estimating the effective connectivity and its application to EEG data
2019, Computers in Biology and Medicine
Citation Excerpt :
For example, Talebi et al. [14] proposed CREANN, which uses a multilayer perceptron neural network for estimating the effective connectivity. Khadem et al. [15] suggested the βmRMR-MLP-GC model to identify direct nonlinear effective connectivity using multilayer perceptron neural networks. This model involves the estimation of causal relations between EEG channels based on the concept of GC.
Effective connectivity is an important notion in neuroscience research, useful for detecting the interactions between regions of the brain.
Since we are dealing with a dynamic system, it seems that using a dynamic tool could effectively achieve better results. In this paper, a novel approach, called “Recurrent Neural Network - Neuron Growth Using Error Whiteness - Granger Causality” (RNN-NGUEW-GC) is proposed to estimate the effective connectivity. An RNN is used for predicting and modeling time series and multivariate signals. NGUEW is used to determine the optimum time lag with the help of an error whiteness criterion. When this criterion is not satisfied, the number of neurons in the network input is increased, producing an increase in the time lag. Accordingly, the network achieves a self-organized structure. Finally, causal effects are determined for linear and nonlinear models using the concept of Granger causality. Also, an indicator of the ‘‘intensity of causality’’ is defined to approximate the strength of the linear interactions based on the structure of the network and the weights of the connections.
RNN-NGUEW-GC had a major outcome in terms of both method accuracy on simulation data and prediction of epileptic seizures on the EEG dataset. The main advantages of this method in comparison with other methods of determining the effective connectivity are: 1) there is no need for physiological information; 2) it yields a self-organized network structure. In addition, the calculation of the appropriate time lag using NGUEW is another superiority of this method in comparison with multivariate auto-regressive models.
Nonlinear effective connectivity measure based on adaptive Neuro Fuzzy Inference System and Granger Causality
2018, NeuroImage
Citation Excerpt :
A limitation of the bivariate measures is that they do not distinguish between the direct and indirect connections. To overcome this limitation, another LGC measure based on the MVAR models has been proposed (Khadem and Hossein-Zadeh, 2014). However, this criterion is only suitable in detecting the linear causal links of the MVAR models.
Exploring brain networks is an essential step towards understanding functional organization of the brain, which needs characterization of linear and nonlinear connections based on measurements like EEG or MEG. Conventional measures of connectivity are mostly linear and bivariate. This paper proposes an effective connectivity measure called Adaptive Neuro-Fuzzy Inference System Granger Causality (ANFISGC). The proposed measure is based on the symplectic geometry embedding dimension, Adaptive Neuro-Fuzzy Inference System (ANFIS) predictor, and Granger Causality (GC). It is a powerful predictor that detects both linear and nonlinear causal information flow. It is not bivariate and thus can distinguish between direct and indirect connections. The performance of the proposed method is evaluated and compared with those of the Linear Granger Causality (LGC), Kernel Granger Causality (KGC), combination of Pairwise Granger Causality and Conditional Granger Causality (PwGC + CGC), Transfer Entropy (TE), and Phase Transfer Entropy (PTE) methods using simulated and experimental MEG data. Simulation results show that ANFISGC outperforms the other methods in detecting both linear and nonlinear connections and, by increasing the coupling strength between nodes, the value of ANFISGC increases. In the analysis of the time series of the brain sources of epilepsy patients obtained from the MEG inverse problem, the regions found by ANFISGC were more similar to the clinical findings than those found by the other methods.
Estimate the effective connectivity in multi-coupled neural mass model using particle swarm optimization
2017, Physica A: Statistical Mechanics and its Applications
Citation Excerpt :
Non-interventional methodologies, for example, electroencephalogram (EEG), magnetoencephalogram (MEG), or functional magnetic resonance imaging (fMRI) data have all been used to evaluate connectivity in epileptic patients. Among these imaging modalities, EEG captures high temporal resolution [6], and [7], and is thus a powerful tool for estimating temporal dynamics of brain connectivity and seizure electrophysiology [8–11], and [12]. Mathematical models especially network models make a contribution to better expound the relationship between brain electric activities varying from health to disease.
Assessment of the effective connectivity among different brain regions during seizure is a crucial problem in neuroscience today. As a consequence, a new model inversion framework of brain function imaging is introduced in this manuscript. This framework is based on approximating brain networks using a multi-coupled neural mass model (NMM). NMM describes the excitatory and inhibitory neural interactions, capturing the mechanisms involved in seizure initiation, evolution and termination. Particle swarm optimization method is used to estimate the effective connectivity variation (the parameters of NMM) and the epileptiform dynamics (the states of NMM) that cannot be directly measured using electrophysiological measurement alone. The estimated effective connectivity includes both the local connectivity parameters within a single region NMM and the remote connectivity parameters between multi-coupled NMMs. When the epileptiform activities are estimated, a proportional–integral controller outputs control signal so that the epileptiform spikes can be inhibited immediately. Numerical simulations are carried out to illustrate the effectiveness of the proposed framework. The framework and the results have a profound impact on the way we detect and treat epilepsy.
A novel MLP network implementation in CMOL technology
2014, Engineering Science and Technology, an International Journal
Citation Excerpt :
Recently, extensive research attempts have been carried out on artificial neural network (ANN) implementations and applications [5,11,16,19,28,42].
Hybrid CMOS/nanodevice technology is a well-known candidate to extend the exponential Moor-Law progress of microelectronics beyond the 10-nm frontier. This paper presents and evaluates a novel method for synaptic weights implementation of artificial neural networks in CMOL technology, a hybrid CMOS/nanodevice technology. In this novel method, the analog property of the I–V characteristic of the nanodevice is utilized to implement each neuromorphic synaptic weight. Each synaptic weight is also implemented by using one nanodevice instead of several nanodevices. Moreover, the proposed method is applied to the multilayer perceptron (MLP) network in CMOL technology. Our analysis shows that the power consumption and speed are effectively improved in the proposed method compared to other methods at the expense of a reasonable overhead defect tolerance.
Exploring brain effective connectivity of early MCI with GRU_GC model on resting-state fMRI
2024, Applied Neuropsychology:Adult
Kendall transfer entropy: a novel measure for estimating information transfer in complex systems
2023, Journal of Neural Engineering

View all citing articles on Scopus

View full text

Computational NeuroscienceEstimation of direct nonlinear effective connectivity using information theory and multilayer perceptron

Highlights

Abstract

Background

New method

Results

Comparison with existing method(s)

Conclusions

Introduction

Section snippets

Materials and methods

Results

Discussion

Conclusion

Acknowledgments

Physica D

J Neurosci Methods

Phys Lett A

Neuroimage

J Neurosci Methods

J Neurosci Methods

Adv Water Resour

Clin Neurophysiol

Clin Neurophysiol

Electroencephalogr Clin Neurophysiol

Neuroimage

J Neurosci Methods

Prog Neurobiol

Neurosci Biobehav Rev

Comput Biol Med

Physica D

Clin Neurophysiol

Physica D

Neurosci Methods

Neuroimage

Prog Biophys Mol Biol

Comput Biol Med

J Neurosci Methods

Comparison of five directed graph measures for identification of leading interictal epileptic regions

Physiol Meas

Directed differential connectivity graph of interictal epileptiform discharges

IEEE Trans Biomed Eng

Radial basis function approach to nonlinear Granger Causality of time series

Phys Rev E

Sources of cortical rhythms in adults during physiological aging: a multicentric EEG study

Hum Brain Mapp

Dynamic effective connectivity of inter-areal brain circuits

PLoS Comput Biol

Neural networks for pattern recognition

Aberrant effective connectivity in schizophrenia patients during appetitive conditioning

Front Hum Neurosci

Granger Causality: basic theory and application to neuroscience

Surrogate for nonlinear time series analysis

Phys Rev E

Multivariate autoregressive model with instantaneous effects to improve brain connectivity estimation

Inter J Bioelectromagnetism

Mutual nonlinear prediction as a tool to evaluate coupling strength and directionality in bivariate time series: comparison among different strategies based on k nearest neighbors

Phys Rev E

Extended causal modeling to assess Partial Directed Coherence in multiple time series with significant instantaneous interactions

Biol Cybern

Testing frequency-domain causality in multivariate time series

IEEE Trans Biomed Eng

Computational Neuroscience
Estimation of direct nonlinear effective connectivity using information theory and multilayer perceptron