Computational Neuroscience
Estimation of direct nonlinear effective connectivity using information theory and multilayer perceptron

https://doi.org/10.1016/j.jneumeth.2014.04.008Get rights and content

Highlights

  • The direct nonlinear effective connectivity of high-dimensional datasets is estimated.

  • A combination of regressor selection, MLP modeling and Granger Causality is proposed.

  • βmRMR-MLP-GC can deal with highly nonlinear, high-dimensional datasets.

  • In simulations, βmRMR-MLP-GC yields both high sensitivity and specificity.

  • βmRMR-MLP-GC detects Back-to-Front alpha information flows in resting brain.

Abstract

Background

Despite the variety of effective connectivity measures, few methods can quantify direct nonlinear causal couplings and most of them are not applicable to high-dimensional datasets.

New method

In this paper, a novel approach (called βmRMR-MLP-GC) is proposed to estimate direct nonlinear effective connectivity of high-dimensional datasets. βmRMR is used to select a suitable subset of candidate regressors for approximating each neural (here EEG) signal. The multilayer perceptron (MLP) is used for multivariate characterization of EEG signals while the optimum MLP structure is selected using an iterative cross-validation scheme. Finally a causality measure is defined based on Granger Causality (GC) concept to quantify the casual relations among EEG channels.

Results

Applying βmRMR-MLP-GC to high-dimensional simulated datasets with different linear and nonlinear structures yields sensitivity and specificity values higher than 95%. Also, applying it to eyes-closed resting state EEG of six normal subjects in the alpha frequency band yields significant net activity propagations from the posterior to anterior brain regions. This is in accordance with the most previous studies in this field.

Comparison with existing method(s)

βmRMR-MLP-GC is compared with Granger Causality Index, Conditional Granger Causality Index, and Transfer Entropy. It outperforms these methods in terms of sensitivity and specificity in simulated datasets. Also, βmRMR-MLP-GC detects the most number of significant and reproducible Back-to-Front net information flows among the specified brain regions and highlights the posterior brain regions as dominant source of alpha activity propagation.

Conclusions

βmRMR-MLP-GC provides a novel tool to estimate the direct nonlinear causal networks of high-dimensional datasets.

Introduction

The human brain is definitely one of the most complex natural systems in the world. Despite lots of guided studies for discovering its functional organization, there are still many unknowns about it. Founded on the “brain specialization concept” most studies aimed at exploring the brain regions specialized for particular brain tasks in the past years (Jirsa and McIntosh, 2007). The emergence of “brain integration concept” directed many researches in recent years toward the brain connectivity (Jirsa and McIntosh, 2007). Brain connectivity is a broad concept that can be generally divided into three categories: Structural, Functional and Effective connectivity. The structural connectivity refers to the structural connections of brain regions via nerve fibers. The functional connectivity deals with the temporal interdependencies among the activity of brain regions. The effective connectivity characterizes the causal (directed) effects among brain regions. Reviews of the most commonly used functional and effective connectivity measures can be found in Greenblatt et al. (2012), Sakkalis (2011), Pereda et al. (2005), and Muskulus et al. (2009).

Exploring the effective connectivity networks helps neurologists to investigate the changes of brain causal networks due to the brain disorders such as autism (Wicker et al., 2008), Alzheimer (Liu et al., 2012), schizophrenia (Diaconescu et al., 2011), epilepsy (Amini et al., 2010a, Amini et al., 2010b). This helps the physicians to find effective treatments for the brain disorders.

Among the functional brain imaging modalities, EEG and MEG are capable of capturing the temporal dynamics of cortical connectivity owing to their high temporal resolution (He et al., 2011). Consequently, they are popular modalities for functional/effective connectivity estimation, despite their limitations in terms of spatial resolution and volume conduction effects (Schoffelen and Gross, 2009).

According to Wiener Causality concept (Wiener, 1956), if adding the past and present information of (system) X to the past and present information of (system) Y improves predicting the future of (system) Y, X is the cause of Y. Granger (1969) limited the very general definition of Wiener Causality (Wiener, 1956) to linear bivariate autoregressive models and proposed a mathematical formulation to infer the Wiener Causality quantitatively (Granger, 1969). The terms Wiener causality (WC) and Granger Causality (GC) are usually used interchangeably. Geweke (1982) proposed the most practical GC-based effective connectivity measure, which is usually known as Granger Causality Index (GCI) (Geweke, 1982). To distinguish between direct and indirect causal links, a variety of multivariate linear GC-based measures have been developed, for example Conditional Granger Causality Index (CGCI) in the time domain (Ding et al., 2007), and DTF, PDC, dDTF and ffDTF in the frequency domain (Kaminski and Liang, 2005, Wu et al., 2011). They are all based on linear Multivariate Auto-Regressive (MVAR) models.

It is widely assumed that the interactions among neuronal populations are nonlinear (Marinazzo et al., 2011, Pereda et al., 2005, Ioannides and Mitsis, 2010, Stam, 2005; Gourévitch et al., 2006). Consequently, using the linear connectivity measures may oversimplify the functional organization of brain and even lead to incorrect estimation of causal relations. A few multivariate and nonlinear effective connectivity measures have been proposed based on GC in the literature. Some of them are parametric and model-based like the kernel-based nonlinear Granger Causality measures (Marinazzo et al., 2008, Guo et al., 2008) and locally linear MVAR models on the reconstructed attractor space (Chen et al., 2004) whereas the others are nonparametric. Some of those nonparametric methods are partial entropy with non-uniform embedding (Faes et al., 2011), and Partial Transfer Entropy (PTE) (Gomez-Herrero, 2010, Vakorin et al., 2009) which is an extension of well-known Transfer Entropy (TE) measure (Schreiber, 2000). Some other nonparametric methods are based on the Correlation Integral (CI) (Zhidong et al., 2010, Gourévitch et al., 2006).

An ideal effective connectivity measure has three characteristics: (1) It uses no restrictive model, therefore it can explore both linear and nonlinear causal connections. (2) It is multivariate in order to distinguish the direct connections from indirect ones. It has been shown that bivariate measures may yield misleading results on multivariate data (Kus et al., 2004). (3) It must be practically applicable to high-dimensional datasets (e.g. EEG/MEG with high number of channels), and to systems with long memory or large coupling delays.

Despite the existence of many effective connectivity measures, very few measures have all three features above. Due to the lack of a systematic dimensionality reduction stage, none of the multivariate nonlinear measures mentioned above are applicable to high-dimensional datasets except the method proposed by Marinazzo et al. (2008) and may be the one proposed by Faes et al. (2011). Both these measures use some kinds of dimensionality reduction techniques. The nonparametric nonlinear measures (in fact, information theoretic-based and CI-based measures) need the estimation of multivariate probability density functions. Consequently, they are not applicable to high-dimensional datasets since their required data samples exponentially grow with the number of variables. As a result, PTE (Gomez-Herrero, 2010, Vakorin et al., 2009) and CI-based methods (Zhidong et al., 2010, Gourévitch et al., 2006) are not applicable to high-dimensional datasets. Even the method proposed by Faes et al., 2011 may produce inaccurate results if the number of selected partial terms in the partial entropy becomes slightly big. Also in the methods proposed by Chen et al. (2004) and Guo et al. (2008) the number of modeling parameters is proportional to the square of number of dimensions so they do not have the third desired feature of an ideal effective connectivity measure, too.

In this paper, we propose a new approach to estimate the total (linear and nonlinear) effective connectivity network of high-dimensional datasets (e.g. EEG/MEG datasets with large number of channels). We use multilayer perceptron (MLP) to explore linear and nonlinear causal dependencies among neural signals. MLP is a member of the artificial neural networks (ANNs) family, which has the “universal approximation” property (Bishop, 1995, Ripley, 1996). MLP has been used successfully for modeling mean profile of resting state EEG signals (Kawano et al., 2003, Nagashino et al., 2002). As a novel work, in this paper we use MLP for effective connectivity estimation. The performance of MLP like any other regression method may be deteriorated if the number of its input regressors (and so the number of its parameters) increases. In such a case, an appropriate input selection method should be used before MLP modeling. May et al. (2011) reviewed the practical input selection approaches for ANNs.

In this paper, to keep the advantage of detecting nonlinear structure of high-dimensional datasets, an information theoretic-based approach is used for input regressor selection. This approach is called βmRMR (β minimal-Redundancy-Maximal-Relevance) (Hejazi and Cai, 2009). βmRMR is a modified version of the well-known mRMR (Peng et al., 2005). It iteratively selects the input regressors that add maximum amount of new information about the output to the previously selected set of inputs. Since βmRMR is based on information theory, it can keep the advantage of MLPs for exploration of highly nonlinear structures. Moreover due to its careful design, it can yield reliable results even for very large set of candidate regressors. Then we combine βmRMR and MLP modeling to approximate time series of channels. Finally we combine βmRMR-MLP with GC concept to construct a new method called βmRMR-MLP-GC for estimating the direct total (linear and nonlinear) effective connectivity of high-dimensional datasets. We also use Time-Shifted surrogate data to evaluate the significance of the effective connectivity estimates. We applied the proposed method on both simulated datasets and experimental EEG data and compared its performance with three of the most widely used effective connectivity measures: GCI (Geweke, 1982); CGCI (Ding et al., 2007); and TE (Schreiber, 2000).

This paper is organized as follows: in Section 2, we review MLP, βmRMR input regressor selection, and Granger Causality Index. Then we combine these three tools to propose our method called βmRMR-MLP-GC. Then the simulation designs for evaluating the performance of our proposed method are described and our EEG data are introduced. In Section 3, we report the results of applying βmRMR-MLP-GC to the simulated datasets and EEG data. In Section 4, we discuss some results, approach some issues related to βmRMR-MLP-GC, and propose some future works. Finally, in Section 5 we conclude the paper.

Section snippets

Materials and methods

In this section, at first the theories behind multilayer perceptron (MLP), βmRMR input selection, and the Granger Causality Index are reviewed. Then by combining these three tools in a unified framework, βmRMR-MLP-GC method is proposed. Afterwards, the simulation designs for evaluating the performance of βmRMR-MLP-GC are described. Finally, our EEG data that are used to explore the activity propagation patterns of resting brain are introduced.

Results

In this section, we illustrate the results of applying βmRMR-MLP-GC to the simulated networks and EEG dataset. We used the Neural Network Toolbox of Matlab R2009b software for MLP modeling. Also, three of the most widely used effective connectivity measures were used in direct comparison with βmRMR-MLP-GC. Those methods were Conditional Granger Causality Index (CGCI) (Ding et al., 2007), Granger Causality Index (GCI) (Geweke, 1982), and Transfer Entropy (TE) (Schreiber, 2000). CGCI, GCI and TE

Discussion

In a novel approach called βmRMR-MLP-GC we analyzed brain effective connectivity by combining βmRMR regressor selection, MLP function approximator, and Granger Causality concept. βmRMR-MLP-GC had excellent simulation results in terms of both sensitivity and specificity values and often outperformed CGCI, GCI, and TE. It is noteworthy that the simulations of Section 2.5 were not designed to model the multichannel real EEG data. In fact, realistic modeling of the multi channel EEG is not a

Conclusion

In this paper we proposed a method called βmRMR-MLP-GC for the estimating direct total (linear and nonlinear) effective connectivity of high-dimensional neural datasets. We combined βmRMR regressor selection, MLP modeling and the concept of GC Index to introduce βmRMR-MLP-GC method. Applying βmRMR-MLP-GC to simulated networks produced sensitivity and specificity values higher than 95% simultaneously and its application to alpha frequency band of eyes-closed resting state EEG showed promising

Acknowledgments

This work was supported by the University of Tehran (Under Grant No. 8101079/1/02). The authors would like to thank Prof. Hossein Esteky, from School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran for letting us use his laboratory for EEG data acquisition. Also, we would like to appreciate Dr. Anahita Khorrami from ICSS for her kind assistance and valuable comments on EEG data acquisition.

References (72)

  • D. Marinazzo et al.

    Nonlinear connectivity by Granger Causality

    Neuroimage

    (2011)
  • M. Muskulus et al.

    Functional similarities and distance properties

    J Neurosci Methods

    (2009)
  • E. Pereda et al.

    Nonlinear multivariate analysis of neurophysiological signals

    Prog Neurobiol

    (2005)
  • P. Sauseng et al.

    What does phase information of oscillatory brain activity tell us about cognitive processes?

    Neurosci Biobehav Rev

    (2008)
  • V. Sakkalis

    Review of advanced techniques for the estimation of brain connectivity measured with EEG/MEG

    Comput Biol Med

    (2011)
  • T. Schreiber et al.

    Surrogate time series

    Physica D

    (2000)
  • C.J. Stam

    Nonlinear dynamical analysis of EEG and MEG: review of an emerging field

    Clin Neurophysiol

    (2005)
  • J. Theiler et al.

    Testing for nonlinearity in time series: the method of surrogate data

    Physica D

    (1992)
  • V.A. Vakorin et al.

    Confounding effects of indirect connections on causality estimation

    Neurosci Methods

    (2009)
  • V.A. Vakorin et al.

    Exploring transient transfer entropy based on a group-wise ICA decomposition of EEG data

    Neuroimage

    (2010)
  • M. Wibral et al.

    Transfer entropy in magnetoencephalographic data: quantifying information flow in cortical and cerebellar networks

    Prog Biophys Mol Biol

    (2011)
  • M. Wu et al.

    A comparison of multivariate causality based measures of effective connectivity

    Comput Biol Med

    (2011)
  • M. Zervakis et al.

    Intertrial coherence and causal interaction among independent EEG components

    J Neurosci Methods

    (2011)
  • L. Amini et al.

    Comparison of five directed graph measures for identification of leading interictal epileptic regions

    Physiol Meas

    (2010)
  • L. Amini et al.

    Directed differential connectivity graph of interictal epileptiform discharges

    IEEE Trans Biomed Eng

    (2010)
  • N. Ancona et al.

    Radial basis function approach to nonlinear Granger Causality of time series

    Phys Rev E

    (2004)
  • C. Babiloni et al.

    Sources of cortical rhythms in adults during physiological aging: a multicentric EEG study

    Hum Brain Mapp

    (2006)
  • D. Battaglia et al.

    Dynamic effective connectivity of inter-areal brain circuits

    PLoS Comput Biol

    (2012)
  • C.M. Bishop

    Neural networks for pattern recognition

    (1995)
  • A.O. Diaconescu et al.

    Aberrant effective connectivity in schizophrenia patients during appetitive conditioning

    Front Hum Neurosci

    (2011)
  • M. Ding et al.

    Granger Causality: basic theory and application to neuroscience

  • K.T. Dolan et al.

    Surrogate for nonlinear time series analysis

    Phys Rev E

    (2001)
  • S. Erlaa et al.

    Multivariate autoregressive model with instantaneous effects to improve brain connectivity estimation

    Inter J Bioelectromagnetism

    (2009)
  • L. Faes et al.

    Mutual nonlinear prediction as a tool to evaluate coupling strength and directionality in bivariate time series: comparison among different strategies based on k nearest neighbors

    Phys Rev E

    (2008)
  • L. Faes et al.

    Extended causal modeling to assess Partial Directed Coherence in multiple time series with significant instantaneous interactions

    Biol Cybern

    (2010)
  • L. Faes et al.

    Testing frequency-domain causality in multivariate time series

    IEEE Trans Biomed Eng

    (2010)
  • Cited by (24)

    • A self-organized recurrent neural network for estimating the effective connectivity and its application to EEG data

      2019, Computers in Biology and Medicine
      Citation Excerpt :

      For example, Talebi et al. [14] proposed CREANN, which uses a multilayer perceptron neural network for estimating the effective connectivity. Khadem et al. [15] suggested the βmRMR-MLP-GC model to identify direct nonlinear effective connectivity using multilayer perceptron neural networks. This model involves the estimation of causal relations between EEG channels based on the concept of GC.

    • Nonlinear effective connectivity measure based on adaptive Neuro Fuzzy Inference System and Granger Causality

      2018, NeuroImage
      Citation Excerpt :

      A limitation of the bivariate measures is that they do not distinguish between the direct and indirect connections. To overcome this limitation, another LGC measure based on the MVAR models has been proposed (Khadem and Hossein-Zadeh, 2014). However, this criterion is only suitable in detecting the linear causal links of the MVAR models.

    • Estimate the effective connectivity in multi-coupled neural mass model using particle swarm optimization

      2017, Physica A: Statistical Mechanics and its Applications
      Citation Excerpt :

      Non-interventional methodologies, for example, electroencephalogram (EEG), magnetoencephalogram (MEG), or functional magnetic resonance imaging (fMRI) data have all been used to evaluate connectivity in epileptic patients. Among these imaging modalities, EEG captures high temporal resolution [6], and [7], and is thus a powerful tool for estimating temporal dynamics of brain connectivity and seizure electrophysiology [8–11], and [12]. Mathematical models especially network models make a contribution to better expound the relationship between brain electric activities varying from health to disease.

    • A novel MLP network implementation in CMOL technology

      2014, Engineering Science and Technology, an International Journal
      Citation Excerpt :

      Recently, extensive research attempts have been carried out on artificial neural network (ANN) implementations and applications [5,11,16,19,28,42].

    View all citing articles on Scopus
    View full text