Using echo state networks for classification: A case study in Parkinson's disease diagnosis

https://doi.org/10.1016/j.artmed.2018.02.002Get rights and content

Highlights

Abstract

Despite having notable advantages over established machine learning methods for time series analysis, reservoir computing methods, such as echo state networks (ESNs), have yet to be widely used for practical data mining applications. In this paper, we address this deficit with a case study that demonstrates how ESNs can be trained to predict disease labels when stimulated with movement data. Since there has been relatively little prior research into using ESNs for classification, we also consider a number of different approaches for realising input–output mappings. Our results show that ESNs can carry out effective classification and are competitive with existing approaches that have significantly longer training times, in addition to performing similarly with models employing conventional feature extraction strategies that require expert domain knowledge. This suggests that ESNs may prove beneficial in situations where predictive models must be trained rapidly and without the benefit of domain knowledge, for example on high-dimensional data produced by wearable medical technologies. This application area is emphasized with a case study of Parkinson's disease patients who have been recorded by wearable sensors while performing basic movement tasks.

Introduction

Reservoir computing is a general approach for modelling complex dynamical systems using a large recurrent neural network (RNN), with only the network output weights being trained [1]. Echo state networks (ESNs) are a well known implementation where the output connections are fitted using simple ordinary least-squares regression [2]. Owing to this, ESNs are significantly faster and more scalable than many existing more complex machine learning approaches and are ideally suited for time series analysis. Nevertheless, despite this significant advantage, they have yet to be widely used in data mining applications.

Many medical applications have a need for predictive models that can capture the complexity of biological disease pathways to facilitate personalised healthcare. A good example is the Parkinson's disease (PD) case study considered in this paper. PD is a debilitating progressive neurodegenerative disease that presents with a broad spectrum of movement disorders, which even expert clinicians can find challenging to characterise and discriminate from other related diseases [3]. Wearable sensors can provide significant benefits to patient care by objectively measuring movement disorders in high resolution and therefore help monitor disease progress and their use is becoming increasingly widespread [4].

ESNs, with their ability to model dynamical processes, would seem like a sensible candidate for modelling such data and provide two primary benefits. The first is that they can directly model the raw time series to identify any patterns in the underlying dynamics of the signal that conventional feature extraction techniques may miss. Their second advantage is their rapid training speed, resulting from having a closed form solution. This is an important consideration for applied predictive modelling, owing to the need to train and evaluate candidate models on a range of data sets when performing model selection and evaluation.

In this paper, we consider how ESNs can be applied to the problem of diagnosing PD from movement data of the kind that might be collected using wearable accelerometers. Since there has been little existing work in this area, we focus on exploring the key issue of how inputs and outputs can be mapped to the ESN methodology, and how this affects the predictive accuracy of the model. One aspect that is investigated is whether to segment the data before inputting into the model. This has ramifications for subsequent work on analysis of data recorded from wearable sensors by facilitating simpler processing and analysis at the cost of adding more design time [5]. To evaluate the practicality of the resulting network, a two-fold comparison is performed. First, ESN classifiers are compared to models built on summary features derived using the guidance of an expert in movement disorders, in order to establish whether ESNs offer a more flexible alternative without compromising on accuracy. The second comparison is against previous attempts on this data set, highlighting the ability of ESNs to rapidly fit an accurate model comparable with those produced from complex optimisation routines requiring significantly longer computational time.

The rest of the paper is organised as follows: Section 2 provides details of ESNs and previous applications to both classification tasks and medical problems in general, and Section 3 details the data collection process of the Parkinson's disease movement data. The experimental methodology is laid out in Section 4, while Section 5 presents the results. Finally, Section 6 concludes.

Section snippets

Background

In recent years, a new and increasingly well researched dynamical systems approach to modelling complex time series has been developed, termed reservoir computing. As the name implies, the model is focused around what is known as a reservoir: a coupled system of non-linear functional elements in which dynamical behaviour can be modelled. Data is passed directly into the reservoir through a set of input nodes, while the output at each time step is determined by a linear readout. The functional

Test subjects

Two separate studies were run at Leeds General Infirmary (LGI) in Leeds, United Kingdom, to record both patients and control subjects while performing various physical movement tests. Local ethical approval was granted for both studies, with details of the centres and the number of recordings at each being found in Table 1. One of the tasks being recorded—finger tapping—is the source of the data used in this study. For ethical reasons, patients were observed while medicated, thereby increasing

Methodology

The empirical investigation described in this paper has two principal aims: to determine the optimal configuration of ESN parameters for classification problems, and to establish whether directly modelling the separation waveform can prove as accurate as conventional models of summary features extracted with domain knowledge. This section details the experimental methodology used to explore these goals.

ESN configuration

As highlighted in Table 2, there were two factors related to ESN configuration for classification under investigation: whether to pass the separation waveform into the network sample-by-sample or segmented into tap cycles, and how to aggregate the extended states from each time step to form a single predicted output. Two possibilities were investigated for each of these choices: mean and last state aggregation functions and single and taps data processing method. The results of these

Conclusions

The results of this empirical investigation have shown that the use of ESNs to model positional data is a viable technique for medical diagnosis of a movement disorder, with accuracies comparable to previous approaches that rely upon more complex training algorithms such as EAs. The primary advantage of ESNs is their rapid training time, which proves most advantageous when performing model selection over a variety of hyper-parameters. In addition, while they are not are as accurate as models

References (31)

  • J. Jankovic

    Parkinson's disease: clinical features and diagnosis

    J Neurol Neurosurg Psychiatry

    (2008)
  • S.C. Mukhopadhyay

    Wearable sensors for human activity monitoring: a review

    IEEE Sens J

    (2015)
  • M.A. Lones et al.

    Going through directional changes: evolving human movement classifiers using an event based encoding

  • W. Maass et al.

    Real-time computing without stable states: a new framework for neural computation based on perturbations

    Neural Comput

    (2002)
  • D. Prokhorov

    Echo state networks: appeal and challenges

  • Cited by (42)

    • Parkinson's disease diagnosis using neural networks: Survey and comprehensive evaluation

      2022, Information Processing and Management
      Citation Excerpt :

      Shin et al. used the YoloV3 (Redmon & Farhadi, 2018) network for assessing nigrostriatal degeneration. Lacy, Smith, and Lones (2018) studied the efficacy of echo state networks (ESN) in PD diagnosis. The ESNs work on time-series movement signals.

    • Echo-state networks for soft sensor design in an SRU process

      2021, Information Sciences
      Citation Excerpt :

      Applications to a synthetic NARX system and the Mackey-Glass chaotic system were also reported. Additional applications that have been investigated relate to time-series prediction in financial markets [41] and the medical field to classify the actions of Parkinson’s patients [19]. However, the use of reservoir-based methods in the industrial field for SS design is still minimal.

    View all citing articles on Scopus
    View full text