Toward enhanced P300 speller performance

https://doi.org/10.1016/j.jneumeth.2007.07.017Get rights and content

Abstract

This study examines the effects of expanding the classical P300 feature space on the classification performance of data collected from a P300 speller paradigm [Farwell LA, Donchin E. Talking off the top of your head: toward a mental prosthesis utilizing event-related brain potentials. Electroenceph Clin Neurophysiol 1988;70:510–23]. Using stepwise linear discriminant analysis (SWLDA) to construct a classifier, the effects of spatial channel selection, channel referencing, data decimation, and maximum number of model features are compared with the intent of establishing a baseline not only for the SWLDA classifier, but for related P300 speller classification methods in general. By supplementing the classical P300 recording locations with posterior locations, online classification performance of P300 speller responses can be significantly improved using SWLDA and the favorable parameters derived from the offline comparative analysis.

Introduction

A brain–computer interface (BCI) is a device that uses brain signals to provide a non-muscular communication channel (Wolpaw et al., 2002), particularly for individuals with severe neuromuscular disabilities. The P300-event related potential is an evoked response to an external stimulus that is observed in scalp-recorded electroencephalography (EEG). The P300 response has proven to be a reliable signal for controlling a BCI (Farwell and Donchin, 1988). Farwell and Donchin (1988) describe the P300 speller, which presents a selection of characters arranged in a 6 × 6 matrix. The user focuses attention on one of the 36 character cells of the matrix while each row and column of the matrix is intensified in a random sequence. The row and column intensifications that intersect at the attended cell represent the target stimuli, which occur with a probability of 1/6. The rare presentation of the target stimuli in the random sequence of stimuli constitutes an Oddball Paradigm (Fabiani et al., 1987) and will elicit a P300 response to the target stimuli. With proper P300 feature selection and classification, the attended character of the matrix can be identified and communicated.

A variety of feature extraction and classification procedures such as stepwise linear discriminate analysis (SWLDA) (Donchin et al., 2000, Sellers and Donchin, 2006), wavelets (Bostanov, 2004), support vector machines (Kaper et al., 2004, Meinicke et al., 2002, Thulasidas et al., 2006), and matched filtering (Serby et al., 2005) have been implemented, improving the performance beyond that originally reported in (Farwell and Donchin, 1988). Based on multiple studies in healthy volunteers (Donchin et al., 2000, Sellers and Donchin, 2006, Serby et al., 2005), and initial studies in persons with physical disability (Vaughan et al., 2006), the P300 speller has potential to serve as an effective communication device for persons who have lost or are losing the ability to write and speak. An individual with advanced-stage ALS reported the P300 speller to be superior and preferential to his modern eye-gaze system and uses the BCI 4–6 h/day for e-mail and other computer applications (Vaughan et al., 2006). Initial reports from other disabled individuals currently testing the P300 speller BCI also indicate that its speed, accuracy, and ease of use are superior or competitive with other assistive technologies.

Up to the present, BCI-related P300 research has focused almost exclusively on signals from standard P300 scalp locations (i.e., Fz, Cz, Pz). While recent offline evaluations suggest that the use of additional locations, particularly posterior sites, may improve classification accuracy (BCI, 2003, BCI, 2005, Blankertz et al., 2004, Blankertz et al., 2006, Kaper et al., 2004, Spencer et al., 2001, Vaughan et al., 2003), this possibility has not been formally addressed in comprehensive offline and online studies.

To address this possibility, the present study explores the value of incorporating information from electrode locations that are not traditionally associated with the P300 response. In addition, several data preprocessing and model parameters are evaluated to assess the relative effects with respect to the new spatial information. Using a SWLDA classifier, both offline and online results obtained from 64-channel data show that some of the most discriminable EEG features evoked by the P300 speller occur at posterior electrodes (namely PO7, PO8, Oz), and that these features can significantly improve classification performance when used in conjunction with the classical P300 feature space (i.e., EEG features at electrodes Fz, Cz, Pz (Sharbrough et al., 1991)). The results are also relevant to current speculations concerning the nature of the neural processes underlying P300-based BCI operation and the question of their dependence on gaze direction.

Section snippets

Participants

Seven able-bodied people were the participants in this study. The demographics and previous speller matrix experience of the participants are listed in Table 1. The participants varied in their previous BCI experience, but all participants had either no experience or relatively few sessions with a P300-based BCI system. The study was approved by the New York State Department of Health Institutional Review Board, and each participant gave informed consent.

Task, procedure, and design

The participant sat upright in front of

Stepwise linear discriminant analysis

Determining the presence or absence of a P300 evoked potential from EEG features can be considered a binary classification problem with a decision hyper-plane defined by:wxb=0where x is the feature vector as described in Section 2.2, w a vector of feature weights, and b is the bias term. However, because it is assumed that a P300 is elicited for one of the six row/column intensifications, and that the initial results indicate that the P300 response is invariant to row/column stimuli, the

Analysis protocol

In the previous work on SWLDA for classifying P300 responses (BCI, 2003, Farwell and Donchin, 1988, Sellers and Donchin, 2006), only channels Fz, Cz, and Pz were used for analysis. However, the posterior response seems to provide significant additional discriminative information for the P300 speller (BCI, 2003, BCI, 2005, Blankertz et al., 2004, Blankertz et al., 2006, Kaper et al., 2004, Spencer et al., 2001, Vaughan et al., 2003). Thus far, neither the temporal attributes of this posterior

Offline evaluation

The four factors of primary interest were examined separately offline. Fig. 3 illustrates the fundamental effects of channel set, reference, decimation, and maximum features, respectively. Because all combinations of factors were initially evaluated, factors that are clearly superior or show consistent performance across conditions are fixed for presentation purposes. For example, the factors of decimation and maximum features do not result in significantly different performance for any of the

Discussion

It is evident from the results that an essential factor in optimizing the performance of the P300 classifier is the identification of an appropriate channel set for the individual. Channel set was the only factor that yielded a significant statistical effect on classification rates; however, the analyses of the other factors have provided important information. The analyses conducted here suggest that, on average, a decimation of 12, ear reference, maximum features of at least 45, and either

Acknowledgements

This work was supported in part by the National Institutes of Health under Grant NICHD HD30146 and Grant NIBIB/NINDS EB00856 and in part by the James S. McDonnell Foundation.

References (26)

  • E. Donchin et al.

    The mental prosthesis: assessing the speed of a P300-based brain–computer interface

    IEEE Trans Rehabil Eng

    (2000)
  • N. Draper et al.

    Applied regression analysis

    (1981)
  • M. Fabiani et al.

    Definition, identification, and reliability of measurement of the P300 component of the event-related brain potential

    Adv Psychophysiol

    (1987)
  • Cited by (623)

    • A new benchmark dataset for P300 ERP-based BCI applications

      2023, Digital Signal Processing: A Review Journal
    View all citing articles on Scopus
    View full text