Introduction

The issue of whether language processing utilizes a verbal working memory system that is employed in other verbally mediated tasks (e.g., Fedorenko, Gibson, & Rohde, 2007; King & Just, 1991) or, instead, relies on a special set of resources dedicated specifically to language processing (e.g., Caplan & Waters, 1999) is unresolved. Fedorenko et al. examined the domain specificity of the working memory system responsible for one aspect of language—syntactic processing—in a unique series of self-paced dual-task experiments. In one set of experiments, subjects performed reading and arithmetic tasks simultaneously, while in another the reading task was paired with spatial tasks. In both series of experiments, the difficulty of each task was varied in 2 × 2 designs. Sentences containing subject-extracted relative clauses (SRCs) or object-extracted relative clauses (ORCs) were presented phrase by phrase in self-paced reading, with an arithmetic or spatial computation appearing simultaneously with each phrase (Fig. 1). Fedorenko et al. found a superadditive interaction when the sentence and arithmetic tasks were combined, with response times in the hard-sentence/hard-math condition being disproportionately long when the relative clause was viewed. In contrast, they did not find a superadditive interaction when they paired the sentence task with a spatial integration task. On the basis of additive factors logic (Sternberg, 1969), they argued that reading and arithmetical operations rely on a common pool of working memory resources, while reading and spatial processing rely on separate pools. Fedorenko et al. argued that they obtained a superadditive interaction between syntactic and arithmetical complexity because both tasks required both retrieval and integration of information, unlike in previous dual-task studies in which the concurrent memory task (immediate serial recall) required only retrieval (Caplan & Waters, 1999; Waters, Caplan, & Rochon, 1995).

Fig. 1
figure 1

Depiction of self-paced dual-task presentation from Fedorenko et al. (2007), Experiment 1

The present study constitutes an effort to replicate Fedorenko et al. (2007) results (Experiment 1) and to examine the effect of combining sentence and arithmetic processing more closely by measuring eye fixations (Experiment 2), which potentially provide a more fine-grained measure of the different aspects of processing involved in this type of task.

Experiment 1

Method

Subjects

Fifty subjects were recruited from Boston University (BU). All were native speakers of English between the ages of 18 and 30 years, possessed normal/corrected-to-normal vision, and had no history of neurological injury or impairment.

Materials and procedure

The materials and procedures were identical to those in Experiment 1 in Fedorenko et al. (2007). The experiment was presented using a self-paced reading, phrase-by-phrase design with a moving-windows display. SRCs (“The janitor/who frustrated the plumber/lost the key/on the street.”) and ORCs (“The janitor/who the plumber frustrated/lost the key/on the street.”) sentences were displayed region by region, with a number presented simultaneously directly above each region. In the easy arithmetic condition, the first term was between 1 and 10, and successive addends were between 1 and 3. In the hard arithmetic condition, the first term was between 11 and 20, and successive addends were between 4 and 6. Subjects were required to answer two questions about the sentence and to compute the sum of the series of numbers. Testing took place on a Macintosh laptop computer using Linger (http://tedlab.mit.edu/~dr/Linger/) and used the Linger script written by Fedorenko and colleagues. Buttonpress reaction times were recorded for each sentence region.

Results

Accuracy and reaction time data were analyzed using mixed logistic regression models (Jaeger, 2008) and linear mixed effects models (Baayen, Davidson, & Bates, 2008), respectively, both with crossed random subject and item effects. An initial model that included fixed effects of math and syntax, a math × syntax interaction, and random subject and item slopes was compared with models in which the fixed effects were sequentially removed on the basis of significance level and log likelihood comparisons. For the sake of brevity, only the most highly valued model is reported for each analysis.

Accuracy

Subjects were 90% accurate on arithmetic questions and 78% accurate on comprehension questions, overall (Table 1). There were main effects of arithmetic complexity on math question accuracy, such that harder arithmetic resulted in less accurate responses, and of syntax on comprehension question accuracy, with less accurate responses in the ORC condition (Table 2). There were no interaction effects. The arithmetic and comprehension question accuracy from Experiment 1 was also compared with the original Experiment 1 data from Fedorenko et al. (2007) (Table 1; Supplementary Material: Table S1), and the groups did not significantly differ.

Table 1 Present Experiment 1 question accuracy compared with results of Experiment 1 in Fedorenko et al. (2007): Mean accuracy by condition (with standard errors in parentheses)
Table 2 Mixed logistic regression analysis of question accuracy in Experiment 1: Effects of syntax (reference level: SRC) and math (reference level: small addends) on log(odds) accuracy. Math × syntax interaction terms are dropped from final models due to lack of significance (zs < 1). All models contain crossed random subject/item intercepts

Reaction time

Reaction times more than 3 standard deviations from each condition were discarded (affecting 1.9% of the data, overall). Remaining reaction times were analyzed separately by region, using linear mixed effects models.Footnote 1 Analyses were conducted on all trials regardless of accuracy, as in Fedorenko et al. (2007). Table 3 presents mean reading times by region and condition (with comparisons with those in Fedorenko et al., 2007), and Table 4 presents model results for each condition and region. Analysis of the critical region 2 containing the SRC/ORC contrast revealed main effects for both math and syntax, such that the harder conditions in each instance corresponded to longer reaction times, but there was no interactionFootnote 2 (p = .32 in the initial model). Examination of the other regions revealed main effects for math in regions 1, 3, and 4, such that trials containing hard arithmetic took longer.

Table 3 Present Experiment 1 mean self-paced reading times (in raw milliseconds) compared with results of Experiment 1 in Fedorenko et al. (2007), separately by region and condition. Arithmetic conditions, small addends versus large addends; syntax conditions, subject-extracted relative clause versus object-extracted relative clause (SRC/ORC). Critical region 2 is shown in bold. Standard errors in parentheses
Table 4 Linear mixed effects analysis of reaction time by region in Experiment 1: Effects of syntax (reference level: SRC) and math (reference level: small addends) on raw reaction times in milliseconds. Math × syntax interaction terms are dropped from all final models when pMCMC > .1 in the initial model. All models contain crossed random subject/item intercepts

Raw reaction times from region 2 in this experiment were compared with the original data from Fedorenko et al. (2007) (Supplementary Material: Tables S2, S3). There were interactions between math and population and between syntax and population, such that our subjects took longer than those in Fedorenko et al. in the difficult conditions. There was also an overall interaction between math and syntax, as found in the Fedorenko et al. data alone. The three-way interaction of population, math, and syntax was not significant.

In order to determine whether individual differences in processing efficiency played a role in determining the size of the interaction effect, the correlation between the by-subject reading time adjustment and the interaction terms was calculated in the combined group (Supplementary Material: Table S4). The correlation was not significant (r = .062).

Discussion

Although we replicated the main effects of arithmetical and syntactic complexity on answering questions and self-paced reaction times reported in Fedorenko et al. (2007), we failed to observe their reported interaction of these factors in region 2 in our experiment. The results suggest that there are individual differences in the processes that generated the interaction in Fedorenko et al. (Experiment 1). These individual differences do not appear to be due to individual differences in a shared working memory system, because there was no association between individual processing efficiency, as measured by overall processing speed, and the size of the interaction effect.

Although we did not replicate Fedorenko et al.’s (2007) result, both their result and the finding of an interaction of math and syntax in the critical segment in the analysis combining the two populations may be taken as evidence of an interaction between math and syntax under these dual-task conditions. Experiment 2 addressed the question of whether the interaction would arise with another group of subjects taken from the BU population when a second, similar design was used (Fedorenko et al., 2007, Experiment 2) and further explored the interaction by measuring eye fixations.

Experiment 2

Method

Subjects

Forty subjects were recruited from BU, using the same selection criterion as in Experiment 1.

Materials

Materials were taken from Experiment 2 in Fedorenko et al. (2007). Sentential materials and fillers were identical to those used in Experiment 1. Arithmetical complexity was manipulated by varying the type of operation involved (first addition term between 30 and 50; first subtraction term between 40 and 60; subsequent addends/subtrahends between 3 and 6).

Design and procedure

The procedure followed the Experiment 1 methodology, with the following change: Subjects were tested using an Eyelink II head-mounted eye-tracking system (SR Research, Ltd), and the number and the phrase in each segment were separated by 8 cm (approximately 8.5° of visual field) to eliminate parafoveal information. Arithmetic and comprehension question accuracy (via computer keyboard), response times for each region (via USB response pad), and fixation data were recorded.

Results

Accuracy

Subjects were 83% accurate on arithmetic questions and 71% accurate on comprehension questions, overall. Accuracy data (Tables 5 and 6) were analyzed as in Experiment 1. A main effect of syntactic complexity on math question accuracy was found, such that responses were less accurate in the ORC condition. There were no other significant effects for either question type.

Table 5 Mean Experiment 2 question accuracy by condition with standard errors in parentheses
Table 6 Mixed logistic regression analysis of question accuracy in Experiment 2: Effects of syntax (reference level: SRC) and math (reference level: addition) on log(odds) accuracy. Models contain crossed random subject/item intercepts and random slopes for math × syntax interactions

Reaction time

Self-paced reaction times (Tables 7 and 8) were analyzed as in Experiment 1. Of the data, 1.7% were discarded as outliers. Analysis of the critical region 2 containing the SRC/ORC contrast revealed main effects for both math and syntax, such that the harder conditions in each instance corresponded to longer reaction times, but there was no interaction (p value = .73 in the initial model). Examination of the other regions revealed main effects for math in regions 3 and 4, such that trials with subtraction took longer.

Table 7 Experiment 2 mean self-paced reading times (in raw milliseconds), by interest area and condition. Critical region 2 is shown in bold. Standard errors in parentheses
Table 8 Linear mixed effects analysis of reaction time by region in Experiment 2: Effects of syntax (reference level: SRC) and math (reference level: addition) on raw reaction times in milliseconds. Math × syntax interaction terms are dropped from all final models when pMCMC > .1 in the initial model. All models contain crossed random subject/item intercepts

Eye-tracking analyses

Fixations on regions before stimuli appeared and after they disappeared in the self-paced moving window were removed (2.5% and 3.8% of total fixations, respectively). For the sake of brevity, analyses will focus on region 2, with interest areas consisting of the relative pronoun (“who”), the noun phrase (NP; “the plumber”), the verb (“frustrated”), and the number appearing in this region (“number 2”). Three reading time measures were calculated for these interest areas: first-pass reading time (the sum duration of the first fixation plus all subsequent fixations until the subject left the interest area), total reading time (the sum duration of all fixations on the interest area), and second-pass reading time (the difference between first-pass and total reading times). Mean reading times by condition, interest area, and reading measure were calculated (Table 9), and data were analyzed using linear mixed effects models, as above (Table 10). Second-pass reading times produced the same results as total reading times and are not reported.

Table 9 Experiment 2 Mean first-pass reading times and total reading times (in milliseconds), by relative clause interest area and condition. Arithmetic conditions, addition versus subtraction; syntax conditions, subject-extracted relative clause versus object-extracted relative (SRC/ORC). Standard errors in parentheses
Table 10 Linear mixed effects analyses of eye fixations within region 2 of Experiment 2: Effects of syntax (reference level: SRC) and math (reference level: addition) on raw first pass, first pass after a fixation on number 2, and total reading time in milliseconds on relative pronoun, noun phrase, and verb. Math × syntax interaction terms are dropped from all final models when pMCMC > .1 in the initial model. All final models contain crossed random subject/item intercepts

For the number in region 2, there was a main effect of math on total reading time, such that subjects took longer on trials with subtraction. There were no other effects in either first-pass or total reading time.

For the relative pronoun, there was a main effect for syntax on both first-pass and total reading time, such that subjects took longer in the ORC condition. There were no math effects and no interaction.

For the relative clause verb, there were no syntax effects on first-pass reading time, and total reading time was longer on the SRC verb than the ORC verb. There were no math effects and no interaction.

For the NP, there were main effects of both syntax and math on first-pass reading time, with the hard conditions (subtraction, ORC) taking longer in each case. There was also a main effect of syntax and an interaction between math and syntax in total reading time, with the ORC/hard arithmetic condition taking the longest.

In order to make certain that first-pass reading times for the relative clause were influenced by demands made by concurrent arithmetic, we analyzed the subset of first-pass reading times following a fixation on the number in region 2 (94% of trials for the noun and verb interest areas and 80% of trials for the relative pronoun). In this set of analyses, the main effect of syntax on first-pass reading time for the NP was reduced to a trend (p = .1), but there were no other changes in significance for any other factor or interest area.

To assess the effect of changing fixations from math to syntax, we analyzed total reading time following fixations on the same and different type of materialFootnote 3 (syntax, math) (Supplementary Materials: Tables S5 and S6). For fixations on the NP following fixations on the number, there were no main effects or interactions of arithmetical or syntactic complexity. For fixations on the NP following fixations on other sentence segments, there were no main effects, but there was a significant superadditive interaction of arithmetical and syntactic complexity.

To determine whether the interaction of syntactic and arithmetic complexity in total reading times for the NP was due to subjects’ fixating the NP after the verb in ORCs (i.e., to initially skipping the ORC NP), we analyzed the proportion of first fixations on the verb and NP of each sentence type in each condition. Of the first fixations, 99% fell on the NP before the verb in ORCs.

Finally, we also analyzed results by positions —position 1 consisting of the verb in the SRC condition and the NP in the ORC condition, and position 2 consisting of the NP in the SRC condition and the verb in the ORC condition (Supplementary Materials: Table S7). Because of differences in lexical items in these positions in SRC and ORC sentences, interest area word length and average log frequency (SUBTLEXus database; Brysbaert & New, 2009) were included as factors in these models. There were main effects of math and syntax on first-pass reading time and on total reading time, such that the ORC and subtraction conditions took longer. There was an interaction between position and math for total reading time (and a trend in the first-pass measure), such that reading times were disproportionately long in the position 1/subtraction condition, and between position and syntax in total reading time, such that reading times were disproportionately long in the position 1/ORC condition. The three-way interaction between math, syntax, and position was not significant (p > .1 in the initial model).

Discussion

We replicated the main effects of arithmetical and syntactic complexity on self-paced reading times reported in Fedorenko et al. (2007) but again failed to find an interaction of these factors in region 2 in our experiment. Eye fixations showed some expected effects of complexity (we discuss unexpected results below). There was an interaction of math and syntax in total reading time for the NP. The interaction itself points to some process or resource that affects both the syntactic and arithmetic tasks. The fact that it was found only when subjects had immediately previously fixated on the sentence material indicates that it was not due to costs of shifting visual focus (and, presumably, attention) from one input and task to the other. The fact that it was found following fixations on the sentence, and not on the number, is also consistent with its being due to a shared resource: Inspecting the number might have allowed subjects to complete arithmetic processing or to reinforce the encoded intermediate product of computation, thereby reducing concurrent memory demand while they accomplished the sentence task and eliminating the interaction of arithmetic and sentence processing. The remaining, critical question is what process(es) does this shared resource support.

The locus and timing of the interaction differ from the locus and timing of the effect of syntactic complexity on eye fixations for reading sentences in isolation. Effects of the ORC structure at the NP consist of more regressions and (consequently) longer go-past reading times (Staub, 2010) and have been attributed to surprisal (Levy, 2008; Staub, 2010). Effects of the ORC structure also are seen in the form of longer first-pass reading times for the verb (Staub, 2010), which have been attributed to integration of information retrieved from memory with the current input. There was no interaction of syntax and arithmetic for this constituent in this study. Thus, neither the location nor the timing of the observed interaction supports Fedorenko et al.’s (2007) claim that it results from a working memory system that is common to syntactic and arithmetical operations. However, features of the eye fixations in this study suggest that it is risky to base interpretations of eye fixations here on results in normal reading. We take this up in the General Discussion section.

General discussion

The self-paced reaction time results in these studies differ from those in Fedorenko et al. (2007), in that we did not find an interaction of syntactic and arithmetic complexity. The differences between the groups do not seem to be due to individual differences in a working memory capacity that is shared by the two tasks, because there was no correlation between how quickly a subject processed the critical segment and the size of his/her interaction effect. Subjects in our Experiment 1 were less efficient than those in Fedorenko et al.’s (showing larger reading time main effects for both tasks) and would, therefore, have been expected to show the interaction if it arose from reliance of the two processes on a common working memory system. These results thus raise doubts about Fedorenko et al.’s conclusion that operations in these tasks rely on a common working memory system.

The eye fixation results showed an interaction of syntactic and arithmetical complexity in total reading times for the relative clause NP. On the basis of the literature on eye fixations in isolated sentences, the time course and location of this interaction are not consistent with its arising from a resource involved in both parsing and interpretation. However, as has been noted, the results showed some features that have not been described in the reading of isolated sentences. Subjects were slow: First-pass reading times on the first element in the relative clause (~1,000 ms) were approximately double those reported by Staub (2010), and viewing time for region 2 was 3.5–4 s, much longer than is usually seen in self-paced reading times in isolated sentence reading. Subjects fixated the first constituent in the relative clause differently than in normal reading, showing longer total reading times on the verb of SRCs than ORCs and greater effects of complexity in total reading times for the verb of SRC and the NP of ORC sentences. The differences in self-paced reading times and eye fixations in this task and reading of isolated sentences could be due to either of the two major differences between these conditions—the requirement to do a second task, and the disappearance of each segment once the button has been pushed (which makes many normally occurring regressive eye movements uninformative). The results of this study suggest that subjects react to these features of this task both by increasing their exposure to each segment and by allocating eye fixations to segments within the visible regions in ways that differ from those in normal exposure conditions. This affects the interpretation of the interaction seen in the eye fixation data. If subjects’ processing of sentences differed from normal reading conditions, it is possible that the interaction in total reading times on the ORC NP reflects competition of parsing and arithmetic for a single set of resources, which is manifest in ways not expected on the basis of effects in normal reading.

One possibility (suggested by an anonymous reviewer) is that subjects often skipped the ORC NP, leading to different aspects of parsing and interpretation being associated with total reading times on this item than are seen in normal reading. However, subjects did not skip the ORC NP, ruling out this possibility. Another possibility is that subjects postponed some syntactic and interpretive processing, such as the integration of successive representations (or probability distributions of representations) that produces surprisal effects, from initial to later fixations. There is no way to directly verify this possibility.

Alternatively, the interaction of math and syntax seen in total fixations on the ORC NP, but not in first-pass reading times on this segment, may reflect reliance of what Caplan and Waters (1999) called “post-interpretive” language-related operations and arithmetic on a common resource pool. One such operation might be a late check of the encoded meaning of the phrase against the input for the purpose of later end-of-sentence verification. Traxler, Morris, and Seely (2002; Traxler, Williams, Blozis, & Morris, 2005) interpreted regressions from the main clause to the sentence-initial NP in ORCs in this way. In a self-paced study, the only informative regressions in ORCs are from the verb to the NP, and these may serve this purpose. Verification of meaning after it has been assigned and reinforcement of what has been encoded in memory are “post-interpretive” verbally mediated operations that Caplan and Waters suggested are supported by a “general” verbal working memory system that could also support arithmetic.

In summary, interactions between concurrent arithmetical and syntactic complexity in self-paced times in dual-task experiments were not observed in our study population. Eye fixations did show such an interaction, but, on the basis of the results in normal reading, its location and timing do not suggest that it arises because of a resource that is shared between syntactic and arithmetical operations. The eye fixations also suggest that subjects in this complex dual task engage in some processes, or scheduling of processes, that differ from those that occur when sentences are read under normal viewing conditions. All these results raise doubts about the inference Fedorenko et al. (2007) drew from their data that syntactic and arithmetical operations utilize a common pool of resources.