P300-Based Brain-Computer Interface Speller: Usability Evaluation of Three Speller Sizes by Severely Motor-Disabled Patients

Medina-Juliá, M. Teresa; Fernández-Rodríguez, Álvaro; Velasco-Álvarez, Francisco; Ron-Angevin, Ricardo

doi:10.3389/fnhum.2020.583358

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 29 October 2020
Sec. Brain-Computer Interfaces
Volume 14 - 2020 | https://doi.org/10.3389/fnhum.2020.583358

P300-Based Brain-Computer Interface Speller: Usability Evaluation of Three Speller Sizes by Severely Motor-Disabled Patients

M. Teresa Medina-Juliá

Álvaro Fernández-Rodríguez

Francisco Velasco-Álvarez^*

Ricardo Ron-Angevin

Departamento de Tecnología Electrónica, Universidad de Málaga, Malaga, Spain

Brain-computer interface (BCI) spellers allow severe motor-disabled patients to communicate using their brain activity without muscular mobility. Different visual configurations of the widely studied P300-based BCI speller had been assessed with healthy and motor-disabled users. However, the speller size (in terms of cm) had only been assessed for healthy subjects. We think that the speller size might be limiting for some severely motor-disabled patients with restricted head and eye movements. The usability of three speller sizes was assessed for seven patients diagnosed with amyotrophic lateral sclerosis (ALS) and a participant diagnosed with Duchenne muscular dystrophy (DMD). This is the first usability evaluation of speller size with severely motor-disabled participants. Effectiveness (in the online results) and efficiency (in the workload test) of the medium speller was remarkably better. Satisfaction was significantly the highest with the medium size speller and the lowest with the small size. These results correlate with previously described findings in healthy subjects. In conclusion, the speller size should be considered when designing a speller paradigm, especially for motor-disabled individuals, since it might affect their performance and user experience while controlling a BCI speller.

Introduction

Amyotrophic lateral sclerosis (ALS) is a neurological disorder that degenerates the upper and lower motor neurons, leading to paralysis and eventually death (Patterson and Grabois, 1986). However, other functions such as sensory perception or intellectual abilities are usually preserved. ALS patients may gradually enter a locked-in state (LIS), where they are only able to slightly move their eyes and make other small residual movements (Bauer et al., 1979; Murguialday et al., 2011). On the other hand, Duchenne muscular dystrophy (DMD) is a genetic progressive muscular degeneration disorder which also leads to paralysis and eventually death (Emery et al., 2015). As with ALS patients, DMD patients usually preserve their sensory perception and intellectual abilities (Emery et al., 2015). Some of the main differences between ALS and DMD disorders are that DMD is genetic, usually starts at early ages—childhood—and often evolves slowly; while ALS cause is unknown, usually starts at later ages—adulthood—and normally evolves faster than DMD.

Researchers in the field of assistive technology have developed different systems to these patients with an alternative communication channel, e.g., eye-tracker (Pal et al., 2017) or brain-computer interface (BCI) systems (Birbaumer, 2006). The latter allows people to interact with their environment using brain activity without any peripheral nerve involvement [see Nicolas-Alonso and Gomez-Gil (2012) for an extended review of BCI]. As these patients may not be able to control gaze at some stages of their condition, they may require a BCI to establish communication to provide them with some autonomy in their daily life.

According to Nicolas-Alonso and Gomez-Gil (2012), BCI systems most often use electroencephalography (EEG) to measure a subject’s brain activity to study different waveforms. This article will focus on the P300 evoked potentials, which are positive peaks appearing 300 ms after an odd stimulus happens. This signal is typically used by P300-based BCI systems named virtual spellers (Rezeika et al., 2018). The first oddball paradigm was proposed by Farwell and Donchin (1988); it had a matrix of letters and numbers, with each matrix’s column and row flashing pseudo-randomly. The subject must pay attention to a particular character while the rows and columns flash, and when his/her target character is flashed, the P300 potential is evoked and recorded by the BCI system to determine which letter the user wants to select.

Numerous visual factors, e.g., colors or the nature of the stimulus, have been assessed in a P300 speller (Ikegami et al., 2012; Acqualagna et al., 2013; Li et al., 2015). However, the speller size has barely been tested. Sellers et al. (2006) compared two matrixes with different numbers of elements (3 × 3 and 6 × 6) and dimensions (5.44°H × 7.07°W and 8.30°H × 10.90°W of visual angle, respectively). In that study, the small matrix showed better accuracy; however, it is unknown if this difference is due to the visual angle defined by the speller or the number of elements in the speller. On the other hand, Salvaris and Sepulveda (2009) tested three visual configurations of the speller relative to the background color, the distance between symbols, and symbol size. However, the symbol size and symbol distance parameters were not varied together to find the optimal actual speller dimensions. Nonetheless, their results showed that the matrix with the smallest size gave the worst performance for both conditions. Li et al. (2011) compared three screen sizes (computer monitor: 17″, 1,200 × 1,000 pixels; GPS: 9″, 700 × 500 pixels; cell phone: 5″, 260 × 425 pixels) and concluded that better performance was achieved with the largest screen size. However, no information about the speller and symbol size was provided. Therefore, the differences between resolutions and the lack of the exact measurements of the spellers and symbols prevent satisfactory conclusions about the symbol size. Finally, Ron-Angevin et al. (2019) assessed three speller sizes—under overt and covert attention—using the usability approach (ISO, 2000). They found that the medium speller size (9.98 × 9.98 cm; 9.5°H × 9.5°W) was the most convenient since it offered high effectiveness, efficiency and satisfaction.

It is important to highlight that none of the quoted articles used motor-disabled participants. Therefore, it is necessary to verify these results with potential end-users. Ron-Angevin et al. (2019) studied three speller sizes with healthy subjects under overt attention conditions. Nevertheless, this condition might not be representative of motor-disabled patients as most of them may only preserve the residual head and eye movements at some stages of their disease (Patterson and Grabois, 1986; Emery et al., 2015). In this sense, an adequate speller size has to be established considering the limitations of patients’ gaze and head movement. While large sizes might be hard to handle and tiring due to the required muscular movements, a too-small speller could be less tiring but lead to inaccuracy in the perception of the speller’s elements.

Hence, the present study aims to assess the effect of three different speller sizes, in terms of the delimited visual angle, to determine the most appropriate speller size for severely motor-disabled participants. The sizes studied were proposed by Ron-Angevin et al. (2019). Moreover, a usability approach (ISO, 2000) was employed for the evaluation with three factors studied: effectiveness, efficiency, and satisfaction.

Materials and Methods

Participants

Seven Spanish participants diagnosed with ALS (P1-P7, all males, aged 64.43 ± 11.1) and one diagnosed with DMD (P8, male, aged 26) volunteered for the study. Two ALS volunteers (P9 and P10) could not take part in the experiment because the signal classifier was unable to generate usable weights for their brain waveform classification matrix, so they were unable to control the system. Every participant, or the corresponding legal representative, provided written informed consent.

According to self-reports, the participants had no history of neurological or psychiatric illness besides ALS or DMD and had normal or corrected to normal vision (Table 1). The patients were referred by the ALS Association of Andalusia, and none of them had prior experience with BCI systems. The test took place in their home but was coordinated by the research group UMA-BCI¹. The study was approved by the Ethics Committee of the University of Malaga and met the ethical standards of the Helsinki Declaration.

TABLE 1

Table 1. Participants’ information.

EEG Recording and Signal Processing

EEG data were registered using an acti-Champ amplifier (Brain Products GmbH, Munich, Germany) and recorded using the electrode positions: Fz, Cz, Pz, Oz, P3, P4, PO7, and PO8 according to the 10/20 international system. The electrodes were referenced in TP8 and grounded in AFz. A band-pass filter at 0.1–30 Hz was applied, and the Notch filter (50 Hz) was on. BCI2000 (Schalk et al., 2004) was used to control all aspects of EEG data collection and processing except for the analysis of the waveforms, which was carried out with MATLAB’s toolbox EEGLAB (Delorme and Makeig, 2004).

Spelling Paradigms

Three speller sizes were designed according to Ron-Angevin et al. (2019), where the three of them had a similar appearance as the classic P300 speller: characters in gray color (stimulus off) were presented over a black background; when the “flash” occurred (i.e., stimulus on), the characters turned to white color. A flash lasted 128 ms and the time between flashes (inter-stimuli interval, ISI) was 128 ms as well. After every set of flashes, there was a pause of 6 s except for patients P1, P2, and the first speller of P3 who used 2 s due to a mistake while applying the experimental protocol. This timing difference might not have been a problem as discussed below in the Discussion section. Each sequence of stimulation consisted of flashing one time every row and column (which implies that each character flashed two times per sequence). During the calibration and online phase, ten sequences were used. The spellers consisted of a 6 × 6 character matrix with the English alphabet and numbers from 0 to 9 (Figure 1).

FIGURE 1

Figure 1. Speller’s size parameters. MS stands for “speller size,” SS for “symbol size,” and SD for “symbol distance.”

According to Ron-Angevin et al. (2019), the used symbol sizes and distance between columns and rows were selected as follows (Table 2):

(1) The largest size was the one proposed by Treder and Blankertz (2010), which is usually used by other researchers like Brunner et al. (2010) and Brunner et al. (2011). This matrix size defined a visual angle of 13.96° both horizontally and vertically. The symbol size delimited a visual angle of 1.12°H × 1.12°W (H and W stand for height and width, respectively), and the separation between characters was 1.46° horizontally and vertically.

(2) In the opposite case, the smallest size was selected following what (Salvaris and Sepulveda, 2009) reported as the minimum symbol size that could be used without loss of performance. In this case, the delimited visual angle by each symbol was 0.4°H × 0.45°W. In the present study, the selected symbol size keeps the same metrics, defining a square visual angle of 0.4°H × 0.4°W. The separation between characters was calculated proportionally to the size and separation in the large size case: 0.5° horizontally and vertically.

(3) The selected medium size was the middle size between the large and small ones. The visual angle defined by the matrix was 9.5°H × 9.5°W, the one defined by the symbols was 0.75°H × 0.75°W, and the angle defined by the vertical and horizontal separation was 1° for each of them.

TABLE 2

Table 2. Values of the spellers’ size parameters.

Procedure

The experimental protocol consisted of three sessions of 60 ± 10 min. The order of the spellers’ usage was counterbalanced between participants. The time between sessions was in a range of 5 h and three days. Each session consisted of three phases: (i) a calibration phase; (ii) an online spelling phase; and, finally, (iii) subjective questionnaires fill out phase.

Calibration Task

Participants were asked to mentally count the times that the first desired letter flashed and, when the first set of flashes was over, to focus on the next letter. They had to repeat this procedure until the word was completed. The Spanish words to calibrate were “LUNA,” “RAMO,” “KILO,” and “2015.” Before each word calibration started, the participants were reminded of the word to spell. Only the last three calibrated words were used to obtain the speller classifier’s weights by applying a stepwise linear discriminant analysis (SWLDA) to offer the corresponding feedback during the online phase.

Online Task

Three Spanish words were spelled one after the other: “CHAT,” “PURE,” and “1935.” If the classifier selected a wrong letter, participants had no option to correct the mistake. Participants were reminded of what words to spell during the test. This time, each typed letter was represented in a text box placed above the matrix.

Subjective Questionnaires

The last part of each session consisted of answering three different questionnaires: two visual analog scales (VAS) questionnaires and the NASA-TLX test (Hart and Staveland, 1988). Finally, when the three sessions were concluded, a comparative questionnaire for the three sizes was filled out.

Usability Evaluation

The evaluation of the usability was carried out considering the approach proposed by ISO (2000), including three measures: effectiveness, efficiency, and satisfaction.

Effectiveness

Effectiveness was related to the degree of correctness with which the user completed the tasks. For this purpose, different results were obtained:

(i) Accuracy during the classification phase, which indicates the classifier accuracy after it analyzed and classified the EEG data of a participant in each sequence.

(ii) Error performance (EP) in the online phase, which was calculated by dividing the number of wrong selections by the total of selections and multiplied by 100; and percentage of participants that met the MEP30 criterion, which correlates to the 30% threshold that Kübler et al. (2001) indicated as the maximum EP allowed to establish an efficient communication system.

(iii) Analyses of the ERP target and no-target waveforms and the amplitude difference (AD) of the ERP stimuli waveforms (i.e., ERP target waveform—ERP non-target waveform).

Efficiency

The efficiency relates to the resources expended to complete a task. In this case, three results were considered:

(i) Subjective workload assessed using NASA-TLX, which evaluated the mental, physical, and temporal demand, as well as the performance, effort, and frustration perceived by the participant.

(ii) VAS fatigue (Kim et al., 2010), whose weight varied from 0 to 10 (where 0 is the minimum and 10 the maximum), was used to evaluate the level of fatigue experienced during the test.

(iii) The second VAS of questions regarding the speller’s perception was applied to evaluate the difficulty in perceiving the characters (Q1), the difficulty in perceiving the characters away from the center (Q2), and the difficulty in distinguishing the different rows and columns (Q3).

Satisfaction

Finally, satisfaction was related to the users’ attitude. The subjective feelings about the different speller sizes were analyzed using the comparative questionnaire based on the System Usability Scale (Brooke, 1996). This questionnaire compared complexity, stressfulness, controllability, tiredness, comfortableness, and user preference for the spellers. Specifically, participants had to assign the spellers the ranks “the least,” “the intermediate,” and “the most” preferred.

A satisfaction index was calculated as in Ron-Angevin et al. (2019) to provide a general perspective of this questionnaire. Firstly, the satisfaction’s related variables were categorized as positive (controllable, comfortable, and preferred) or negative (complex, stressful, and tiring). Finally, each rank was associated with a score: rank 1 (the least) as ±1, rank 2 (the intermediate) as ±2, and rank 3 (the most) as ±3. The sign of the score depended on the category of the variables.

Statistical Analyses

The present study employed factorial analyses—unifactorial, since only the speller size factor was studied—with three levels (one for each speller size). Specifically, an ANOVA or a Friedman’s test was applied depending on whether the sample met, respectively, the assumption of normality or not (accuracy, EP, all variables relative to the efficiency dimension and the satisfaction index). Likewise, for the ANOVA, the Greenhouse-Geisser correction was applied in case the sphericity assumption was not satisfied. Afterward, for the multiple comparison analysis, the Bonferroni’s correction method was used. On the other hand, for those variables that aimed to study whether the distribution in each of the variables depended on the speller size, a Fisher’s exact test was employed (concretely, MEP30 threshold, and the variables related to the System Usability Scale). The EEGLAB software (Delorme and Makeig, 2004) was used to carry out the ANOVA related to the study of the speller size factor on the ERP waveform.

Results

The collected results from the patients are presented according to the usability criteria.

Effectiveness

Classification Accuracy During the Calibration Phase

According to Friedman’s tests, no significant differences in accuracy between sizes were found in any sequence (Figure 2).

FIGURE 2

Figure 2. Accuracy (%) obtained by every participant and average (±SD) in each condition and sequence.

Error Performance During Online Spelling

The ANOVA relative to the EP for the three speller sizes did not show significant differences between conditions (Table 3). The percentage of participants that achieved the MEP30 threshold was 50% (four participants out of eight) for small size and 62.5% (five participants out of eight) for medium and large sizes, so no significant difference was noted according to the Fisher’s exact test.

TABLE 3

Table 3. Error performance (%) of every participant in each condition.

Event-Related Potentials During the Calibration Phase

Figure 3 shows the ERP waveform of target and non-target stimulus, and the difference between them (AD) for each condition and channel. Significant differences were not found in any time interval between conditions.

FIGURE 3

Figure 3. (A) ERP target grand average, (B) ERP non-target grand average, and (C) amplitude difference (AD; i.e., ERP target waveform − ERP non-target waveform). Every graph is presented over time (ms) and shows the grand average of each channel and speller.

Efficiency

The results obtained from the subjective questionnaires are presented.

VAS Fatigue and NASA-TLX

According to the ANOVA, the following parameters relative to the NASA-TLX test offered a significant main effect produced by the speller size factor (Table 4): physical demand (F_(2,14) = 4.029; p = 0.041), temporal demand (F_(2,14) = 4.927; p = 0.024) and effort (F_(2,14) = 5.107; p = 0.022). Nevertheless, due to the multiple comparisons’ correction applied, the post hoc analyses only showed significant differences for the effort factor between the medium and small sizes (p = 0.012).

TABLE 4

Table 4. VAS fatigue and NASA-TLX scores (mean ± standard deviation).

Perception of Subjective Questionnaires

Friedman’s test did not show significant differences in any statement, that is, the main effect produced by the speller size factor was not observed (Table 5).

TABLE 5

Table 5. Scores of each perception parameter.

Satisfaction

Figure 4 shows the percentage of patients that recorded satisfaction with each speller size for each factor. According to the Fisher’s exact test, statistical differences were detected between speller sizes and percentage of participants that selected each rank in all factors: complex (p = 0.025), comfortable (p = 0.011), stressful and controllable (p = 0.004), tiring (p = 0.001), and finally, preferred (p = 0.001).

FIGURE 4

Figure 4. Percentage of patients that chose a Rank regarding factors for each speller. Rank 1 stands for “the least,” Rank 2 “the intermediate,” and Rank 3 “the most.”

Figure 5 shows the representation of the satisfaction index, with the medium size speller having the best and the small size the worst. The factor speller size showed a significant main effect (χ²₍₂₎ = 9.25; p = 0.01). Specifically, the multiple comparison analyses showed significant differences between the medium and the small size (p = 0.018), and medium and large size (p = 0.037). However, there were no significant differences between small and large sizes.

FIGURE 5

Figure 5. Representation of the satisfaction index regarding each speller.

Discussion

In this section, the results obtained will be discussed and contextualized concerning previous articles. Specifically, to compare the results of our participants with those obtained by subjects without motor disabilities, the previous work of Ron-Angevin et al. (2019)—which is closely related to the present study—will be used.

Effectiveness

No significant differences were found for any speller in the accuracy achieved in the calibration task. As shown in Figure 2, there are no clear tendencies. Likewise, considering the results obtained in the online task (Table 3) and the MEP30 threshold, no significant differences were found within the spellers. Almost the same number of participants achieved the criterion with the three spellers: for large and medium, five out of eight (62.5%); and for small, four out of eight (50%). These results suggest that the 2-s pause between selections used in the experiments of participants P1, P2, and P3 might have not affected their results, as some participants with a 6-s pause performed even worse than the three of them. Furthermore, the patients’ scores in this study were similar to results of other studies when a speller based on the one of Farwell and Donchin (1988) and with a larger or similar sample size (e.g., Nijboer et al., 2008) and McCane et al. (2014) was used. First, in Nijboer et al. (2008), four out of six participants (i.e., 67%) reached the MEP30 threshold; and in McCane et al. (2014), 17 out of 25 participants (i.e., 68%) overcome the same threshold; in contrast to the present study in which 62.5% participants did reach it with the large and medium sizes.

Nevertheless, a tendency is remarkable in the EP average results obtained in the online phase (Table 6); the lowest and best values are obtained with the medium size as patients can overcome the MEP30 threshold only with this size. A similar tendency was found in the covert condition (Ron-Angevin et al., 2019; Table 6). Most probably, the present article could not statistically confirm this tendency due to the small sample size. The EP averages of the present study and the results of Ron-Angevin et al. (2019) for covert and overt attention (Table 6) indicate that the average results of the patients are substantially worse compared to those of the non-disabled individuals under overt attention, but are closer to healthy subjects under covert attention. These results might suggest that, overall, patients had greater difficulty with speller control, given their possible restricted ocular mobility. Nevertheless, when subtracting the results of those patients who reached the MEP30 criterion for at most one speller (i.e., P1, P5, and P7), the averages obtained are considerably lower (small: 15 ± 17.08%, medium: 8.33 ± 10.21%, large: 11.67 ± 11.18%), reaching the MEP30 criterion all spellers. The worsening of the EP might have been caused by, for example, these three participants’ difficulty to gaze control; however, it is not possible to verify as we do not have this information.

TABLE 6

Table 6. Error performance (%) averages from the online task.

On the other hand, it is worth noting that the performance appears to have no link to the ALSFRS-R score. As declared previously by McCane et al. (2014), this lack of relationship may be due to the ineffectiveness of the ALSFRS-R to measure the ocular deterioration of patients, which is one of the essential requirements to control a visual speller. Specifically, patient P9–who could not control the interface—obtained a score of 0 in the ALSFRS-R and had enormous difficulty in keeping his eyes open during the test. Otherwise, patient P3 had the same ALSFRS-R score, but he achieved a lower EP (0%, 0%, 16.7% for small, medium, and large, respectively) even compared to the average non-disabled participants of Ron-Angevin et al. (2019) for the overt condition (2.8 ± 1.6%, 4.9 ± 2.8%, 16.0 ± 4.5% for small, medium and large, respectively). Thus, some information about their ocular control should be specified.

Regarding the ERP waveforms (i.e., ERP target and non-target stimulus signals) analysis, there were no significant differences between the spellers in any time interval of any channel. Similar results were obtained by Ron-Angevin et al. (2019) with healthy subjects under the covert and overt attention paradigms, as this study did not show significant differences in amplitude nor latency regarding the speller size factor. Therefore, the expected results were obtained in the present study.

Figure 3, in the target and non-target ERP signals of the three spellers, shows a sine wave in every channel possibly due to the constant flashing of the interface, as it has a period close to the SOA (i.e., 256 ms). To remove this side effect, the AD between ERP stimulus signals was calculated. This last study did not show statistical differences. A possible P300 component is shown in Fz, Cz, and Pz between 200 and 600 ms with a maximum peak amplitude at around 300 ms. However, this component is affected by a negative peak at 400 ms that could be provoked by the sinusoidal wave. The P300 component observed in both conditions of Ron-Angevin et al. (2019) is shown in every channel and has a longer latency (between 200 and 500 ms with a maximum peak at around 400 ms) than in the present study. However, our results coincide with what declared other studies with patients (McCane et al., 2015). In the occipital zone (PO7, PO8, and Oz), a possible N200 component can be observed from 200 to 400 ms, which might have canceled the P300 component. On the other hand, this negative component was also found in Ron-Angevin et al. (2019) in the parietal-occipital zone, but only under the overt condition. Therefore, it could be inferred that the patients might possess adequate eye mobility, at least to the point of being able to fix their attention on the desired stimuli, as N200 is the earliest component that correlates with visual awareness (Railo et al., 2011).

Considering the ERP waveforms (Figure 3), the results from the calibration phase could be explained as both measures correlate, especially looking at the AD waveform. We think that the AD waveform—instead of the target or non-target ERP waveforms—might be the most interesting to analyze because it shows how different in amplitude are the ERP target and no-target signals, and thus the ease of distinguishing between both signals for the classifier. On average, no significant differences were observed between the three speller sizes in the AD signal nor in the classification accuracy. Specifically, participants yield the MEP30 threshold in the 6^th sequence and from that sequence, the performance of the three spellers is quite similar with only small differences.

Efficiency

Three dimensions had significant differences in the NASA-TLX results (i.e., physical demand, temporal demand, and effort). However, due to the applied multiple comparisons’ correction, only the medium size speller required less effort than the small size with a significant difference. Remarkably, the small size had the highest score in these three factors, and the medium size had the lowest in the temporal demand and effort dimensions. Interestingly, the results of the healthy subjects of Ron-Angevin et al. (2019) did not present statistical differences between spellers in the overt attention condition for any dimension, what might indicate that they were not highly affected by the speller size in terms of total workload while controlling a speller BCI in contrast to the motor disabled participants. Nevertheless, the average total workload declared by them (i.e., 40.4 ± 7.2, 38.22 ± 4.8, 41.2 ± 6.4, for the small, medium, and large sizes, respectively) is notably higher in contrast to patients (i.e., 40.92 ± 15.28, 29.63 ± 10.35, 33.92 ± 18.7, for the small, medium and large sizes, respectively). A possible explanation for these results could be that patients were more positive or optimistic during the test than the healthy participants due to their condition. Furthermore, these results suggest that: (i) the small speller size was the most complicated for the patients; and (ii) the medium size was the less demanding for patients and healthy subjects. In contrast, the average total workload of the three spellers from the present work was also smaller than described by Pasqualotto et al. (2015), whose motor-impaired participants had an average total workload of 47.64 ± 14.87. This difference may be explained by the lower ALSF-R in patients of Pasqualotto et al. (2015) than in our study (i.e., 15.5 ± 13.26 and 23 ± 16.61, respectively).

Satisfaction

The medium size was selected as the best option for every dimension and the small size is the one with the worst results for most of the dimensions (i.e., for six out of seven dimensions) according to satisfaction questionnaires. On the other hand, the non-preference for the small speller could be explained by the difficulty in perceiving the different stimuli in general (i.e., Q1, Q2, and Q3). The results of the healthy subjects under the overt condition of Ron-Angevin et al. (2019) did not show any trend regarding the most convenient speller size since the large and medium sizes obtained similar scores. However, they showed that the small size is the worst option in the four dimensions that presented significant differences. Thus, it could be affirmed that the small size is the least convenient for patients and healthy subjects.

In the satisfaction index (Figure 5), the medium speller is the only size that had the most positive scores (significantly better than the other two sizes). Similar results were also found by Ron-Angevin et al. (2019) as the medium size was the only speller that got positive scores in both conditions (i.e., cover and overt). Therefore, it seems clear that the most convenient speller size is the medium one.

Limitations

BCI-based studies that include results of severely motor-disabled patients usually share the limitation of having a small sample size due to the difficulty in finding patients that would like to volunteer. The present study was able to include a similar or larger sample size than one reported in the literature (Kaufmann et al., 2013; Severens et al., 2014; Speier et al., 2017; Zhang et al., 2017). Despite the limited sample size used in this article, some conclusions can be drawn from the results. On the one hand, a remarkable tendency was observed of the medium size as the one with the best EP results from the online phase. On the other hand, from the subjective measures, the medium size can be concluded as the most convenient size and the small size the least convenient in a significant manner. Most probably, if the sample size were larger, the trend observed in the objective measures could have been statistically affirmed and the control of different variables that may influence the system performance would have been included.

Conclusions

This work is the first study related to speller sizes for motor disabled people. It has shown that the size of the speller matters and should be considered for this population. Furthermore, it has been proved that the most commonly used speller size (i.e., the large one) might not be the most suitable for patients.

Summarizing, in the present study the medium size is the most and the small size the least usable in terms of satisfaction dimension. Furthermore, a tendency is remarkable in the EP averages (from the effectiveness dimension), which highlights the medium size as the only speller that enables efficient communication according to the MEP30 criterion. Finally, while the medium speller was selected as the least temporal demanding and the one that required less effort to control, the small size was selected as the most physically demanding and the one that required more effort according to the NASA-TLX scores (from the efficiency dimension).

The results from the objective measures show a large variability which suggests that optimization for each individual might be worthwhile. For example, P1 and P6 performed better with the medium size, P4 with the large size, P5 with the small, while P8 achieved 0% EP with the three spellers. On the other hand, considering the EP average results of the online phase and the subjective measures, it can be concluded that, among the three sizes studied, the medium size is the most convenient. Similarly, the small size can be concluded as the least convenient. Nevertheless, the optimal size should be further studied in future works knowing that it might be placed between the large and medium sizes for most patients. It should be noted that even if the optimal speller size is found, most probably in some cases the speller size will have to be adapted to the necessities of the patient.

Most probably, if the present study had a larger sample size, the medium speller could have been statistically affirmed in every usability dimension as the most suitable size. Nevertheless, this tendency has been already validated by Ron-Angevin et al. (2019) with healthy subjects.

Finally, it will be interesting to investigate other applications in the future, e.g., web-browsing or games, with the medium speller size because this size would leave more space, in contrast to the most frequently used large size, within the monitor screen for these types of applications.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by Comité Ético de Experimentación de la Universidad de Málaga (CEUMA). CEUMA registry number: 51-2019-H. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

MM-J, ÁF-R, FV-Á, and RR-A contributed to the conception and design of the study. MM-J and FV-Á contacted the participants. MM-J, ÁF-R, and FV-Á performed the experiments. ÁF-R performed the statistical analysis. MM-J and ÁF-R wrote the first draft of the manuscript. RR-A was in charge of the funding acquisition, project administration, and supervision. All authors contributed to the article and approved the submitted version.

Funding

This work was partially supported by the Spanish Ministry of Economy and Competitiveness through the project SICCAU: RTI2018-100912-B-I00 (MCIU/AEI/FEDER, UE) and by the University of Malaga (Universidad de Málaga).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to thank all participants for their cooperation and kindness. Special thanks are due to the ALS Association of Andalusia.

Footnotes

^ http://umabci.uma.es

References

Acqualagna, L., Sebastian Treder, M., and Blankertz, B. (2013). “Chroma speller: isotropic visual stimuli for truly gaze-independent spelling,” in International IEEE/EMBS Conference on Neural Engineering, NER, 1041–1044.

Google Scholar

Bauer, G., Gerstenbrand, F., and Rumpl, E. (1979). Varieties of the locked-in syndrome. J. Neurol. 221, 77–91. doi: 10.1007/BF00313105

PubMed Abstract | CrossRef Full Text | Google Scholar

Birbaumer, N. (2006). Breaking the silence: brain-computer interfaces (BCI) for communication and motor control. Psychophysiology 43, 517–532. doi: 10.1111/j.1469-8986.2006.00456.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Brooke, J. (1996). SUS—a quick and dirty usability scale. Usabil. Evaluat. Indust. 189, 4–7.

Brunner, P., Joshi, S., Briskin, S., Wolpaw, J. R., Bischof, H., and Schalk, G. (2010). Does the ‘P300’ speller depend on eye gaze? J. Neural Eng. 7:056013. doi: 10.1088/1741-2560/7/5/056013

PubMed Abstract | CrossRef Full Text | Google Scholar

Brunner, P., Ritaccio, A. L., Emrich, J. F., Bischof, H., and Schalk, G. (2011). Rapid communication with a ‘P300’ matrix speller using electrocorticographic signals (Ecog). Front. Neurosci. 5:5. doi: 10.3389/fnins.2011.00005

PubMed Abstract | CrossRef Full Text | Google Scholar

Cedarbaum, J. M., Stambler, N., Malta, E., Fuller, C., Hilt, D., Thurmond, B., et al. (1999). The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function. J. Neurol. Sci. 169, 13–21. doi: 10.1016/s0022-510x(99)00210-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Delorme, A., and Makeig, S. (2004). EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 134, 9–21. doi: 10.1016/j.jneumeth.2003.10.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Emery, A. E. H., Muntoni, F., and Quinlivan, R. C. M. (2015). Duchenne Muscular Dystrophy, 4 Edn. Oxford, UK: Oxford University Press.

Google Scholar

Farwell, L. A., and Donchin, E. (1988). Talking off the top of your head: toward a mental prosthesis utilizing event-related brain potentials. Electroencephalogr. Clin. Neurophysiol. 70, 510–523. doi: 10.1016/0013-4694(88)90149-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Hart, S. G., and Staveland, L. E. (1988). Development of NASA-TLX (task load index): results of empirical and theoretical research. Adv. Psychol. 52, 139–183. doi: 10.1016/S0166-4115(08)62386-9

CrossRef Full Text | Google Scholar

Ikegami, S., Takano, K., Wada, M., Saeki, N., and Kansaku, K. (2012). Effect of the green/blue flicker matrix for P300-based brain-computer interface: an EEG-FMRI study. Front. Neurol. 3:113. doi: 10.3389/fneur.2012.00113

PubMed Abstract | CrossRef Full Text | Google Scholar

ISO. (2000). ISO/DIS 9241–9, ergonomic requirements for office work with visual display terminals. Tech. Rep. 11:22.

PubMed Abstract | Google Scholar

Kaufmann, T., Schulz, S. M., Köblitz, A., Renner, G., Wessig, C., and Kübler, A. (2013). Face stimuli effectively prevent brain-computer interface inefficiency in patients with neurodegenerative disease. Clin. Neurophysiol. 124, 893–900. doi: 10.1016/j.clinph.2012.11.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, E., Lovera, J., Schaben, L., Melara, J., Bourdette, D., and Whitham, R. (2010). Novel method for measurement of fatigue in multiple sclerosis: real-time digital fatigue score. J. Rehabil. Res. Dev. 47, 477–484. doi: 10.1682/jrrd.2009.09.0151

PubMed Abstract | CrossRef Full Text | Google Scholar

Kübler, A., Neumann, N., Kaiser, J., Kotchoubey, B., Hinterberger, T., and Birbaumer, N. P. (2001). Brain-computer communication: self-regulation of slow cortical potentials for verbal communication. Arch. Phys. Med. Rehabil. 82, 1533–1539. doi: 10.1053/apmr.2001.26621

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Q., Liu, S., Li, J., and Bai, O. (2015). Use of a green familiar faces paradigm improves P300-speller brain-computer interface performance. PLoS One 10:e0130325. doi: 10.1371/journal.pone.0130325

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Nam, C. S., Shadden, B. B., and Johnson, S. L. (2011). A P300-based brain-computer interface: effects of interface type and screen size. Int. J. Hum. Comput. Inter. 27, 52–68. doi: 10.1080/10447318.2011.535753

CrossRef Full Text | Google Scholar

McCane, L. M., E Sellers, W., Mcfarland, D. J., Mak, J. N., Steve Carmack, C., Zeitlin, D., et al. (2014). Brain-computer interface (BCI) evaluation in people with amyotrophic lateral sclerosis. Amyotroph. Lateral Scler. Frontotemporal Degener. 15, 207–215. doi: 10.3109/21678421.2013.865750

PubMed Abstract | CrossRef Full Text | Google Scholar

McCane, L. M., Heckman, S. M., McFarland, D. J., Townsend, G., Mak, J. N., Sellers, E. W., et al. (2015). P300-based brain-computer interface (BCI) event-related potentials (ERPs): people with amyotrophic lateral sclerosis (ALS) vs. age-matched controls. Clin. Neurophysiol. 126, 2124–2131. doi: 10.1016/j.clinph.2015.01.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Murguialday, A. R., Hill, J., Bensch, M., Martens, S., Halder, S., Nijboer, F., et al. (2011). Transition from the locked in to the completely locked-in state: a physiological analysis. Clin. Neurophysiol. 122, 925–933. doi: 10.1016/j.clinph.2010.08.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Nicolas-Alonso, L. F., and Gomez-Gil, J. (2012). Brain computer interfaces, a review. Sensors 12, 1211–1279. doi: 10.3390/s120201211

PubMed Abstract | CrossRef Full Text | Google Scholar

Nijboer, F., Sellers, E. W., Mellinger, J., Jordan, M. A., Matuz, T., Furdea, A., et al. (2008). A P300-based brain-computer interface for people with amyotrophic lateral sclerosis. Clin. Neurophysiol. 119, 1909–1916. doi: 10.1016/j.clinph.2008.03.034

PubMed Abstract | CrossRef Full Text | Google Scholar

Pal, S., Kumar Mangal, N., and Khosla, A. (2017). “Development of assistive application for patients with communication disability,” in IEEE International Conference on Innovations in Green Energy and Healthcare Technologies—2017, IGEHT 2017, 1–4.

Google Scholar

Pasqualotto, E., Matuz, T., Federici, S., Ruf, C. A., Bartl, M., Belardinelli, M. O., et al. (2015). Usability and workload of access technology for people with severe motor impairment: a comparison of brain-computer interfacing and eye tracking. Neurorehabil. Neural Repair 29, 950–957. doi: 10.1177/1545968315575611

PubMed Abstract | CrossRef Full Text | Google Scholar

Patterson, J. R., and Grabois, M. (1986). Locked-in syndrome: a review of 139 cases. Stroke 17, 758–764. doi: 10.1161/01.str.17.4.758

PubMed Abstract | CrossRef Full Text | Google Scholar

Railo, H., Koivisto, M., and Revonsuo, A. (2011). Tracking the processes behind conscious perception: a review of event-related potential correlates of visual consciousness. Conscious. Cogn. 20, 972–983. doi: 10.1016/j.concog.2011.03.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Rezeika, A., Benda, M., Stawicki, P., Gembler, F., Saboor, A., and Volosyak, I. (2018). Brain-computer interface spellers: a review. Brain Sci. 8:57. doi: 10.3390/brainsci8040057

PubMed Abstract | CrossRef Full Text | Google Scholar

Ron-Angevin, R., Garcia, L., Fernández-Rodríguez, A., Saracco, J., André, J. M., Lespinet-Najib, V., et al. (2019). Impact of speller size on a visual P300 brain-computer interface (BCI) system under two conditions of constraint for eye movement. Comput. Intell. Neurosci. 2019:7876248. doi: 10.1155/2019/7876248

PubMed Abstract | CrossRef Full Text | Google Scholar

Salvaris, M., and Sepulveda, F. (2009). Visual modifications on the P300 speller BCI paradigm. J. Neural Eng. 6:046011. doi: 10.1088/1741-2560/6/4/046011

PubMed Abstract | CrossRef Full Text | Google Scholar

Schalk, G., McFarland, D. J., Hinterberger, T., Birbaumer, N., and Wolpaw, J. R. (2004). BCI2000: a general-purpose brain-computer interface (BCI) system. IEEE Trans. Biomed. Eng. 51, 1034–1043. doi: 10.1109/TBME.2004.827072

PubMed Abstract | CrossRef Full Text | Google Scholar

Sellers, E. W., Krusienski, D. J., McFarland, D. J., Vaughan, T. M., and Wolpaw, J. R. (2006). A P300 event-related potential brain-computer interface (BCI): the effects of matrix size and inter stimulus interval on performance. Biol. Psychol. 73, 242–252. doi: 10.1016/j.biopsycho.2006.04.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Severens, M., Van der Waal, M., Farquhar, J., and Desain, P. (2014). Comparing tactile and visual gaze-independent brain-computer interfaces in patients with amyotrophic lateral sclerosis and healthy users. Clin. Neurophysiol. 125, 2297–2304. doi: 10.1016/j.clinph.2014.03.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Speier, W., Chandravadia, N., Roberts, D., Pendekanti, S., and Pouratian, N. (2017). Online BCI typing using language model classifiers by ALS patients in their homes. Brain Comput. Interfaces 4, 114–121. doi: 10.1080/2326263X.2016.1252143

PubMed Abstract | CrossRef Full Text | Google Scholar

Treder, M. S., and Blankertz, B. (2010). (C)Overt attention and visual speller design in an ERP-based brain-computer interface. Behav. Brain Funct. 6:28. doi: 10.1186/1744-9081-6-28

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, R., Wang, Q., Li, K., He, S., Qin, S., Feng, Z., et al. (2017). A BCI-based environmental control system for patients with severe spinal cord injuries. IEEE Trans. Biomed. Eng. 64, 1959–1971. doi: 10.1109/TBME.2016.2628861

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: amyotrophic lateral sclerosis (ALS), patient, brain-computer interface (BCI), electroencephalography (EEG), P300, speller, size, usability

Citation: Medina-Juliá MT, Fernández-Rodríguez Á, Velasco-Álvarez F and Ron-Angevin R (2020) P300-Based Brain-Computer Interface Speller: Usability Evaluation of Three Speller Sizes by Severely Motor-Disabled Patients. Front. Hum. Neurosci. 14:583358. doi: 10.3389/fnhum.2020.583358

Received: 14 July 2020; Accepted: 17 September 2020;
Published: 29 October 2020.

Edited by:

Bin He, Carnegie Mellon University, United States

Reviewed by:

Yijun Wang, Institute of Semiconductors (CAS), China
Qi Li, Changchun University of Science and Technology, China

Copyright © 2020 Medina-Juliá, Fernández-Rodríguez, Velasco-Álvarez and Ron-Angevin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Francisco Velasco-Álvarez, fvelasco@dte.uma.es

ORIGINAL RESEARCH article

P300-Based Brain-Computer Interface Speller: Usability Evaluation of Three Speller Sizes by Severely Motor-Disabled Patients

Introduction

Materials and Methods

Participants

EEG Recording and Signal Processing

Spelling Paradigms

Procedure

Calibration Task

Online Task

Subjective Questionnaires

Usability Evaluation

Effectiveness

Efficiency

Satisfaction

Statistical Analyses

Results

Effectiveness

Classification Accuracy During the Calibration Phase

Error Performance During Online Spelling

Event-Related Potentials During the Calibration Phase

Efficiency

VAS Fatigue and NASA-TLX

Perception of Subjective Questionnaires

Satisfaction

Discussion

Effectiveness

Efficiency

Satisfaction

Limitations

Conclusions

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Conflict of Interest

Acknowledgments

Footnotes

References

This article is part of the Research Topic

People also looked at