Title: An eye-tracking study on text accessibility and comprehension in university students

SergioNavas-León1✉

JonAndoniDuñabeitia1Emailjduñabeitia@nebrija.es

SergioNavas1Emailsnavas@nebrija.es

JonAndoni1

1Centro de Investigación Nebrija en Cognición (CINC), Department of EducationUniversidad Nebrija28043MadridSpain

Authors: Sergio Navas-León^1*, Jon Andoni Duñabeitia¹

Affiliations:

¹ Centro de Investigación Nebrija en Cognición (CINC), Department of Education, Universidad Nebrija, 28043 Madrid, Spain (snavas@nebrija.es; jduñabeitia@nebrija.es).

*Correspondence: Sergio Navas-León. Centro de Investigación Nebrija en Cognición (CINC), Department of Education, Universidad Nebrija, 28043 Madrid, Spain Email: snavas@nebrija.es

ORCIDs:

Sergio Navas León: https://orcid.org/0000-0002-0889-8252

Jon Andoni Duñabeitia: https://orcid.org/0000-0002-3312-8559

Statements and Declarations

Author Contribution

SNL: Data curation, Formal analysis, Investigation, Writing - Original draft, Writing - Review & Editing, Methodology; JA: Conceptualization, Investigation, Funding acquisition, Supervision, Writing - Review & Editing, Resources, Methodology, Project administration, Software.

Data Availability

The datasets used and analysed during the current study are available from the corresponding author on reasonable request.

Acknowledgement

Not applicable.

Conflict of interests

The authors have no conflict to declare.

Ethical approval

This study was approved by the Ethics Committee of the Universidad Nebrija (Code: UNNE-2022-0017). Data collection in this study followed the ethical standards laid down in the Declaration of Helsinki of 1964 and its subsequent amendments.

Funding

This publication is supported by the research project “Accesibilidad lingüística en la web: investigación psicolingüística para la implementación de la Lectura Fácil” funded by Fundación Ramón Areces, where Jon Andoni Duñabeitia is the Principal Investigator.

Informed Consent

All participants provided informed consent prior to participation.

Abstract

Accessible reading is a crucial aspect of both social equity and digital education, highlighted by legislation such as the European Accessibility Act and the Americans with Disabilities Act. In this regard, Easy-to-Read (E2R) recommendations aim to enhance text comprehension, particularly for individuals with cognitive or linguistic needs, but evidence is still limited regarding whether these adaptations also benefit the general population. This study investigated the effect of different text formats on accessibility and reading comprehension among Spanish university students, using eye-tracking technology. Twenty-four young participants without cognitive disabilities read texts under three conditions: Hard-to-Read (serif font, justified text, high contrast); Control (sans-serif font, left-aligned, standard spacing); and Easy-to-Read (greater spacing, optimised contrast, icons for information organisation). Comprehension was assessed through multiple-choice questions, and eye-tracking data were collected. The results revealed no significant differences in comprehension between conditions, although E2R scores tended to be higher. Eye-tracking showed that Hard-to-Read texts required longer reading times and more fixations, indicating greater cognitive effort, while the E2R format led to shorter fixations and larger saccades, suggesting more efficient visual processing. Overall, E2R modifications, though originally developed for individuals with cognitive needs, may also enhance accessibility for the young population.

Keywords:

Accessibility

Adult

Cognition

Education

Reading

University

1. Introduction

Around 67% of the global population—approximately 5.4 billion people—regularly use internet-based technologies, most in highly developed countries. With this rapid expansion, reading has become central to online activity and essential for accessing information (Leu et al., 2015). Concerns remain about the accessibility of digital content, not only for cognitively vulnerable groups (e.g., individuals with disabilities, older adults) but also for the general population (Droutsas et al., 2025; Van Der Geest & Velleman, 2014; Henni et al., 2022; Mason et al., 2021). Since online reading often involves critical domains such as health and e-government, accessibility is increasingly viewed as an equity and rights issue, as reflected in the European Accessibility Act (European Parliament & Council of the European Union, 2016) and Article 21 of the UN Convention on the Rights of Persons with Disabilities (United Nations, 2006). 15% of Europeans present dyslexia or learning difficulties, and 45.3% of Spaniards over 85 show cognitive impairment (Observatorio Nebrija del Español, 2024). In this context, Easy-to-Read (hereafter, E2R) has emerged as a key strategy for ensuring accessible communication (Fărcașiu et al., 2023).

E2R recommendations include using plain language (e.g., high-frequency words), simple syntactic structures (e.g., avoiding subordinate clauses), visual aids (e.g., pictograms, icons), and accessible design features (e.g., sans-serif fonts, left-aligned text) (González-Sordé & Matamala, 2024). Although research on E2R is limited, it is a growing topic with increasing institutional support (Fărcașiu et al., 2023; Inclusion Europe, 2019). Complementary efforts have focused on digital adaptations of accessible content.

In this regard, the Web Content Accessibility Guidelines (WCAG) produced by the World Wide Web Consortium (W3C) are the primary worldwide technical standard. These principles, adopted into the European standard EN 301549 and the Spanish UNE 153101:2018 EX (Asociación Española de Normalización, 2022), define criteria for ensuring that digital material is perceivable, operable, understandable, and robust (W3C, 2022).

These recommendations are based on empirical data from human–computer interaction and psycholinguistics. However, it is especially important to investigate how specific typographic elements influence reading comprehension. While most psycholinguistic research has concentrated on linguistic characteristics (e.g., word frequency, word length), the importance of visual and spatial features in word recognition and reading fluency has gotten less attention (Moret-Tatay & Perea, 2011). The function of typography in text accessibility is a paradigmatic example. In this respect, research has consistently shown that sans-serif fonts are preferable to serif fonts. According to empirical research, serifs do not help visual word identification; rather, their presence may impede lexical access (Moret-Tatay & Perea, 2011). This situation is especially true for people with visual or cognitive disabilities, as sans-serif fonts are viewed as cleaner and more legible on screens due to their reduced letterforms, which minimise visual clutter and improve character differentiation (British Dyslexia Association, 2012). For example, Bernard et al. (2001) investigated the impact of font type on the reading duration of online text among older adults. The study revealed that older adults read serif typefaces significantly slower than sans serif ones. Similarly, Russell-Minda et al. (2007) discovered in their review that the strongest evidence in the study literature suggests that sans-serif fonts (i.e., Arial, Helvetica, Verdana, or Adsans) are generally more legible than serif fonts such as Times New Roman, particularly for readers with impaired vision.

In addition to font type, other typographic features such as font size and line spacing are also crucial for text accessibility. Research on text-heavy webpages (i.e., Wikipedia) has shown that larger font sizes (18 pt or more) combined with default line spacing (1.5) can significantly improve both readability and comprehension (Rello et al., 2013). Complementing these findings, Rello and Marcos (2012) employed an eye-tracking study to compare five different line spacing values (0.8, 1.0, 1.2, 1.4, and 1.8). Their results indicated a clear preference for 1.4-line spacing, which was associated with shorter fixation times, suggesting more efficient reading processes.

Text alignment also plays a role in readability. Ling and van Schaik (2007) conducted a study to compare the effects of left-aligned and justified text on visual search performance and user preferences while reading web pages. The findings revealed that left-aligned text produced much higher accuracy and faster reaction times than justified text. The findings imply that when developing web material for efficient reading or information retrieval, using left-aligned text provides measurable benefits to user performance.

Another example of how minimal modifications can affect reading fluency is letter spacing. Perea and Gomez (2012) showed that modest increases in interletter spacing (+ 1.0 and + 1.5) reduced fixation durations during sentence reading, suggesting that wider spacing facilitates early word processing and fluency. Similarly, Perea et al. (2012) found that slightly increased spacing (+ 1.2) enhanced word recognition in adults, young readers, and especially children with dyslexia. However, in natural text reading with skilled adults, Perea et al. (2016) observed shorter fixations but more of them, yielding no overall differences in reading time or comprehension. Later work indicated that reading speed generally improved with greater spacing, particularly for slower readers, with benefits plateauing at around 30–40% extra spacing. Dick (2017) identified an optimal range between 0.12 and 0.15 times the font size, which informed the WCAG 2.1 guideline of 0.16 for word spacing and twice the font size for paragraph spacing (W3C, 2022).

Contrast is also important for readability. WCAG 2.1 requires a minimum contrast ratio of 3:1 to guarantee that text is legible for users with moderately low vision in ordinary lighting situations. This threshold is established by both the ISO 9241-3 standard (Ergonomic requirements for office work with visual display terminals - Part 3: Visual display requirements) and the ANSI/HFS 100–1988 standard.

More specifically, research in Human-Computer Interaction (HCI) has long shown that black text on a white backdrop provides the best legibility for most digital reading tasks (Mithun et al., 2019). However, using pure black text against a pure white backdrop can create eye discomfort for some users, particularly those with dyslexia (Uccula et al., 2014). This condition may potentially affect the broader population. For example, Conlon and Sanders (2011) studied the reading performance of university students with and without visual discomfort and discovered that those who were uncomfortable had lower reading comprehension when reading black-on-white text. The authors attributed their findings to the strong visual contrast, which can cause perceptual distortions and physical symptoms such as eye strain and headaches. In recognition of these challenges, the European Commission, for example, implemented a more accessible colour scheme on its websites, with EC Grey (#404040) as the default font colour on a white background to reduce visual strain while maintaining adequate contrast and readability (European Commission, 2025).

Rivero-Contreras et al. (2021) investigated the impact of E2R recommendations, particularly visual help and lexical simplification, on sentence processing in people with and without dyslexia. They used eye-tracking to investigate text- and word-level reading behaviour in 20 young adults with dyslexia and 20 age-matched controls. Participants read 60 sentences with varying word frequencies and associated pictures. The results indicated that both visual help and lexical simplification aided sentence processing, particularly lexical-semantic access. In a follow-up study, Rivero-Contreras et al. (2023) assessed the effects of visual support and lexical simplification on sentence processing in adult readers (aged 16–58) from various educational backgrounds. Thirty university-educated people and thirty adults with lower education levels read 60 phrases with various word frequencies and associated visuals. Eye-tracking data demonstrated that both visual help and lexical simplification aided sentence processing in both groups, with non-university individuals reading substantially slower overall. The findings illustrate how E2R adjustments might aid understanding and offer practical applications for improving readability in varied adult populations.

However, some research on healthy people reveals that improving fluency through simpler text presentations may not always result in better learning. For example, some research among university students has demonstrated that presenting texts in difficult-to-read fonts, such as Haettenschweiler, or low-quality photocopies resulted in better learning outcomes than easier-to-read fonts, such as Arial (Diemand-Yauman, Oppenheimer, and Vaughan, 2011). Some studies have found null effects for font size (i.e., words printed in Arial 48 pt. serving as the fluent condition vs. words printed in Arial 18 pt. serving as the disfluent condition; resulting in similar text recall) or font type (i.e., similar lexical processing for words with diacritics vs. similar lexical processing for " ãmîgô" [friend] compared with "amigo") (Duñabeitia et al., 2023).

Similarly, research in groups with disabled people has highlighted serious questions about the practical efficacy of E2R materials. Hurtado et al. (2014) discovered that E2R formats did not consistently improve comprehension among people with intellectual disabilities, with some participants—particularly those with milder impairments—confused due to the integration of visuals and text. Sutherland and Isherwood (2016) conducted a systematic review that emphasised the variety in efficacy among people with intellectual impairments, highlighting the importance of individualised approaches rather than depending on uniform formats.

1.1 The present study

Given the above, some problems exist in putting accessibility requirements into practice. Research on E2R for individuals with disabilities is scarce or inconsistent (Chinn & Homeyard, 2017; Fajardo et al., 2014; González-Sordé & Matamala, 2024; Rivero-Contreras & Saldaña, 2020; Sutherland & Isherwood, 2016). Implementation is challenging due to limitations such as lack of definition and standardisation (Miniukovich et al., 2017; Rivero-Contreras & Saldaña, 2020). Together, these findings described above expose a key gap in the literature: texts labelled as E2R are not always that E2R.

While E2R guidelines aim to enhance clarity and accessibility, their practical application may not consistently lead to improved text comprehension. Furthermore, empirical evidence on E2R for nondisabled populations remains scarce (Ekin et al., 2025; Schmutz, Sonderegger, & Sauer, 2016; 2017; 2019).

Thus, the aim of the present study is to investigate the effectiveness of the E2R format in young, non-disabled Spanish university students, thereby testing whether recommendations originally designed for people with disabilities also confer benefits to the general population. In methodological terms, we designed an experiment comparing three types of text presentation—Hard-to-Read, Control, and E2R—each combining multiple formatting features (i.e., font type, size, contrast, interletter spacing, alignment) as they typically co-occur in real-world materials. We measured reading comprehension using a custom-designed Likert-type scale, which evaluated participants' ability to recall and understand the text presented after viewing all three experimental conditions. We measured reading time for each condition. Additionally, we used eye-tracking during the reading task. Eye-tracking is a powerful method for investigating the cognitive mechanisms involved in text processing (Scaltritti et al., 2019). It enabled us to collect detailed data on participants' eye movements, including fixations (i.e., periods when the eyes remain relatively still, typically indicating focused processing) and saccades (i.e., rapid movements between fixations). The number and duration of both fixations and saccades are commonly associated with cognitive load and text difficulty, which include shorter saccade amplitudes, more frequent fixations, and longer fixation durations (Id et al., 2018; Palinko et al., 2010; Wang et al., 2024; Zagerman et al., 2016).

Based on this design, we formulated two confirmatory hypotheses: (H1) participants would obtain higher comprehension scores and shorter reading times in the E2R condition compared to the Hard-to-Read and Control conditions, or at least outcomes equivalent to the Control; and (H2) participants would exhibit fewer and shorter fixations, together with fewer but larger saccades under the E2R condition, reflecting more efficient processing.

2. Materials and methods

The study was conducted in accordance with the Declaration of Helsinki and was approved by the ethics committee of Universidad Nebrija (approval code: UNNE-2020-008).

Written informed consent was obtained from all participants at the start of the experiment, and a small financial compensation (€12) was provided a few days after participation.

2.1 Participants

The study's participants were students from Universidad Nebrija in Madrid, Spain, who volunteered and were unaware of its purpose. Twenty-four students, aged between 18 and 31 years (M = 23.74, SD = 4.15), participated in the experiment, including 19 women.

Recruitment took place between April and July 2025 through the laboratory’s internal platform, drawing participants from a pool of individuals registered to take part in experiments. Inclusion criteria were: (a) normal or corrected-to-normal visual acuity, (b) being a young adult aged between 18 and 35 years, and (c) scoring within the normal z-score range on the Cognitive Assessment Battery (CAB™) PRO (CogniFit Inc., San Francisco, CA, USA; https://www.cognifit.com/cab), to rule out cognitive impairment or disability.

Exclusion criteria included: (a) self-reported history of visual and ocular pathology, neurological disorders, or significant brain injury; (b) reported history of mental illness; (c) current use of recreational drugs, whether natural or synthetic; (d) incomplete data collection or unsuccessful eye-tracking system calibration; and (e) lack of comprehension of Spanish.

For comprehension scores, a post hoc sensitivity analysis was conducted using G*Power (version 3.1.9.7; Faul et al., 2007) to estimate the minimum detectable effect size given the final sample size. The analysis was based on a repeated-measures ANOVA with three conditions (within-subjects), assuming an alpha level of .05, statistical power of .80, correlation among repeated measures of 0.5, and nonsphericity correction ε = 1. With a total sample size of 24 participants, the analysis indicated that the design had sufficient power (1 − β = .80) to detect an effect size of f = 0.267 (equivalent to η²p ≈ .067, a medium effect, Cohen, 1992).

For eye movement measures, power simulations were conducted using the simr package in R (Green & MacLeod, 2016) to estimate theoretical statistical power based on the structure of the fitted linear mixed-effects models (LMM). For each dependent variable, models retained the original fixed effect and included random intercepts per participant. The effect of interest was set to β = 0.20, and power was estimated across increasing sample sizes using the powerCurve () function with 50 Monte Carlo simulations per level. Results indicated that with 24 participants and 45 repeated observations per participant (1,080 data points), the models reached between 80% and 100% power across all outcome variables.

Therefore, within the current design, the final sample was deemed adequate to detect small-to-moderate-sized effects in both behavioural and eye-tracking outcomes.

2.2 Stimuli

Three fictional countries were created: Tórmez, Norvia, and Veldina. Each country was associated with 15 stimuli covering various topics, including population and demographic trends, history, economy, politics, energy, education system, tourism, language, religions, cuisine, natural landscapes, healthcare system, infrastructure, climate, and geography. These stimuli represented hypothetical scenarios designed to simulate realistic conditions in each country, resulting in a total of 45 stimuli.

To ensure lexical consistency, word frequency was verified using EsPal, a Spanish lexical frequency database (Duchon et al., 2013). This ensured that the vocabulary employed across the three countries was comparable. The analysis focused on word frequency per million words, prioritising commonly used terms in both formal and everyday Spanish. Functional words (prepositions, articles, conjunctions, and other grammatical markers) were excluded.

An iterative process involving the authors was carried out to progressively reduce the complexity of the content. This process included the draughting of multiple versions, with each iteration focusing on sentence structure, readability, and textual clarity. During these revisions, complex sentences were simplified, jargon was reduced, and content was adapted for accessibility to a broad audience. Simplification strategies included shortening lengthy sentences, minimising the use of adverbs and superlatives, and replacing low-frequency words with simpler synonyms.

The materials were designed using Microsoft PowerPoint (Microsoft Corporation, 2024), and each slide was exported as a JPG image file for presentation during the experimental session. Figure 1 provides an illustrative example of each of the experimental conditions.

In the Hard-to-Read-to-read condition, a serif font (Lucida Calligraphy) was used with a font size of 18, fully justified text without paragraph breaks, 1.5 line spacing, default word spacing, and a high contrast ratio (Background: #EA953B; Font colour: #01020b). In the Control condition, a sans-serif font (Lucida Sans) was used with a font size of 18, fully justified text without paragraph breaks, 1.5 line spacing, default word spacing, and an adequate contrast ratio (Background: #000000; Font colour: #255,255,255). In the E2R condition, a sans-serif font (Lucida Sans) was used with a font size of 18, paragraph spacing twice the font size, left-aligned text, 1.5 line spacing, word spacing of at least 0.16 times the font size, and an optimised contrast ratio (Background: #000000; Font colour: #404040). Additionally, this condition included using color pictograms to present information.

Figure 1

This is an example of the three experimental conditions: panel A corresponds to the Hard-to-Read condition, panel B to the Control condition, and panel C to the E2R condition

A)
B)
C)

2.3 Measures

Sociodemographic data. Information was collected on participants’ sex assigned at birth and chronological age.

Cognitive assessment (Cognitive Assessment Battery, CAB PRO; CogniFit Inc., San Francisco, US; https://www.cognifit.com/cab). This is a 30–35-minute online neuropsychological tool that assesses various cognitive abilities and overall cognitive wellbeing. It includes a screening questionnaire and cognitive tasks presented as mental games. The battery evaluates five core domains—Perception, Coordination, Attention, Reasoning, and Memory—through 17 brief tests, each lasting 2–3 minutes. Each cognitive function receives an objective score compared to a normative database, which is then converted into z-scores and percentiles.

Reading task. Fifteen stimuli were presented per experimental condition, resulting in a total of 45 stimuli. A counterbalancing design was implemented using six lists that rotated the pairing between country (Country1, Country2, Country3) and condition (Hard-to-Read1, Control2, E2R3), to control for order effects. Participants were randomly assigned to one of the lists using sequences generated in Microsoft Excel (Microsoft Corporation, 2024). Prior to the main task, participants completed a practice block consisting of three trials. Figure 2 illustrates a sample of the experimental sequence.

Knowledge questionnaire. This questionnaire, designed using Microsoft Forms, assessed content comprehension through 30 questions per country, totalling 90. The order of country presentations, question items, and answer options was randomised. Each question offered four options, one of which was correct. Two incorrect options were based on information from the other countries to increase task difficulty. The fourth option was designed to be context-independent and aimed to evaluate plausibility, enabling the detection of inconsistent or unrealistic choices. Comprehension scores were calculated by summing the number of correct responses on the administered questionnaire. Higher scores reflected more profound understanding of the material.

Eye-tracking measures. Data were collected on reading time, fixations, and saccadic movements.

Fig. 2

Example of a trial sequence from the reading task

2.4 Procedure

The experimental session lasted approximately one hour, with no time limits imposed at any stage (see Fig. 3).

Before starting, participants were informed that the study concerned learning, that their participation was voluntary, and that they could withdraw at any time. The experimenter briefly explained the procedure and indicated that written instructions would be provided at each stage.

The session began with two tasks: a sociodemographic questionnaire, followed by the Cognitive Assessment Battery (CAB) (see Measures section above). The eye-tracker was then calibrated and validated. Only participants who successfully completed this process proceeded to the experimental phase. During calibration, they were asked to fixate on black dots until they disappeared and to keep their heads as still as possible.

Next, participants were told that the task involved reading a text, progressing by pressing the spacebar, with no possibility of going back. They were instructed to pay close attention, as their understanding would be tested afterwards.

After the reading phase, participants completed a Flanker task (Eriksen & Eriksen, 1974) (a cognitive control task in which they indicate the direction of a central arrow while ignoring surrounding distractor arrows). Subsequently, they completed a multiple-choice comprehension test on a tablet device (Lenovo TB X306F) based on the texts read during the eye-tracking session.

At the end of the session, participants were debriefed about the true purpose of the study, given a summary information sheet, and thanked for their participation, thereby concluding the experiment.

Fig. 3

Schematic representation of the experimental procedure and event sequence for participants

2.5 Data Acquisition

Participants were seated in a room with constant lighting and temperature, facing a 27″, 165 Hz LED monitor (Lenovo Legion Y27h-30) with a resolution of 1920 × 1080 pixels. A height-adjustable chinrest was used to maintain head stability and a fixed viewing distance of 98 cm from the screen.

Eye movements were recorded using the EyeLink Portable Duo system (SR Research, Ontario, Canada), which tracked both eyes at a sampling rate of 500 Hz using the dark pupil–corneal reflection method. The following thresholds were applied: a saccade velocity of 30°/s, acceleration of 8000°/s², and a minimum movement threshold of 0.15°. A random nine-point automatic calibration was performed on a white background prior to the task, and drift correction was applied during the experiment if needed.

Reading time data were extracted using the Experiment Builder software (SR Research, Ontario, Canada), which recorded the time in milliseconds (ms) that each participant spent reading each text, from stimulus onset until any keypress.

Eye-tracking data were pre-processed to ensure accuracy and consistency. Fixations were identified using the I-DT algorithm with a minimum duration threshold of 100 ms, and saccades were detected using a velocity threshold of ≥ 30°/s. Saccades shorter than 2° were excluded (Holmqvist et al., 2011). Mean values were then calculated per participant and experimental condition (Hard-to-Read, Control, E2R). Outliers were defined as values exceeding ± 2.5 standard deviations from the mean and were removed. The data were imported into JASP (JASP Team, 2025, version 0.95.1) for statistical analysis. Practice trials were excluded.

All analyses were conducted by one researcher and independently verified by another. Data were securely stored on an encrypted university server, anonymised using unique participant codes, and protected by password access.

2.6 Data Analysis

Reading comprehension was analysed using a repeated-measures ANOVA with condition (Hard-to-Read, Control, E2R) as a within-subject factor. Statistical significance was set at p < .05, and effect sizes were reported using partial eta squared (η²p), interpreted according to conventional thresholds (Cohen, 1992).

Eye movement measures were analysed using linear mixed models (LMMs) via the lme4 package (v. 1.1.37; Bates et al., 2015) in R (v. 4.2.3; R Core Team, 2018). For each dependent variable (reading time, fixation count, average fixation duration, and average saccade amplitude), the model included condition as a fixed effect and participant ID as a random intercept to account for inter-individual variability. Models were estimated using maximum likelihood with Laplace approximation.

Pairwise comparisons between conditions were explored using the emmeans package (v.1.10.1; Lenth, 2018), with Bonferroni correction. Model fit was assessed using marginal and conditional R² values and the Akaike Information Criterion (AIC), with lower AIC values indicating better fit. Model diagnostics were performed with the DHARMa package (v.0.3.2.0; Hartig & Hartig, 2017), using simulated residuals to check for dispersion, zero inflation, and autocorrelation.

All frequentist statistical analyses were conducted in JASP (JASP Team, 2025, version 0.95.1). LMMs for eye-tracking data pre-processing, model fitting, and residual diagnostics were performed in R. The significance threshold was set at p < .05 for all tests.

3. Results

3.1 Reading Comprehension

A repeated-measures ANOVA was conducted to assess the effect of condition (Hard-to-Read, Control, E2R) on comprehension scores. The analysis revealed no statistically significant differences between conditions, F (2, 44) = 1.126, p = .333, η²p = .049. Descriptive statistics showed a trend towards higher comprehension scores from the Hard-to-Read (M = 17.83, SD = 7.23), to the Control (M = 18.48, SD = 6.73), and E2R conditions (M = 19.91, SD = 6.71), although these differences were not significant.

3.2 Eye Movement Measures

After data filtering, the proportion of missing values was below 1% for all variables.

For reading time, a significant effect of condition was found. Post hoc comparisons revealed significant differences between Hard-to-Read and Control (p < .0001), and Hard-to-Read and E2R (p = .020), with no significant difference between Control and E2R (p = .885). The marginal R² was .007 and the conditional R² was .474. Mean reading times were higher in the Hard-to-Read condition (M = 11,221 ms, SD = 3.159) compared to Control (M = 10,336 ms, SD = 3.059) and E2R (M = 10,582 ms, SD = 3.051).

For fixation count, a significant effect of condition was also found. Post hoc contrasts showed significant differences between Hard-to-Read and Control (p < .0001), and Hard-to-Read and E2R (p = .030), with a marginal difference between Control and E2R (p = .065). The marginal R² was .013 and the conditional R² was .396. On average, participants made more fixations in the Hard-to-Read condition (M = 45.0, SD = 12.5) than in Control (M = 40.4, SD = 11.6) and E2R (M = 42.5, SD = 11.5).

For average fixation duration, a significant effect of condition was observed. Post hoc comparisons confirmed significant differences between Hard-to-Read and E2R (p < .0001), and Control and E2R (p < .0001), but not between Hard-to-Read and Control (p = .444). The marginal R² was .029 and the conditional R² was .386. Average fixation durations were slightly higher in the Hard-to-Read (M = 216 ms, SD = 29.4) and Control (M = 219 ms, SD = 30.1) conditions than in E2R (M = 205 ms, SD = 27.8).

Regarding average saccade amplitude, a significant effect of condition was also found. Post hoc contrasts revealed differences between Hard-to-Read and E2R (p < .0001) and Control and E2R (p < .0001), with no significant difference between Hard-to-Read and Control (p = .512). The marginal R² was .016 and the conditional R² was .672. Saccade amplitudes were similar in the Hard-to-Read (M = 3.34°, SD = 0.37) and Control (M = 3.30°, SD = 0.38) conditions, increasing in the E2R condition (M = 3.49°, SD = 0.38).

Model diagnostics using DHARMa for the average fixation duration model showed no issues with dispersion (p = .944) or zero inflation (p = 1), supporting the model’s adequacy.

4. Discussion

Previous research has highlighted an ongoing debate regarding the design and actual impact of E2R recommendations. Although these adaptations are intended to support individuals with cognitive vulnerabilities, empirical evidence supporting their effectiveness—particularly in terms of reading comprehension—remains mixed. For example, a recent systematic review by Rivero-Contreras and Saldaña (2020) found that most studies evaluating E2R materials did not report significant improvements in comprehension and that their methodological quality was often limited. Moreover, the evidence for the general population is lacking (i.e., Schmutz et al., 2016; 2019). Thus, the aim of the present study was to evaluate the effectiveness of E2R recommendations on reading comprehension and visual processing efficiency among nondisabled Spanish-speaking young adults from the general population. In this regard, two hypotheses were proposed. The first (H1) predicted that participants in the E2R condition would achieve higher comprehension scores and shorter reading times than those in the Control and Hard-to-Read conditions, or at least perform comparably to the Control condition. The second (H2) predicted that participants in the E2R condition would exhibit fewer and shorter fixations and fewer but larger saccades than those in the Control and Hard-to-Read conditions, or at least perform comparably to the Control condition.

H1 showed no significant differences between conditions, but participants achieved slightly higher accuracy in the E2R format. This aligns with Schmutz et al. (2016, 2019), who also reported improved comprehension with simplified content in nondisabled users. However, unlike Schmutz et al. (2019), who did not observe differences in reading time, our results showed significantly shorter reading times in the E2R condition compared to the Hard-to-Read format. This discrepancy may be due to methodological differences: the present study included a control condition and focused on typographic and structural presentation, whereas Schmutz and colleagues manipulated alternative accessibility features (e.g., form labelling, navigation order).

Regarding H2, results from eye-tracking measures suggest that the E2R condition facilitated more efficient reading. Reading efficiency—defined as the ability to extract meaning from text with minimal perceptual and cognitive load (Liu et al., 2022) is typically characterised by shorter fixation durations and longer saccades, both of which were observed in the E2R condition relative to the others. Fixation duration, an indicator of cognitive load during word recognition, was significantly lower in E2R. Similarly, saccade amplitude was greater in E2R, consistent with broader attentional spans and fluent reading. However, this finding should be interpreted with caution, as larger saccades may partly reflect increased paragraph spacing in the E2R layout, rather than purely perceptual facilitation. Finally, the number of fixations was slightly reduced in E2R compared to the Hard-to-Read condition, although this trend did not reach statistical significance.

In practical terms, these results supplement the broader discussion on accessibility and usability in digital environments. The results suggest that E2R recommendations may offer processing advantages even for nondisabled adults, without compromising comprehension. From a practical standpoint, these results may advocate for the adoption of a large-scale implementation of E2R criteria in the design of public communications, educational materials, and digital content, particularly in cognitively demanding contexts. By ensuring that content is both accessible and efficient to process, E2R adaptations may facilitate more inclusive communication strategies that align with legal and normative frameworks for accessibility.

Nonetheless, some limitations should be considered. First, the sample was relatively homogeneous, which may limit the generalisability of the results to broader or more diverse populations, such as older adults or individuals with cognitive disabilities. Second, the materials were short, fictional texts presented in a controlled setting, which may constrain the ecological validity of the findings. Lastly, although some outcome variables revealed clear effects, others yielded more modest differences that might benefit from increased measurement sensitivity. Future research should replicate these findings with larger samples as well as complementary measures of user experience (e.g., acceptability ratings) and cognitive effort (e.g., pupillometry).

5. Conclusions

This study examined the impact of E2R text formatting on reading comprehension and visual processing efficiency in nondisabled Spanish-speaking adults. While comprehension scores did not differ significantly across conditions, participants performed slightly better in the E2R format. Eye-tracking data revealed that E2R texts may facilitate more efficient visual processing, as reflected by shorter fixation durations and longer saccades—key indicators of reduced cognitive load and fluent reading. These findings suggest that E2R adaptations in terms of text formatting, originally developed for populations with cognitive vulnerabilities, may also benefit the general population by promoting easier and more efficient access to written information.

Acknowledgments

References

ANSI/HFS: American National Standard for Human Factors Engineering of Visual Display Terminal Workstations (ANSI/HFS 100–1988, Section 6, pp. 17–20). (1988)

Asociación Española de Normalización (UNE): UNE 153101:2018 EX: Lectura Fácil. Pautas y recomendaciones para la elaboración de documentos. UNE (2018)

Bates, D., Maechler, M., Bolker, B., Walker, S., Christensen, R.H.B., Singmann, H., Bolker, M.B.: Package ‘lme4’. Convergence. 12(1), 2 (2015). https://doi.org/10.32614/CRAN.package.lme4

Bernard, M., Liao, C.H., Mills, M.: The effects of font type and size on the legibility and reading time of online text by older adults. In CHI ’01 Extended Abstracts on Human Factors in Computing Systems (pp. 175–176). ACM. (2001). https://doi.org/10.1145/634067.634173

British Dyslexia Association: Dyslexia style guide, January 2012. (2012). http://www.bdadyslexia.org.uk/

Chinn, D., Homeyard, C.: Easy read and accessible information for people with intellectual disabilities: Is it worth it? A meta-narrative literature review. Health Expect. 20(6), 1189–1200 (2017). https://doi.org/10.1111/hex.12520

Cohen, J.: A power primer. Psychol. Bull. 112(1), 155–159 (1992). https://doi.org/10.1037/0033-2909.112.1.155

Conlon, E., Sanders, M.: The reading rate and comprehension of adults with impaired reading skills or visual discomfort. J. Res. Reading. 34(2), 193–214 (2011). https://doi.org/10.1111/j.1467-9817.2009.01421.x

Dick, W.: Discussion on text spacing. W3C Mailing List Archives. (2017)., June 15 https://lists.w3.org/Archives/Public/public-low-vision-a11y-tf/2017Jun/0047.html

Diemand-Yauman, C., Oppenheimer, D.M., Vaughan, E.B.: Fortune favors the bold (and the italicized): Effects of disfluency on educational outcomes. Cognition. 118, 111–115 (2011). https://doi.org/10.1016/j.cognition.2010.09.012

Droutsas, N., Spyridonis, F., Daylamani-Zad, D., Ghinea, G.: Web accessibility barriers and their cross-disability impact in eSystems: A scoping review. Comput. Stand. Interfaces. 92, 103923 (2025). https://doi.org/10.1016/j.csi.2024.103923

Duchon, A., Perea, M., Sebastián-Gallés, N., Martí, A., Carreiras, M.: EsPal: One-stop shopping for Spanish word properties. Behav. Res. Methods. 45, 1246–1258 (2013). https://doi.org/10.3758/s13428-013-0326-1

Duñabeitia, J.A., Perea, M., Labusch, M.: Rëâdīńg wõrdš wîth ōrńåmêńtš: Is there a cost? Front. Psychol. 14, 1168471 (2023). https://doi.org/10.3389/fpsyg.2023.1168471

Ekin, M., Krejtz, K., Duarte, C., Pereira, L.S., Marcus-Quinn, A., Krejtz, I.: Impact of web accessibility on cognitive engagement in individuals without disabilities: Evidence from a psychophysiological study. PLOS ONE. 20(7), e0328552 (2025). https://doi.org/10.1371/journal.pone.0328552

Typography – Guidelines – ECL v4. Europa Component European Commission, Library: (2025). https://ec.europa.eu/component-library/v1.15.0/eu/components/detail/eu-style-typography-font/

European Parliament & Council of the European Union: Directive (EU) 2016/2102 of the European Parliament and of the Council of 26 October 2016 on the accessibility of the websites and mobile applications of public sector bodies. Official J. Eur. Union (2016). https://eur-lex.europa.eu/eli/dir/2016/2102/oj

Fajardo, I., Ávila, V., Ferrer, A., Tavares, G., Gómez, M., Hernández, A.: Easy-to‐read texts for students with intellectual disability: Linguistic factors affecting comprehension. J. Appl. Res. Intellect. Disabil. 27(3), 212–225 (2014). https://doi.org/10.1111/jar.12065

Fărcașiu, M.A., Gherheș, V., Șimon, S., Dejica-Carțiș, D., Cădariu, L., Kilyeni, A.: Easy-to-Read: Evolution and perspectives. A bibliometric analysis of research, 1978–2021. Int. J. Environ. Res. Public Health. 20(4), 3359 (2023). https://doi.org/10.3390/ijerph20043359

Faul, F., Erdfelder, E., Lang, A.-G., Buchner, A.: G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav. Res. Methods. 39(2), 175–191 (2007). https://doi.org/10.3758/BF03193146

González-Sordé, M., Matamala, A.: Empirical evaluation of Easy Language recommendations: A systematic literature review from journal research in Catalan, English, and Spanish. Univ. Access Inf. Soc. 23(3), 1369–1387 (2024). https://doi.org/10.1007/s10209-023-00975-2

Green, P., MacLeod, C.J.: SIMR: An R package for power analysis of generalized linear mixed models by simulation. Methods Ecol. Evol. 7(4), 493–498 (2016). https://doi.org/10.1111/2041-210X.12504

Hartig, F., Hartig, M.F.: Package ‘dharma’. R package, 531–532. (2017). https://cran.r-project.org/web/packages/DHARMa/index.html

Henni, S.H., Maurud, S., Fuglerud, K.S., Moen, A.: The experiences, needs and barriers of people with impairments related to usability and accessibility of digital health solutions. BMC Public. Health. 22(1), 93 (2022). https://doi.org/10.1186/s12889-021-12393-1

Holmqvist, K., Nyström, M., Andersson, R., Dewhurst, R., Jarodzka, H., Van de Weijer, J.: Eye tracking: A comprehensive guide to methods and measures. Oxford University Press (2011)

Hurtado, B., Jones, L., Burniston, F.: Is easy read information really easier to read? J. Intellect. Disabil. Res. 58(9), 822–829 (2014). https://doi.org/10.1111/jir.12097

Id, K.K., Duchowski, A.T., Niedzielska, A., Biele, C.: Eye tracking cognitive load using pupil diameter and microsaccades with fixed gaze. PLOS ONE. 13(9), e0203629 (2018). https://doi.org/10.1371/journal.pone.0203629

Inclusion Europe: Best practice for inclusive education. (2019). https://www.inclusion-europe.eu/wp-content/uploads/2019/01/Best-Practice-Education_EN-FINALWEB.pdf

International Telecommunication Union (ITU): Measuring digital development: Facts and Figs. 2024. ITU. (2024). https://www.itu.int/itu-d/reports/statistics/facts-figures-2024/

ISO: (n.d.). ISO 9241-3, Ergonomic requirements for office work with visual display terminals (VDTs) – Part 3: Visual display requirements. Amendment 1

JASP Team: JASP (Version 0.95.1) [Computer software]. (2025)

Lenth, R., Lenth, M.R.: Package ‘lsmeans’. Am. Stat. 34(4), 216–221 (2018). https://rdrr.io/cran/emmeans/

Leu, D.J., Forzani, E., Rhoads, C., Maykel, C., Kennedy, C., Timbrell, N.: The new literacies of online research and comprehension: Rethinking the reading achievement gap. Reading Res. Q. 50(1), 37–59 (2015). https://doi.org/10.1002/rrq.85

Ling, J., van Schaik, P.: The influence of line spacing and text alignment on visual search of web pages. Displays. 28(2), 60–67 (2007). https://doi.org/10.1016/j.displa.2007.04.003

Liu, J.C., Li, K.A., Yeh, S.L., Chien, S.Y.: Assessing perceptual load and cognitive load by fixation-related information of eye movements. Sensors. 22(3), 1187 (2022). https://doi.org/10.3390/s22031187

Mason, A.M., Compton, J., Bhati, S.: Disabilities and the digital divide: Assessing web accessibility, readability, and mobility of popular health websites. J. Health Communication. 26(10), 667–674 (2021). https://doi.org/10.1080/10810730.2021.1987591

Microsoft Corporation: Microsoft Learn. https://learn.microsoft.com (2024)

Miniukovich, A., De Angeli, A., Sulpizio, S., Venuti, P.: Design guidelines for web readability. In DIS 2017 – Proceedings of the 2017 ACM Conference on Designing Interactive Systems (pp. 285–296). ACM. (2017). https://doi.org/10.1145/3064663.3064711

Mithun, A., Bakar, Z.A., Yafooz, W.M.S.: The impact of web contents color contrast on human psychology in the lens of HCI. Int. J. Inform. Technol. Comput. Sci. 11(10), 27–33 (2019). https://doi.org/10.5815/ijitcs.2019.10.04

Moret-Tatay, C., Perea, M.: Do serifs provide an advantage in the recognition of written words? J. Cogn. Psychol. 23(5), 619–624 (2011). https://doi.org/10.1080/20445911.2011.546781

Observatorio Nebrija del Español: Accesibilidad web en español y Lectura Fácil. (2024). https://www.nebrija.com/catedras/observatorio-nebrija-espanol/pdf/accesibilidad-web-espanol-lectura-facil-v03.pdf

Palinko, O., Kun, A.L., Shyrokov, A., Heeman, P.: Estimating cognitive load using remote eye tracking in a driving simulator. In Proceedings of the 2010 Symposium on Eye-Tracking Research & Applications (pp. 141–144). ACM. (2010)., March https://doi.org/10.1145/1743666.1743701

Perea, M., Gomez, P.: Increasing interletter spacing facilitates encoding of words. Psychon. Bull. Rev. 19(2), 332–338 (2012). https://doi.org/10.3758/s13423-011-0214-6

Perea, M., Gomez, P.: Subtle increases in interletter spacing facilitate the encoding of words during normal reading. PLOS ONE. 7(10), e47568 (2012). https://doi.org/10.1371/journal.pone.0047568

Perea, M., Giner, L., Marcet, A., Gomez, P.: Does extra interletter spacing help text reading in skilled adult readers? Span. J. Psychol. 19, E28 (2016). https://doi.org/10.1017/sjp.2016.28

R Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing (2018)

Rello, L., Marcos, M.C.: An eye tracking study on text customization for user performance and preference. In Proceedings of the 2012 8th Latin American Web Congress (LA-WEB 2012) (pp. 64–70). IEEE. (2012). https://doi.org/10.1109/LA-WEB.2012.13

Rello, L., Pielot, M., Marcos, M.C., Carlini, R., May:). Size matters (spacing not) 18 points for a dyslexic-friendly Wikipedia. In Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility (pp. 1–4). (2013). https://doi.org/10.1145/2461121.2461125

Rivero-Contreras, M., Saldaña, D.: ¿Legibilidad es sinónimo de comprensión en Lectura Fácil? Una revisión. In A. Díez & R. Gutiérrez (Coords.), Lectura y dificultades lectoras en el siglo XXI (pp. 714–728). Octaedro. (2020)

Rivero-Contreras, M., Engelhardt, P.E., Saldaña, D.: An experimental eye-tracking study of text adaptation for readers with dyslexia: Effects of visual support and word frequency. Ann. Dyslexia. 71(1), 170–187 (2021). https://doi.org/10.1007/s11881-021-00217-1

Rivero-Contreras, M., Engelhardt, P.E., Saldaña, D.: Do easy-to-read adaptations really facilitate sentence processing for adults with a lower level of education? An experimental eye-tracking study. Learn. Instruction. 84, 101731 (2023). https://doi.org/10.1016/j.learninstruc.2022.101731

Russell-Minda, E., Jutai, J.W., Strong, J.G., Campbell, K.A., Gold, D., Pretty, L., Wilmot, L.: The legibility of typefaces for readers with low vision: A research review. J. Visual Impairment Blindness. 101(7), 402–415 (2007). https://files.eric.ed.gov/fulltext/EJ772089.pdf

Scaltritti, M., Miniukovich, A., Venuti, P., Job, R., De Angeli, A., Sulpizio, S.: Investigating effects of typographic variables on webpage reading through eye movements. Sci. Rep. 9(1), 1–12 (2019). https://doi.org/10.1038/s41598-019-49051-x

Schmutz, S., Sonderegger, A., Sauer, J.: Implementing recommendations from web accessibility guidelines: Would they also provide benefits to nondisabled users? Hum. Factors. 58(4), 611–629 (2016). https://doi.org/10.1177/0018720816640962

Schmutz, S., Sonderegger, A., Sauer, J.: Implementing recommendations from web accessibility guidelines: A comparative study of nondisabled users and users with visual impairments. Hum. Factors. 59(6), 956–972 (2017). https://doi.org/10.1177/0018720817708397

Schmutz, S., Sonderegger, A., Sauer, J.: Easy-to-read language in disability-friendly websites: Effects on nondisabled users. Appl. Ergon. 74, 97–106 (2019). https://doi.org/10.1016/j.apergo.2018.08.023

Sutherland, R.J., Isherwood, T.: The evidence for Easy-Read for people with intellectual disabilities: A systematic literature review. J. Policy Pract. Intellect. Disabil. 13(4), 297–310 (2016). https://doi.org/10.1111/jppi.12201

United Nations: Convention on the Rights of Persons with Disabilities. (2006). https://www.un.org/disabilities/documents/convention/convoptprot-s.pdf

Van Der Geest, T., Velleman, E.: Easy-to-read meets accessible web in the e-government context. Procedia Comput. Sci. 27, 327–333 (2014). https://doi.org/10.1016/j.procs.2014.02.036

Wang, A., Huang, C., Wang, J., He, D.: The association between physiological and eye-tracking metrics and cognitive load in drivers: A meta-analysis. Transp. Res. Part. F: Traffic Psychol. Behav. 104, 474–487 (2024). https://doi.org/10.1016/j.trf.2024.06.014

World Wide Web Consortium (W3C): W3C accessibility standards overview: Accessibility guidelines. (2022). https://www.w3.org/WAI/standards-guidelines/#guidelines

Zagermann, J., Pfeil, U., Reiterer, H.: Measuring cognitive load using eye tracking technology in visual computing. In Proceedings of the Sixth Workshop on Beyond Time and Errors on Novel Evaluation Methods for Visualization (pp. 78–85). ACM. (2016)., October https://doi.org/10.1145/2993901.2993908

Yes