Associations between Caregiver Mental Health, Early Childhood Language Outcomes, and Caregiver Perception Bias: Evidence from Rural China
A
QiJiang1
YiweiQian2✉Email
HanwenZhang3
EvelynZhang3
EveDill4
TianliFeng3
YueMa3,5
ScottRozelle3
1School of Public HealthUniversity of California, BerkeleyBerkeleyCAUSA
2Research Institute of Economics and ManagementSouthwestern University of Finance and EconomicsChengduSichuanChina
3Stanford Center on China’s Economy and InstitutionStanford UniversityStanfordCAUSA
4Department of SociologyStanford University, Stanford UniversityStanfordCAUSA
5Graduate School of EducationStanford University, Stanford UniversityStanfordCAUSA
Qi Jiang1, Yiwei Qian2*, Hanwen Zhang3, Evelyn Zhang3, Eve Dill4, Tianli Feng3, Yue Ma3,5, Scott Rozelle3
1 School of Public Health, University of California, Berkeley, Berkeley, CA, USA
2 Research Institute of Economics and Management, Southwestern University of Finance and Economics, Chengdu, Sichuan, China
3 Stanford Center on China’s Economy and Institution, Stanford University, Stanford, CA, USA
4 Stanford University, Department of Sociology, Stanford University, Stanford, CA, USA
5 Stanford University, Graduate School of Education, Stanford University, Stanford, CA, USA
Corresponding author:
Yiwei Qian
Research Institute of Economics and Management
Southwestern University of Finance and Economics
qianyw@swufe.edu.cn
Abstract
Language development and the home language environment during early childhood are critical for long-term child outcomes. Caregiver mental health may influence early language outcomes directly, but it can also introduce perception bias, which refers to the discrepancies between caregiver self-assessments and the actual status of child language outcomes. This study examines the associations between caregivers mental health symptoms and (i) child language development and home language environment, and (ii) caregiver perception bias in self-report assessments of child language development and home language environment. The stduy recruited 137 rural Chinese households with children aged 16–24 months. Objective measures of child language development and the home language environment were collected using Language Environment Analysis (LENA) technology. Caregiver perception bias were measured by the descrepency between the objective and caregiver self-report measurements. Results show that caregiver anxiety and stress symptoms were linked to poor child language development, while symptoms of depression and anxiety symptoms were associated with less stimulating home language environment.
A
Caregivers with depressive and anxiety symptoms tended to overestimate their children’s language development, and those with depressive symptoms also overestimated their own verbal inputs. These findings call for caution when implementing self-report assessments of early childhood development.
Key Words:
early childhood language development
early childhood language environment
Language Environment Analysis (LENA)
caregiver mental health
rural China
A
A
Introduction
Language development during the first three years of life is a fundamental component of early childhood development that lays the groundwork for future language, cognitive, and learning outcomes. Delays in early language development can have lifelong consequences, such as low education attainment and diminished employment performance [13]. Unfortunately, many children in low- and middle-income countries (LMICs) are at risk of early language delays. In rural China, for example, a recent meta-analysis revealed that 46% of young children failed to reach their language development potential [4]. Growing evidence indicates that poor home language environments, characterized by insufficient verbal interactions between children and their caregivers, is a main predictor of early language developmental delay in LMICs [57]. Identifying the factors that hinder an interactive, stimulating home language environment is crucial for improving early language development in low-income regions.
Caregiver mental health substantially influences child development outcomes, but its specific impact on language development remains unclear. One in five mothers in LMICs reported depressive symptoms in LMICs during the post-partum period [8]. A suvey of 2541 participants suggested that 39% of primary caregivers (mothers and grandmothers) of children under age three living in rural China reported depressive, anxiety, or stress symptoms [9]. A substantial body of research links caregiver depressive symptoms to impaired child cognitive, motor, and social-emotional development, as well as disturbances in early attachment and increased behavior problems [1012]. A Lancet review suggests that depressive symptoms contribute to these developmental adversities by altering the early caregiving environment, particularly the caregiver’s parenting practices and interactions with the child [13]. However, the much smaller body of literature has focused on early childhood language development and other mental health symtpoms beyond depression.
Research on caregiver mental health and early language development is further complicated by the perception bias associated with caregiver mental health symptoms. Perception bias, the discrepancies between caregiver self-assessments and the actual status of child language outcomes, occurs when caregivers experiencing mental health symptoms misinterpret or overlook their own and their children’s behaviors, making distorted assessments. When perception bias results in negative attributions, caregivers could misinterpret typical child behaviors as problematic or deliberately disobedient rather than situationally driven [14, 15]. Caregiver mental health symptoms could also impair their capacity to pay attention to the child [16], thus reduce the accuracy of caregiver-reported measurements. Desipite most widely implemented measurements of child development rely on caregiver self report, few studies have tried to quantify whether caregiver mental health symptoms were associated with their perception bias in caregiver self-assessments of child language development and their home language environment [17].
Recent advances in the Language Environment Analysis (LENA) technology have enabled the recording and analysis of sounds in the child’s surrounding environment and those produced by the child, which allow for an objective measure of child language development and the verbal interactions that form the home language environment. Leveraging LENA data as benchmark objective measurements, the goals of the current study are to a.) investigate the association between caregiver mental health symptoms, child language development, and the home language environment; and b.) examine the association between those symptoms and perception bias in caregiver-reported assessments of child language development and their language home environment.
Methods
Study Location
A
Data were collected from households in two distinct areas (a peri-urban district and two rural counties) in Sichuan Province, China. The peri-urban district is in the suburbs of Chengdu, the provincial capital, and has rapidly transitioned from an agricultural economy into one of the fast-developing areas in the province. Due to rapid urbanization, the district hosts a large population of a.) households with rural backgrounds, including rural migrants, that migrated from rural regions without access to urban public welfare benefits due to their rural rural household registration status, and b.) urbanized rural households, whose farmland in the area had been acquired by the government. In 2019, the average per capita disposable income of rural residents in this district was 30,405 RMB (4,403 USD), much lower than the average per capita income in the Chengdu’s central urban districts (49,193 RMB or 7,613 USD) [18]. The second sampling location was in two counties in a rural, prefecture-level city in Sichuan. The average per capita income for rural residents in this prefecture was 14,670 RMB (2,124 USD), less than half of the average per capita income of urban residents in Sichuan (36,153 RMB or 5,235 USD) [19].
Participants
The participants in the study are children aged 18 to 24 months and their primary caregivers, within the households with rural backgrounds. We focus on toddlers aged 18 to 24 months due to the rapid language development that occurs during this period: Around 18 months of age, many children experience a “vocabulary burst” and start to acquire new words at a much faster rate than before (Goldfield & Reznick, 1990). By this age, they can also distinguish are grammatically correct or incorrect placement of familiar function words (Santelmann & Jusczyk, 1998). By 24 months, the speed and accuracy in spoken word understanding increase dramatically over the second year of life (Fernald et al., 1998). Therefore, any disparity in the home language environment at this stage is likely to have measurable impacts on the child’s language skills.
A
Sampling protocol
A
Due to the geographic and administrative differences between the rural and peri-urban communities, the research team separately sampled participants from the two study locations. To select the peri-urban sample, the research team followed a two-step protocol. First, the research team obtained a list of children between 18 and 24 months of age based on birth records from two local hospitals. Second, households were then randomly selected from the list and trained enumerators screened them for eligibility via phone calls. Households were invited to participate in a survey if their residency status met the legal criteria for rural migrants or urbanized farmers: A household was considered a rural migrant family if a.) the legal residence of either parent was in a rural area outside of Chengdu Municipality; and b.) one parent had lived in the sample district for at least 6 months. To be considered urbanized farmers, a.)
A
the legal residence of either parent must be within the district; b.) all or part of the household’s farmland must have been reclaimed by the government; and c.) any remaining cultivated land owned by the household must be less than 200 square meters per adult family member on average. Following this protocol, a total of 109 households were enrolled in the survey. Of these, 81 consented to the collection of home language environment data (34 rural migrant households, 47 urbanized rural households).
To select the rural sample, the research team followed a three-step protocol. First, all townships in the two sample counties were included in the study, except the townships that housed the county seat, which are typically wealthier and more urbanized than townships In the the rest of the county. This resulted in 11 townships.
A
Second, we obtained a list of households with children aged between 18 and 24 months based on birth records from the township governments, which resulted in a total of 175 households across the 11 townships. Third, we excluded those that were not legally classified as rural residents and enrolled all remaining 109 eligible households in the study. Of these, 77 consented to the collection of home language environment data.
Of the 158 households that completed home language environment data collection, we excluded 21 in which the primary caregiver was not the respondent of the household survey. The final analytic sample includes 137 households: 40 urbanized rural, 27 rural migrant, and 70 rural households. A balance test found no significant differences in demographic information between households that consented to LENA data collection and those that did not [See Apendix Table 1].
Table 1
Descriptive Information among the participants (N = 137)
Panel A: Demographic characteristics
 
Child age (months)
21 (1.6)
Child is male (%)
80 (58%)
Mother is the primary caregiver (%)
80 (58%)
Primary caregiver age (years)
37 (13)
Primary caregiver graduated from high school (%)
74 (50%)
Family asset index
-0.01 (1.3)
Number of adults at home (n)
2.8 (1.0)
Sample Strata
 
Urbanized rural households (%)
40 (29%)
Rural migrant households (%)
27 (20%)
Rural households (%)
70 (51%)
Panel B: Caregiver mental health
 
Caregiver Mental Health Symptoms
 
Depressive symptoms (%)
17 (12%)
Anxiety symptoms (%)
21 (15%)
Stress symptoms (%)
11 (8%)
Panel C: Child language development
 
Recorded Child Language Development
 
Child vocalizations count (CVC, n)
1923 (800)
Caregiver-report Child Language Development
CDI language score
45 (29)
Panel D: Home language environment
 
Recorded Child Language Environment
 
Adult word count (AWC, n)
13,773 (5609)
Conversational turns count (CTC, n)
557 (286)
Caregiver-report Child Language Environment
Read books yesterday (mins)
5.4 (18)
Told story yesterday (mins)
4.8 (11)
Sang songs yesterday (mins)
4.9 (9.3)
Played games yesterday (mins)
28 (39)
Note: Family asset index was calculated using polychromic principal component analysis based on whether households owned or had access to a number of household items such as tap water, computer, internet, and a car. Caregiver mental health was assessed using the Depression, Anxiety, and Stress Scale-21 (DASS-21). CDI refers to the Macarthur-Bates Communicative Development Inventory. CVC, AWC, and CTC were measured using the LENA technogy for 16 hours and then standardized into 12 hours. Caregiver-report language environment was measured using Family Care Index (FCI).
Data Collection
The research team collected data on the home language environment, caregiver mental health, child language development, and demographic characteristics of the household young children and their households. Trained enumerators collected data from all sample households in the summer and fall of 2020 following a standardized four-day protocol. On the first day, enumerators surveyed the primary caregiver of each sample child at their home or the hospital.
A
After the survey, caregivers were instructed on how to record home language environment data with LENA. Caregivers then recorded their home language environment during the second and third days. On the fourth day, enumerators retrieved the recording devices and conducted an exit interview to ensure compliance with the data collection protocol.
Measures
Caregiver mental health
A
To measure caregiver mental health, enumerators administered the Depression, Anxiety, and Stress Scale-21 (DASS-21) to the primary caregiver of the child, identified by each household as the person primarily responsible for the child’s daily care. The DASS-21, a short-form version of the DASS-42 [20], is a self-report questionnaire with three subscales evaluating if the caregiver was at risk for the depression, anxiety, and stress, respectively (and the severity of the risk). Caregivers were asked to rank, on a scale of 0 to 3, how each of 21 statements (seven describing each mental health issue) applied to them in the past week. The score on each subscale is the sum of their ratings for the seven statements multiplied by two. Hence, the score for each subscale could range from 0 to 42, with higher scores indicating more severe symptoms. Following the DASS-21 scoring guidelines, participants were considered experiencing symptoms of mental health if they scored above 9 for depression, 8 for anxiety, and 14 for stress [20]. Notably, while the score cannot be used for clinical diagnosis, studies in multiple countries, including China, have demonstrated the validity of DASS-21 as a measurement of the severity of depression, anxiety, and stress symptoms [21, 22].
Home language environment
The direct-assessment of children’s home language environment used the LENA system. LENA consists of a small recording device worn by the child and the corresponding software that records and analyzes all sounds in the child’s surrounding environment over the course of a day [23]. Previous research has shown LENA to be a valid and reliable measure of language environments in American English [24], Spanish [25], Dutch [26], Korean [27], Vietnamese [28], and Mandarin-speaking populations [29, 30].
Each sample household was given a LENA recording device, a specialized LENA shirt, and a LENA charger. Caregivers were instructed to record 16 hours per day over two days that were representative of what their child typically experienced at home. The recording device was placed in the chest pocket of the LENA shirt which the child wore throughout the day and was removed only when the child bathed or slept at night. Caregivers also was instructed to charge the LENA device overnight between the first and second days of recording.
In this study, the home language environment is measured by two variables automatically generated by the LENA software: The first is the adult word count (AWC), defined as the number of words spoken to or near the child by adults in the recording. The second is conversational turn count (CTC), defined as the number of adult-child alternations in conversation in the recording.
In addition, the caregiver-report home language environment was measured using the four items in the Family Care Indicators (FCI) scale. The FCI is a validated survey instrument developed by UNICEF to evaluate the quality of the home environment and quantify the different types of parenting practices [31].
A
Caregivers reported the time they spent with their child in the previous day on interactive reading, storytelling, singing, and playing. These four parenting practices were selected because they are the most commonly reported measures of stimulating parenting practices for language development in the literature on rural China [4]. Factor analysis also was used to generate a standard index for stimulating parenting practices for language development.
Child language development
The LENA-generated measure on child’s language development is the child vocalization count (CVC). CVC captures the frequency of child-produced utterances, including all sounds, words, vocalizations that are not vegetative noises, cries, or coughs. Notably, researchers have proposed that children’s vocalizations can indicate the quality of the home language environment (Ramírez-Esparza et al., 2014; Werwach et al., 2021) because they are driven by the children’s interactions with other social entities (Moeller et al., 2007) and, as auditory feedback models suggest, hearing one's own voice supports language development (Brainard & Doupe, 2000). Nevertheless, we focus on CVC as an indicator of children’s language development in this study because it has been widely used as a measure of language development (Ramírez-Esparza et al., 2014) and has been shown to be highly predictive of language skills later in life (Y. Wang et al., 2020; Werwach et al., 2021).
The caregiver-report measure on child language skills is the Chinese version of the short-form MacArthur-Bates Communicative Development Inventory (CDI). The CDI measures expressive vocabulary of children aged 16 to 30 months [32], and the short-form CDI has been validated in Mandarin Chinese [33].
A
The primary caregiver was given an inventory of 113 words and asked to identify the words that their child could say.
A
For words the child could say, caregivers were asked to give an example of the context in which the child said the word. The number of words the child could say were then summed to reach a final CDI score which could range between 0 and 113, following CDI scoring guidelines [33].
Caregiver perception bias
Perception bias is measured as the discrepancy between the caregiver-report assessment and the LENA-generated measures of child language development and language environment. Since the LENA measures, CDI, and the parenting practices index are on different scales, direct comparisons between them are not feasible. To enable meaningful within-sample comparisons and account for increases in the raw scores with the child’s age, we internally standardize the scores by removing the effect of age (using the child’s age for child measures and caregiver’s age for caregiver measures) [34]. We compute internal z-scores based on age-specific averages and standard deviations, which are estimated through non-parametric regression. This method reduces sensitivity to outliers and small sample sizes, producing smooth, standardized scores with a mean of zero across all age groups.
Then, we compute differences between the standardized scores to derive perception bias. Specifically, the perception bias in child language development is defined as the difference between the self-reported CDI score and the objectively collected CVC. The perception biases in child language environment are similarly defined as the difference between a parenting practices index computed from self-reported FCI items and AWC, as well as the difference between the parenting practices index and CTC. For the resulting bias scores, a positive score indicates an overestimation of the child’s language development or language environment relative to their LENA outcomes, while a negative score indicates underestimation.
Statistical Analysis
We standardized the LENA measurements, including AWC, CTC, and CVC. The two 16-hour recordings were standardized into 12-hour data to adjust for skewing that is common for count data. We used Chebyshev polynomials transformation to normalize the data: First, we used LASSO regression models to select the final Chebyshev polynomials model for transforming the data. We also predicted residuals with the final model. Then, we estimated the residualized count variables from the transformed data and rescaled them back to the original count metric.
A
We used ordinary least squares (OLS) regression models to examine the associations between caregiver mental health, child language environment, and language development, as well as the association between caregiver mental health and perceptions bias. The models controlled for demographic information and sample strata fixed effects (urbanized rural, rural migrant, or rural), with cluster-robust standard errors estimated by the sample strata.
A
Demographic information including child age, sex, whether mother was the primary caregiver, primary caregiver’s age and education level, family assets, and the number of adults stayed at home were collected through a questionnaire administered by trained enumerators to caregivers.
A
To estimate family assets, caregivers reported whether they owned or had access to a series of household items like tap water, computer, internet, and a car. A family asset index was then calculated using polychromic principal component analysis [35]. Given the small sample size, we used bootstrap resampling with replacement (1,000 replications) to obtain more robust estimates [36].
Results
Summary statistics
The demographic characteristics of the participants are presented in Panel A, Table 1.
A
The average age of the children was over 21 months, 58% of them were male, and 58% were primarily cared for by their mothers.
A
The average age of the primary caregivers was approximately 37 years, and 50% of them (n = 74) had completed high school. Households from the rural counties make up 51% of the sample (n = 70), while another 29% were urbanized rural households (n = 40), and the rest 20% were rural migrant households (n = 27).
Of the sample caregivers, 17 (12%) reported depressive symptoms, 21 (15%) reported anxiety symptoms, and 11 (8%) reported stress symptoms (Panel B). The average CDI score was 45 words (out of 113), and the average CVC was 1,923 vocalizations (Panel C). The average AWC was 13,773 words and the mean CTC was 557 turns. On average, caregivers reported spending 5 minutes reading books, 5 minutes telling stories, 5 minutes singing songs, and 28 minutes playing with their child throughout the day prior to the survey (Panel D).
[insert Table 1 here]
Caregiver mental health symptoms and child language development
The raw distributions of CVC were left-skewed for caregivers who reported depressive, anxiety or stress symptoms, compared to those who did not (Fig. 1), suggesting an unadjusted association between caregiver mental health symptoms and reduced CVC. After adjusting for demographic characteristics and sample status fixed effects (Table 2, Panel A), caregiver anxiety symptoms were significantly associated with lower CVC (
= -0.44, 95% CI [-0.69, -0.18], p < 0.01), and caregiver stress symptoms were significantly associated with both lower CVC (
= -0.75, 95% CI [-1.27, -0.24, p < 0.01) and lower CDI score (
= -0.57, 95% CI [-1.00, -0.15], p < 0.01).
Fig. 1
LENA Child Vocalizations Count (CVC) by Caregiver Mental Health Symptoms
Click here to Correct
[insert Fig. 1 here]
[insert Table 2 here]
Caregiver mental health symptoms and child language environment
The raw distributions of both AWC and CTC were also left-skewed among caregivers who reported depressive, anxiety, or stress symptoms, compared to those who did not (Fig. 2), indicating that, without adjustment, caregiver mental health symptoms were linked to lower AWC and CTC. Controlling for demographic characteristics and sample status fixed effects (Table 2, Panel B), AWC was not significantly associated with any mental health symptoms. In contrast, lower CTC was significantly associated with caregiver depressive symptoms (
= -0.51, 95% CI [-0.91, -0.12], p = 0.011) and anxiety symptoms (
= -0.54, 95% CI [-0.76, -0.31], p < 0.01).
Fig. 2
LENA Child Language Environment by Caregiver Mental Health Symptoms
Click here to Correct
[insert Fig. 2 here]
Caregiver mental health symptoms and perception bias in child language development
The mean of the distributions of caregiver perception bias in child language development were right of zeros for caregivers with depressive and anxiety stymptoms (Fig. 3). After controlling for confounders (Table 2, Panel C), there is a marginally significant association between caregiver depressive symptoms and a positive bias (
= 0.26, 95% CI [-0.02, 0.54], p = 0.064). Caregiver anxiety symptoms were also significantly associated with a positive bias (
= 0.44, 95% CI [0.10, 0.77], p = 0.012). These findings suggest that caregivers experiencing depressive and anxiety symptoms were more likely to overestimate their children’s language development.
Fig. 3
Bias in Child Language Development by Caregiver Mental Health Symptoms
Click here to Correct
[insert Fig. 3 here]
Caregiver mental health symptoms and perception bias in child language environment
The mean of perception bias in child language environment among caregivers were right or zeros for caregivers with depressive, anxiety, or symptoms, suggesting a tendency of overestimation linked to those symptoms (Fig. 4). After adjusting for demographic characteristics and sample status fixed effects (Panel D, Table 2), depressive symptoms were significantly associated with a stronger positive bias using CTC (
= 0.66, 95% CI [0.40, 0.93], p < 0.01), suggesting that caregivers with depressive symptoms were more likely to overestimate their home language environment. No significant associations were observed between anxiety or stress symptoms and perception bias in child home language environment.
Fig. 4
Bias in Child Language Environment by Caregiver Mental Health Symptom
Click here to Correct
[insert Fig. 4 here]
Discussion
Drawing on novel Language Environment Analysis (LENA) data collected from low-resource households with rural backgrounds in China, this study examines the associations between caregiver mental health, the home language environment, and children’s language development. The paper also investigates whether mental health symptoms are associated with the perception bias of caregivers in those outcomes.
As measured by LENA, we found that caregiver anxiety and stress symptoms were significantly associated with lower levels of child language development. Additionally, caregivers with depressive or anxiety symptoms engaged in fewer verbal interactions with their children. These findings align with prior studies using caregiver-report meausres, which have shown that maternal mental health symptoms affect children’s language development in both high-income countries [3739] and LMICs [40]. Caregiver mental health symptoms are associated with reduced engagement in interactive parenting, including numerous studies in rural China [4143]. Our results also support recent studies from high-income countries using LENA, which suggests that caregiver depression and anxiety symptoms were associated with adverse home language environment [39, 44].
Caregiver mental health symptoms were also associated with perception biases in their assessments of their children’s language development and the home language environment. Specifically, caregivers with depressive symptoms were more likely to overestimate their children’s language development. Depressive symptoms often manifest as persistent negative thoughts and cognitions [45], causing caregivers to adopt a more pessimistical view of their children’s language skills and undervalue the developmental milestones they reach. Additionally, the fatigue and low energy associated with depression may reduce caregiver’s awareness of their children’s day-to-day behaviors [16] and lead them to overlook developmental achievements. These factors suggest a tendency toward underestimation—making the observed overestimation unexpected. These results highlight the need for furture research to explore why caregiver depression may be linked overestimation of child langauge development.
A
Caregivers with depressive and anxiety symptoms also tended to overestimate how many verbal interactions they had with their children. While existing studies have documented a general tendency for caregivers to overreport verbal interactions [46], our result suggests that this bias could be more pronounced among those experiencing depressive or anxiety symptoms. One possible explanation is that verbal interactions may demand greater physical, cognitive, and emotional effort for individuals experiencing depression, making the interactions more salient to them and thus more likely to be overestimated [47].
In addition, anxiety symptoms have been linked to heightened concerns on social evaluation and a greater susceptibility of social desirability bias [48], suggesting that caregivers with anxiety symptoms may overreport their children’s language milestones to present the household in a more favorable light. Furthermore, anxiety symptoms are often associated with maintaining positive illusions that lead to overly optimistic assessments of oneself and the world [49]. In this context, caregivers may exaggerate their children’s language skills to mitigate distress about their children’s development.
The perception bias associated with caregiver mental health symptoms are concerning, as most assessments of child development and parenting practices rely on caregiver self-reports [17]. In LMICs, where caregiver mental health symptoms are common, such bias may compromise the accuracy of these measures. Relying on caregiver reports in surveys may introduce bias in estimation of intervention impacts on child development if caregiver mental health changes differentially between treatment and control groups over time. Similarly, using caregiver-reported assessments to screen for developmental delays may fail to identify vulnerable children, particularly those whose caregivers experience depression or anxiety symptoms.
These findings highlight the need for caution when using and interpreting self-report tools in early childhood development research. Whenever possible, researchers should employ direct-assessment tools to measure child outcomes. When self-reports are unavoidable, studies should account for caregiver mental health status during both data collection and analysis periods. Additionally, developing benchmarks that adjust for caregiver mental health symptoms may improve the accuracy of caregiver-reported assessments.
We acknowledge several limitations of this study. First, the construction of perception bias relies on the assumption that non-parametric regressions fully account for the effects of child age. This may not always hold, since residual age-related variation could still influence the standardized scores and introduce bias into our estimates. To mitigate potential confounding, we included child age (in months) as a covariate in our analyses. Second, the measurement of perception bias is based on two specific instruments, which limit the generalizability of our findings to other self-report instruments. Third, although our sample size of 137 is the largest of LENA-based studies worldwide to date, it remains small relative to studies using other assessment approaches. Future research should validate and expand our findings using larger, representative samples to enhance statistical power.
Despite these limitations, the study makes several noteworthy contributions. To the best of our knowledge, this is the first study to examine associations between caregiver mental health and LENA-generated measures of language development and the home language environment in LMICs. It is also among the first to quantify the perception bias in caregiver self-report assessments of early development and to examine the association between this bias and their mental health issues. Despite the exploratory nature of our analysis, our findings provide initial evidence that depressive and anxiety symptoms can distort caregiver assessment of their children’s language skills and the home language environment, calling for greater caution when administering self-report assessments in ECD research.
Conclusion
This study provides novel evidence that caregiver mental health symptoms, particularly depression, anxiety and stress, are significantly associated with both diminished home language environments and child language development. Those symtpoms are also associated with perception biases in caregiver-reported assessments of children’s language development. These results underscore the importance of incorporating caregiver mental health considerations into both the design and interpretation of assessments in early development studies. When direct assessments are not feasible, researchers should adjust for potential reporting biases and consider developing correction benchmarks to enhance the accuracy of caregiver-reported data.
List of abbreviations
AWC
Adult Word Count
CDI
MacArthur–Bates Communicative Development Inventory
CTC
Conversational Turn Count
CVC
Child Vocalization Count
DASS-21
Depression, Anxiety, and Stress Scale–21
FCI
Family Care Indicators
LENA
Language Environment Analysis (LENA) technology
LMICs
Low- and Middle-Income Countries
OLS
Ordinary Least Squares
Declarations
Ethics approval and consent to participate
A
The study was approved by the Stanford University Institutional Review Board (Protocol ID 49552).
A
The study was conducted in accordance with the Declaration of Helsinki.
A
Informed consent was obtained from the legal guardians or parents.
A
Enumerators introduced the study, allowed time for questions, and used a standardized script in both Mandarin and the Sichuanese dialect to obtain consent. Caregivers were made aware that their recordings would be collected and reviewed for research purposes.
Consent for publication
Not applicable
A
Data Availability
Data and analytic code necessary to reproduce the analyses presented in this paper are available from the first author upon reasonable request. The analyses presented here were not preregistered.
A
Funding
The study was supported by an unrestricted gift from the Enlight Foundation, Palo Alto, CA, USA.
A
Author Contribution
Q.J. : Conceptualization, Investigation, Methodology, Software, Formal Analysis, Writing- Original draft preparation, Writing- Reviewing and Editing. Y.Q. : Writing- Reviewing and Editing, Methodology, Formal Analysis. T.F. : Investigation. H.Z. : Writing- Reviewing and Editing. E.Z. : Writing- Original draft preparation. E.D. : Writing- Reviewing and Editing. Y.M. : Conceptualization, Data curation. S.R. : Conceptualization, Writing- Reviewing and Editing.
Acknowledgements
Not applicable.
Electronic Supplementary Material
Below is the link to the electronic supplementary material
References
1.
Attanasio, O. P. The determinants of human capital formation during the early years of life: theory, measurement, and policies. J. Eur. Econ. Assoc. 13, 949–997. https://doi.org/10.1111/jeea.12159 (2015).
2.
Hay, I., Elias, G., Fielding-Barnsley, R., Homel, R. & Freiberg, K. Language Delays, Reading Delays, and Learning Difficulties: Interactive Elements Requiring Multidimensional Programming. J. Learn. Disabil. 40, 400–409. https://doi.org/10.1177/00222194070400050301 (2007).
3.
Wake, M. et al. Improving outcomes of preschool language delay in the community: protocol for the Language for Learning randomised controlled trial. BMC Pediatr. 12, 96. https://doi.org/10.1186/1471-2431-12-96 (2012).
4.
Emmers, D. et al. Early childhood development and parental training interventions in rural China: a systematic review and meta-analysis. BMJ Glob Health. 6, e005578. https://doi.org/10.1136/bmjgh-2021-005578 (2021).
5.
Ece Demir-Lira, Ö., Applebaum, L. R., Goldin‐Meadow, S. & Levine, S. C. Parents’ early book reading to children: Relation to children’s later language and literacy outcomes controlling for other parent language input. Dev. Sci. 22, e12764. https://doi.org/10.1111/desc.12764 (2019).
6.
Nandi, A., Behrman, J. R. & Laxminarayan, R. The Impact of a National Early Childhood Development Program on Future Schooling Attainment: Evidence from Integrated Child Development Services in India. Econ. Dev. Cult. Change. 69, 291–316. https://doi.org/10.1086/703078 (2020).
7.
Romeo, R. R. et al. Beyond the 30-Million-Word Gap: Children’s Conversational Exposure Is Associated With Language-Related Brain Function. Psychol. Sci. 29, 700–710. https://doi.org/10.1177/0956797617742725 (2018).
8.
Fisher, J. et al. Prevalence and determinants of common perinatal mental disorders in women in low- and lower-middle-income countries: a systematic review. Bull. World Health Organ. 90, 139–149. https://doi.org/10.2471/BLT.11.091850 (2012). H.
9.
Zhang, S. et al. Effect of Caregiver’s Mental Health on Early Childhood Development across Different Rural Communities in China. Int. J. Environ. Res. Public. Health. 15, 2341. https://doi.org/10.3390/ijerph15112341 (2018).
10.
Gelaye, B., Rondon, M. B., Araya, R. & Williams, M. A. Epidemiology of maternal depression, risk factors, and child outcomes in low-income and middle-income countries. Lancet Psychiatry. 3, 973–982. https://doi.org/10.1016/S2215-0366(16)30284-X (2016).
11.
Pitchik, H. O., Chung, E. O. & Fernald, L. C. Cross-cultural research on child development and maternal mental health in low- and middle-income countries. Curr. Opin. Behav. Sci. 36, 90–97. https://doi.org/10.1016/j.cobeha.2020.07.013 (2020).
12.
Yue, A. et al. Caregiver Depression and Early Child Development: A Mixed-Methods Study From Rural China. Front. Psychol. 9 December, 1–18. https://doi.org/10.3389/fpsyg.2018.02500 (2018).
13.
Herba, C. M., Glover, V., Ramchandani, P. G. & Rondon, M. B. Maternal depression and mental health in early childhood: an examination of underlying mechanisms in low-income and middle-income countries. Lancet Psychiatry. 3, 983–992. https://doi.org/10.1016/S2215-0366(16)30148-1 (2016).
14.
Bugental, D. B. & Grusec, J. E. Socialization Processes. In: Damon W, Lerner RM, editors. Handbook of Child Psychology. 1st edition. Wiley; (2007). https://doi.org/10.1002/9780470147658.chpsy0307
15.
Azar, S. T., Miller, E. A., Stevenson, M. T. & Johnson, D. R. Social Cognition, Child Neglect, and Child Injury Risk: The Contribution of Maternal Social Information Processing to Maladaptive Injury Prevention Beliefs Within a High-Risk Sample. J. Pediatr. Psychol. 42, 759–767. https://doi.org/10.1093/jpepsy/jsw067 (2017).
16.
World Health Organization. Depressive disorder (depression). Depressive disorder (depression). (2025). https://www.who.int/news-room/fact-sheets/detail/depression. Accessed 24 Apr 2025.
17.
Bliznashka, L. et al. Psychometric properties of early childhood development assessment tools in low- and middle-income countries: a systematic review. BMJ Open. 15, e096365. https://doi.org/10.1136/bmjopen-2024-096365 (2025).
18.
Chengdu Bureau of Statistics. 2019年龙泉驿区国民经济和社会发展统计公报 [2019 People’s Economic and Social Development Statistical Report of Longquanyi District, Chengdu Municipality]. (2020).
19.
National Bureau of Statistics of China. China Statistical Yearbook. 2020. (2020). http://www.stats.gov.cn/tjsj/ndsj/2020/indexeh.htm. Accessed 3 Aug 2021.
20.
Lovibond, S. H. & Lovibond, P. F. Psychology Foundation of Australia. Manual for the depression anxiety stress scales. 2nd edition. Sydney, N.S.W.: Psychology Foundation of Australia; (1995).
21.
Chan, R. C. K. et al. Extending the utility of the Depression Anxiety Stress scale by examining its psychometric properties in Chinese settings. Psychiatry Res. 200, 879–883. https://doi.org/10.1016/j.psychres.2012.06.041 (2012).
22.
Wang, K. et al. Cross-cultural validation of the Depression Anxiety Stress Scale–21 in China. Psychol. Assess. 28, e88–100. https://doi.org/10.1037/pas0000207 (2016).
23.
Ford, M., Baer, C., Xu, D., Yapanel, U. & Gray, S. The LENA Language Environmental Analysis System: Audio Specifications of the DP-0121. (2008).
24.
Xu, D., Yapanel, U. & Gray, S. Reliability of the LENA Language Environemnt Analysis System in Young Children’s Natural Home Environment. (2009).
25.
Weisleder, A. & Fernald, A. Talking to Children Matters: Early Language Experience Strengthens Processing and Builds Vocabulary. Psychol. Sci. 24, 2143–2152. https://doi.org/10.1177/0956797613488145 (2013).
26.
Busch, T., Sangen, A., Vanpoucke, F. & van Wieringen, A. Correlation and agreement between Language ENvironment Analysis (lena) and manual transcription for Dutch natural language recordings. Behav. Res. Methods. 50, 1921–1932. https://doi.org/10.3758/s13428-017-0960-0 (2018).
27.
Pae, S. et al. Effects of feedback on parent–child language with infants and toddlers in Korea. First Lang. 36, 549–569. https://doi.org/10.1177/0142723716649273 (2016).
28.
Ganek, H. V. & Eriks-Brophy, A. A Concise Protocol for the Validation of Language ENvironment Analysis (LENA) Conversational Turn Counts in Vietnamese. Commun. Disord Q. 39, 371–380. https://doi.org/10.1177/1525740117705094 (2017).
29.
Gilkerson, J. et al. Evaluating Language Environment Analysis System Performance for Chinese: A Pilot Study in Shanghai. J. Speech Lang. Hear. Res. 58, 445–452. https://doi.org/10.1044/2015_JSLHR-L-14-0014 (2015).
30.
Zhang, Y. et al. Effects of Quantitative Linguistic Feedback to Caregivers of Young Children: A Pilot Study in China. Commun. Disord Q. 37, 16–24. https://doi.org/10.1177/1525740115575771 (2015).
31.
Kariger, P. et al. Indicators of family care for development for use in multicountry surveys. J. Health Popul. Nutr. 30, 472–486. https://doi.org/10.3329/jhpn.v30i4.13417 (2012).
32.
Fenson, L. et al. MacArthur-Bates Communicative Development Inventories. (2007).
33.
Tardif, T. & Fletcher, P. Chinese communicative development inventories: user’s guide and manual (Peking University Medical Press, 2008).
34.
Attanasio, O., Cattan, S., Fitzsimons, E., Meghir, C. & Rubio-Codina, M. Estimating the Production Function for Human Capital: Results from a Randomized Controlled Trial in Colombia. Am. Econ. Rev. 110, 48–85. https://doi.org/10.1257/aer.20150183 (2020).
35.
Kolenikov, S. & Angeles, G. Socioeconomic Status Measurement with Discrete Proxy Variables: Is Principal Component Analysis a Reliable Answer? Rev. Income Wealth. 55, 128–165. https://doi.org/10.1111/j.1475-4991.2008.00309.x (2009).
36.
Mooney, C. Z., Duval, R. D., Duvall, R. & Bootstrapping A Nonparametric Approach to Statistical Inference. SAGE; (1993).
37.
Paulson, J. F., Keefe, H. A. & Leiferman, J. A. Early parental depression and child language development. J. Child. Psychol. Psychiatry. 50, 254–262. https://doi.org/10.1111/j.1469-7610.2008.01973.x (2009).
38.
Zajicek-Farber, M. L. The Contributions of Parenting and Postnatal Depression on Emergent Language of Children in Low-Income Families. J. Child. Fam Stud. 19, 257–269. https://doi.org/10.1007/s10826-009-9293-7 (2010).
39.
Brookman, R. et al. Depression and Anxiety in the Postnatal Period: An Examination of Infants’ Home Language Environment, Vocalizations, and Expressive Language Abilities. Child. Dev. 91, e1211–e1230. https://doi.org/10.1111/cdev.13421 (2020).
40.
Quevedo, L. A. et al. The impact of maternal post-partum depression on the language development of children at 12 months. Child. Care Health Dev. 38, 420–424. https://doi.org/10.1111/j.1365-2214.2011.01251.x (2012).
41.
Jie, Y., Qi, J., Ai, Y., Lei, T. & Jingchun, N. Effects of Caregiver Depression on Parenting Behaviors in Poor Rural China. 华东师范大学学报(教育科学版). ;37:97–106. (2019). https://doi.org/10.16382/j.cnki.1000-5560.2019.03.008
42.
Zhang, S. et al. Effect of Caregiver’s Mental Health on Early Childhood Development across Different Rural Communities in China. Int. J. Environ. Res. Public. Health. 15, 2341. https://doi.org/10.3390/ijerph15112341 (2018).
43.
Zhong, J. et al. Interrelationships of caregiver mental health, parenting practices, and child development in rural China. Child. Youth Serv. Rev. 121, 105855. https://doi.org/10.1016/j.childyouth.2020.105855 (2021).
44.
Treat, A. E., Morris, A. S., Hays-Grudo, J. & Williamson, A. C. The impact of positive parenting behaviors and maternal depression on the features of young children’s home language environments. J. Child. Lang. 47, 382–400. https://doi.org/10.1017/S030500091900062X (2020).
45.
Beck, A. T. et al. Cognitive Therapy of Depression (Guilford, 2024).
46.
Canault, M., Le Normand, M-T., Foudil, S., Loundon, N. & Thai-Van, H. Reliability of the Language ENvironment Analysis system (LENA) in European French. Behav. Res. Methods. 48, 1109–1124. https://doi.org/10.3758/s13428-015-0634-8 (2016).
47.
Horne, S. J., Topp, T. E. & Quigley, L. Depression and the willingness to expend cognitive and physical effort for rewards: A systematic review. Clin. Psychol. Rev. 88, 102065. https://doi.org/10.1016/j.cpr.2021.102065 (2021).
48.
Edwards, A. L. Social desirability and the description of others. J. Abnorm. Soc. Psychol. 59, 434–436. https://doi.org/10.1037/h0041442 (1959).
49.
Taylor, S. E., Collins, R. L., Skokan, L. A. & Aspinwall, L. G. Maintaining Positive Illusions in the Face of Negative Information: Getting the Facts Without Letting Them Get to You. J. Soc. Clin. Psychol. 8, 114–129 (1989).
Tables
Table 2. Regressions of Caregiver Mental Health Problems on Child Language Development, Language Environment, and Caregiver Reporting Bias (N = 137)
 
 
Panel A
Language Development
Panel B
Language Environment
Panel C
Perception Bias in Language Development
Panel D
Perception Bias in Language Environment
 
 
CVC
CDI
AWC
CTC
FCI
Using CVC
Using AWC
Using CTC
Depressive Symptoms
-0.38
-0.12
-0.24
-0.51**
0.15
0.26+
0.39
0.66***
(0.30)
(0.33)
(0.26)
(0.20)
(0.20)
(0.14)
(0.25)
(0.14)
Anxiety Symptoms
-0.44***
0.00
-0.20
-0.54***
0.10
0.44**
0.30
0.63
(0.13)
(0.19)
(0.20)
(0.12)
(0.52)
(0.17)
(0.35)
(0.43)
Stress Symptoms
-0.75***
-0.57***
0.03
-0.62
-0.36
0.18
-0.39
0.26
(0.26)
(0.22)
(0.33)
(0.39)
(0.24)
(0.17)
(0.24)
(0.29)
Note: (1)+p < 0.10; **p < 0.05; ***p < 0.01. (2) Each row and column represent independent regressions. All regressions control for maternal age, maternal education, child age, child sex, the mother being the primary caregiver, and the family asset index. Regressions are bootstrapped 1,000 times, with cluster-robust standard errors estimated at the sample source level (urbanized, migrant, or rural). (3) CVC refers to LENA Child Vocalization Count, CDI refers to the Macarthur-Bates Communicative Development Inventory, AWC refers to LENA Adult Word Count, CTC refers to LEAN Conversational Turns Count, and FCI refers to the Family Care Indicator.
Total words in MS: 5119
Total words in Title: 17
Total words in Abstract: 196
Total Keyword count: 5
Total Images in MS: 4
Total Tables in MS: 2
Total Reference count: 49