Establishing the reliability of metrics extracted from long-form recordings using LENA and the ACLEW pipeline
Cristia, Alejandrina; Gautheron, Lucas; Zhang, Zixing; Schuller, Björn; Scaff, Camila; Rowland, Caroline; Räsänen, Okko; Peurey, Loann; Lavechin, Marvin; Havard, William; Fausey, Caitlin M.; Cychosz, Margaret; Bergelson, Elika; Anderson, Heather; Al Futaisi, Najla; Soderstrom, Melanie (2024)
Cristia, Alejandrina
Gautheron, Lucas
Zhang, Zixing
Schuller, Björn
Scaff, Camila
Rowland, Caroline
Räsänen, Okko
Peurey, Loann
Lavechin, Marvin
Havard, William
Fausey, Caitlin M.
Cychosz, Margaret
Bergelson, Elika
Anderson, Heather
Al Futaisi, Najla
Soderstrom, Melanie
2024
BEHAVIOR RESEARCH METHODS
This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202504103582
https://urn.fi/URN:NBN:fi:tuni-202504103582
Kuvaus
Peer reviewed
Tiivistelmä
Long-form audio recordings are increasingly used to study individual variation, group differences, and many other topics in theoretical and applied fields of developmental science, particularly for the description of children’s language input (typically speech from adults) and children’s language output (ranging from babble to sentences). The proprietary LENA software has been available for over a decade, and with it, users have come to rely on derived metrics like adult word count (AWC) and child vocalization counts (CVC), which have also more recently been derived using an open-source alternative, the ACLEW pipeline. Yet, there is relatively little work assessing the reliability of long-form metrics in terms of the stability of individual differences across time. Filling this gap, we analyzed eight spoken-language datasets: four from North American English-learning infants, and one each from British English-, French-, American English-/Spanish-, and Quechua-/Spanish-learning infants. The audio data were analyzed using two types of processing software: LENA and the ACLEW open-source pipeline. When all corpora were included, we found relatively low to moderate reliability (across multiple recordings, intraclass correlation coefficient attributed to the child identity [Child ICC], was < 50% for most metrics). There were few differences between the two pipelines. Exploratory analyses suggested some differences as a function of child age and corpora. These findings suggest that, while reliability is likely sufficient for various group-level analyses, caution is needed when using either LENA or ACLEW tools to study individual variation. We also encourage improvement of extant tools, specifically targeting accurate measurement of individual variation.
Kokoelmat
- TUNICRIS-julkaisut [22159]
