Skip to main navigation Skip to search Skip to main content

A Mispronunciation-Based Voice-Omics Representation Framework for Screening Specific Language Impairments in Children

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

This paper introduces an innovative end-to-end (E2E) framework for screening Specific Language Impairment (SLI) in children, centralizing phoneme-level mispronunciation (PLM) detection to enhance the precision and reliability. We have developed a unique voice-omics representation that translates PLM predictions into symbolic sequences, yielding significant phenotyping biomarkers that provide objective and quantifiable assessments of children's speech patterns. Through meticulous fine-tuning of the Connectionist Temporal Classification (CTC) model on the L2-ARCTIC dataset and rigorous five-fold cross-validation, our E2E models have demonstrated remarkable ac-curacy, with Area Under the Curve (AUC) values exceeding 0.71 and a notable recall rate of up to 71.5 % on the CHILDES dataset. Our approach signifies a substantial advancement in SLI screening, leveraging cutting-edge technology to capture the complexities of spontaneous speech in children.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 12th International Conference on Healthcare Informatics, ICHI 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages294-304
Number of pages11
ISBN (Electronic)9798350383737
DOIs
StatePublished - 2024
Event12th IEEE International Conference on Healthcare Informatics, ICHI 2024 - Orlando, United States
Duration: Jun 3 2024Jun 6 2024

Publication series

NameProceedings - 2024 IEEE 12th International Conference on Healthcare Informatics, ICHI 2024

Conference

Conference12th IEEE International Conference on Healthcare Informatics, ICHI 2024
Country/TerritoryUnited States
CityOrlando
Period06/3/2406/6/24

Keywords

  • Connectionist Temporal Classification Model
  • Phenotyping Biomarkers
  • Phoneme-level Mispronunciation Detection
  • SLI Screening
  • Symbolic Sequence

Fingerprint

Dive into the research topics of 'A Mispronunciation-Based Voice-Omics Representation Framework for Screening Specific Language Impairments in Children'. Together they form a unique fingerprint.

Cite this