TY - JOUR
T1 - Data harmonization framework for neonatal hypoxic-ischemic encephalopathy studies
AU - behalf of Consortium Of MRI Biomarkers In Neonatal Encephalopathy (COMBINE)
AU - Hsiao, Chuan Heng
AU - Foster, Anna N.
AU - Mcdonald, Scott A.
AU - Vyas, Rutvi
AU - Ashraf, Aseelah
AU - Bao, Rina
AU - Tran, Lena
AU - Kesri, Ankush
AU - Darzidehkalani, Erfan
AU - Soldatelli, Matheus D.
AU - Auman, Jeanette O.
AU - Soul, Janet S.
AU - Chalak, Lina F.
AU - Cotten, C. Michael
AU - Shankaran, Seetha
AU - Laptook, Abbot R.
AU - Grant, P. Ellen
AU - Ou, Yangming
AU - Walsh, Michele C.
AU - Eichenwald, Eric C.
AU - Truog, William E.
AU - Merhar, Stephanie L.
AU - Poindexter, Brenda L.B.
AU - He, Lili
AU - Cotten, C. Michael
AU - Hamrick, Shannon E.G.
AU - Sokol, Gregory M.
AU - Bates, Sara V.
AU - Sanchez, Pablo J.
AU - Shepherd, Edward G.
AU - Ho, Mai Lan
AU - Auman, Jeanette O.
AU - Van Meurs, Krisa P.
AU - Hintz, Susan R.
AU - Bonifacio, Sonia L.
AU - Frantz, Ivan D.
AU - Ambalavanan, Namasivayam
AU - Bell, Edward F.
AU - McNamara, Patrick
AU - Watterberg, Kristi
AU - D Angio, Carl T.
AU - Tyson, Jon E.
AU - Green, Charles E.
AU - Yoder, Bradley A.
AU - Shankaran, Seetha
AU - Chawla, Sanjay
AU - Caplan, Michael S.
AU - Farooq, Osman
AU - Reynolds, Anne Marie
AU - Hartley-McAndrew, Michele
N1 - Publisher Copyright: © 2025 The Author(s).
PY - 2025/10/1
Y1 - 2025/10/1
N2 - Objectives To develop a data harmonization framework for neonatal hypoxic-ischemic encephalopathy (HIE) studies and demonstrate its suitability for prognostic biomarker development. Materials and Methods Variables were first categorized by chronological stages and then by medical topics. We created a dictionary to harmonize variable names and value coding. We began by merging comprehensive data from 2 landmark nationwide therapeutic hypothermia for HIE trials (2008-2016, 21 sites) in the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) Neonatal Research Network (NRN). The 2 datasets differ in available variables, variable naming and coding, necessitating harmonization. To demonstrate the utility of this data harmonization framework, we computed the distributions of variables and ranked them by the strength of associations with 18- to 22-month outcomes. Associations were measured using Pearson's correlation analysis. Outcomes were defined as (a) a 5-class variable: survivors with normal, mild, moderate, severe disability, or death and (b) the Bayley-III Scales. Results We created a dictionary of 1181 variables on 532 patients across 5 chronologic categories and 60 medical subcategories. The distribution of major predictive and outcome variables, and the variables strongly associated with neurodevelopmental outcomes at 18-22 months were presented. The modified Sarnat scores at the Post-intervention and NICU-discharge stage, and the NRN pattern of MRI injury score showed strong associations with outcome variables. Conclusion We designed a data harmonization framework specifically for HIE. Our initial effort in merging 2 iconic clinical trials shows strong predictor-outcome associations, allowing subsequent development of advanced prognostic biomarkers of neonatal HIE.
AB - Objectives To develop a data harmonization framework for neonatal hypoxic-ischemic encephalopathy (HIE) studies and demonstrate its suitability for prognostic biomarker development. Materials and Methods Variables were first categorized by chronological stages and then by medical topics. We created a dictionary to harmonize variable names and value coding. We began by merging comprehensive data from 2 landmark nationwide therapeutic hypothermia for HIE trials (2008-2016, 21 sites) in the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) Neonatal Research Network (NRN). The 2 datasets differ in available variables, variable naming and coding, necessitating harmonization. To demonstrate the utility of this data harmonization framework, we computed the distributions of variables and ranked them by the strength of associations with 18- to 22-month outcomes. Associations were measured using Pearson's correlation analysis. Outcomes were defined as (a) a 5-class variable: survivors with normal, mild, moderate, severe disability, or death and (b) the Bayley-III Scales. Results We created a dictionary of 1181 variables on 532 patients across 5 chronologic categories and 60 medical subcategories. The distribution of major predictive and outcome variables, and the variables strongly associated with neurodevelopmental outcomes at 18-22 months were presented. The modified Sarnat scores at the Post-intervention and NICU-discharge stage, and the NRN pattern of MRI injury score showed strong associations with outcome variables. Conclusion We designed a data harmonization framework specifically for HIE. Our initial effort in merging 2 iconic clinical trials shows strong predictor-outcome associations, allowing subsequent development of advanced prognostic biomarkers of neonatal HIE.
KW - biomarker development
KW - common data elements
KW - data harmonization framework
KW - hypoxic-ischemic encephalopathy
UR - https://www.scopus.com/pages/publications/105015429725
U2 - 10.1093/jamiaopen/ooaf086
DO - 10.1093/jamiaopen/ooaf086
M3 - Article
SN - 2574-2531
VL - 8
JO - JAMIA Open
JF - JAMIA Open
IS - 5
M1 - ooaf086
ER -