Skip to main navigation Skip to search Skip to main content

St. Jude cloud: A pediatric cancer genomic data-sharing ecosystem

  • Clay McLeod
  • , Alexander M. Gout
  • , Xin Zhou
  • , Andrew Thrasher
  • , Delaram Rahbarinia
  • , Samuel W. Brady
  • , Michael Macias
  • , Kirby Birch
  • , David Finkelstein
  • , Jobin Sunny
  • , Rahul Mudunuri
  • , Brent A. Orr
  • , Madison Treadway
  • , Bob Davidson
  • , Tracy K. Ard
  • , Arthur Chiao
  • , Andrew Swistak
  • , Stephanie Wiggins
  • , Scott Foy
  • , Jian Wang
  • Edgar Sioson, Shuoguo Wang, J. Robert Michael, Yu Liu, Xiaotu Ma, Aman Patel, Michael N. Edmonson, Mark R. Wilkinson, Andrew M. Frantz, Ti Cheng Chang, Liqing Tian, Shaohua Lei, S. M.Ashiqul Islam, Christopher Meyer, Naina Thangaraj, Pamella Tater, Vijay Kandali, Singer Ma, Tuan Nguyen, Omar Serang, Irina McGuire, Nedra Robison, Darrell Gentry, Xing Tang, Lance E. Palmer, Gang Wu, Ed Suh, Leigh Tanner, James McMurry, Matthew Lear, Alberto S. Pappo, Zhaoming Wang, Carmen L. Wilson, Yong Cheng, Soheil Meshinchi, Ludmil B. Alexandrov, Mitchell J. Weiss, Gregory T. Armstrong, Leslie L. Robison, Yutaka Yasui, Kim E. Nichols, David W. Ellison, Chaitanya Bangur, Charles G. Mullighan, Suzanne J. Baker, Michael A. Dyer, Geralyn Miller, Scott Newman, Michael Rusch, Richard Daly, Keith Perry, James R. Downing, Jinghui Zhang
  • St. Jude Children Research Hospital
  • Microsoft USA
  • DNANexus
  • University of Washington
  • University of California at San Diego

Research output: Contribution to journalReview articlepeer-review

177 Scopus citations

Abstract

Effective data sharing is key to accelerating research to improve diagnostic precision, treatment efficacy, and long-term survival in pediatric cancer and other childhood catastrophic diseases. We present St. Jude Cloud (https://www.stjude.cloud), a cloud-based data-sharing ecosystem for accessing, analyzing, and visualizing genomic data from >10,000 pediatric patients with cancer and long-term survivors, and >800 pediatric sickle cell patients. Harmonized genomic data totaling 1.25 petabytes are freely available, including 12,104 whole genomes, 7,697 whole exomes, and 2,202 transcriptomes. The resource is expanding rapidly, with regular data uploads from St. Jude’s prospective clinical genomics programs. Three interconnected apps within the ecosystem—Genomics Platform, Pediatric Cancer Knowledgebase, and Visualization Community—enable simultaneously performing advanced data analysis in the cloud and enhancing the Pediatric Cancer knowledgebase. We demonstrate the value of the ecosystem through use cases that classify 135 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 pediatric cancer subtypes. Significance: To advance research and treatment of pediatric cancer, we developed St. Jude Cloud, a data-sharing ecosystem for accessing >1.2 petabytes of raw genomic data from >10,000 pediatric patients and survivors, innovative analysis workflows, integrative multiomics visualizations, and a knowledgebase of published data contributed by the global pediatric cancer community.

Original languageEnglish
Pages (from-to)1082-1099
Number of pages18
JournalCancer Discovery
Volume11
Issue number5
DOIs
StatePublished - May 2021

Fingerprint

Dive into the research topics of 'St. Jude cloud: A pediatric cancer genomic data-sharing ecosystem'. Together they form a unique fingerprint.

Cite this