Skip to main navigation Skip to search Skip to main content

Benchmarking of human Y-chromosomal haplogroup classifiers with whole-genome and whole-exome sequence data

  • Víctor García-Olivares
  • , Adrián Muñoz-Barrera
  • , Luis A. Rubio-Rodríguez
  • , David Jáspez
  • , Ana Díaz-de Usera
  • , Antonio Iñigo-Campos
  • , Krishna R. Veeramah
  • , Santos Alonso
  • , Mark G. Thomas
  • , José M. Lorenzo-Salazar
  • , Rafaela González-Montelongo
  • , Carlos Flores

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

In anthropological, medical, and forensic studies, the nonrecombinant region of the human Y chromosome (NRY) enables accurate reconstruction of pedigree relationships and retrieval of ancestral information. Using high-throughput sequencing (HTS) data, we present a benchmarking analysis of command-line tools for NRY haplogroup classification. The evaluation was performed using paired Illumina data from whole-genome sequencing (WGS) and whole-exome sequencing (WES) experiments from 50 unrelated donors. Additionally, as a validation, we also used paired WGS/WES datasets of 54 individuals from the 1000 Genomes Project. Finally, we evaluated the tools on data from third-generation HTS obtained from a subset of donors and one reference sample. Our results show that WES, despite typically offering less genealogical resolution than WGS, is an effective method for determining the NRY haplogroup. Y-LineageTracker and Yleaf showed the highest accuracy for WGS data, classifying precisely 98% and 96% of the samples, respectively. Yleaf outperforms all benchmarked tools in the WES data, classifying approximately 90% of the samples. Yleaf, Y-LineageTracker, and pathPhynder can correctly classify most samples (88%) sequenced with third-generation HTS. As a result, Yleaf provides the best performance for applications that use WGS and WES. Overall, our study offers researchers with a guide that allows them to select the most appropriate tool to analyze the NRY region using both second- and third-generation HTS data.

Original languageEnglish
Pages (from-to)4613-4618
Number of pages6
JournalComputational and Structural Biotechnology Journal
Volume21
DOIs
StatePublished - Jan 2023

Keywords

  • Comparative genomics
  • NRY haplogroup classification
  • Next-generation sequencing
  • Population genetics
  • Y chromosome

Fingerprint

Dive into the research topics of 'Benchmarking of human Y-chromosomal haplogroup classifiers with whole-genome and whole-exome sequence data'. Together they form a unique fingerprint.

Cite this