MENU

PRINCESS: A Framework for Comprehensive Detection and Haplotype Phasing of SNPs and Structural Variants

Speaker

Abstract

Long-read DNA sequencing technologies such as the Pacific Biosciences (PacBio) and Oxford Nanopore (ONT) platforms, have demonstrated enhanced detection of genomic variation, including Single Nucleotide Variants (SNVs), Structural Variants (SVs) and methylation changes. Individual studies so far have, however, have focused only on one of the three classes of variation: SNVs, SVs or methylation changes. Furthermore, only a few studies include phasing information to improve prediction of these classes of variation, to better associate genetic variation with phenotypes. Thus, clinical and research studies both currently lack a comprehensive view of genomic variation, even though the primary data is present in their DNA sequences. Here we introduce PRINCESS, a method that provides haplotype resolved SNVs, SVs and methylation changes based on a single long-read sequencing run from either PacBio or ONT. PRINCESS automatically adapts to different sequence coverage levels to optimally leverage the data set at hand. Thus, PRINCESS provides cost and time efficient comprehensive insights of haplotype resolved genomic variation. This information can be leveraged to simultaneous study the interaction of SNVs, SVs and methylation changes and their impact on phenotypic changes. PRINCESS was evaluated using Genome in a Bottle (GIAB) Oxford Nanopore standard and ultra-long reads as well as PacBio Continuous Long Reads (CLR) and Circular Consensus Sequencing (CCS) data. Using only one SMRT or PromethION flow cell Princess achieved high SNV precision (97.01%, 99.54%, 92.11%) and sensitivity (80.32%, 70.32%, 87.45%) for PacBio CLR, CCS and ONT PromethION, respectively, with minimum Genotype accuracy 98% of all read types. For SVs Princess also reached a high precision (93%, 94%, 86%) and a high sensitivity (77%, 79%, 79%). Both variant types were phased, achieving high N50s of 152Kbp, 117kbp and 17.42Mbp for PacBio CLR, CCS and ONT PromethION, respectively. We are currently evaluating methylation results from ONT. This highlights the versatility and performance of PRINCESS. PRINCESS applied to 18 PacBio with matching RNA-Seq data samples improved the detection of SVs (on average 22,105), SNVs and phasing (~5 Mbp average N50) and thus allowed the detection of eQTL in an automated, fast and comprehensive fashion.

Learning Objectives:

1. Princess: one-stop for all variant detection

2. Comprehensive understanding of variations using long-reads (PacBio and Oxford Nanopore Technologies)

3. Effect of CCS inserts size on Single Nucleotide and Structural Variation detection


Show Resources
You May Also Like
OCT 11, 2022 8:00 AM PDT
C.E. CREDITS
OCT 11, 2022 8:00 AM PDT
Date: October 11, 2022 Time: 8:00am (PDT), 11:00pm (EDT), 5:00pm (CEST) Multiomic profiling of cell populations at single-cell resolution is revolutionizing scientists’ understanding o...
JUN 21, 2022 6:00 AM PDT
JUN 21, 2022 6:00 AM PDT
Date: June 21, 2022 Time: 6:00am (PDT), 9:00am (EDT), 3:00pm (CEST) The global understanding and practice of medicine is currently undergoing a revolutionary change. This shift to precision...
JUN 28, 2022 7:00 AM PDT
JUN 28, 2022 7:00 AM PDT
Date: June 28, 2022 Time: 3:00pm (BST), 4:00pm (CET), 9:00am (CST), 7am (PST) Light-sheet microscopy is an extremely versatile imaging technique with a vast range of implementations that are...
MAR 02, 2022 9:00 AM PST
C.E. CREDITS
MAR 02, 2022 9:00 AM PST
Date: March 02, 2022 Time: 9:00am (PST), 12:00pm (EST) Single cell RNA-seq is known to only capture a small fraction of the transcriptome of each cell. Often, this is due to inherent limitat...
MAY 17, 2022 9:00 AM PDT
MAY 17, 2022 9:00 AM PDT
Date: May 17, 2022 Time: 9:00am (PDT), 12:00pm (EDT), 8:00pm (CEST) Gene therapeutics have great potential to treat many severe diseases in an unprecedented, targeted manner. The biopharmace...
MAR 23, 2022 11:00 AM PDT
MAR 23, 2022 11:00 AM PDT
Date: March 23, 2021 Time: 11:00am (PDT), 2:00pm (EDT), 8:00pm (CEDT) In this presentation, Dr. Middleton will review the development and deployment of large-scale saliva-based COVID-19 test...
Loading Comments...
Show Resources