documentation on DRAGEN bgen and plink release

Hello,

I can't find any documentation on how the variants/genotypes of the DRAGEN bgen/plink files relate to the original pVCFs- is it just a straight up conversion? Has there been any filtering of genotype calls on depth/allele balance before conversion to pgen? Would be great to know!

Ruby

Comments

1 comment

  • Comment author
    Lucy BG The helpers that keep the community running smoothly. UKB Community team Data Analyst

    Hi Ruby,

    We have two articles on the DRAGEN datasets released which may assist you in understanding the steps taken to prepare the data, the filters applied, and how best to use it in your current workflow.

    The first article covers the gVCF, pVCF, and CRAM format files which were released in November 2023: Initial DRAGEN whole genome sequencing (WGS) data release

    The second article covers the ML-corrected pVCF, BGEN and PLINK2 format files which were released in March 2025: ML-Corrected DRAGEN whole genome sequencing (WGS) release

    Each article covers the respective pipeline and quality control steps which occurred to prepare the data for release. 

    Hope this helps!

    0

Please sign in to leave a comment.