documentation on DRAGEN bgen and plink release
Hello,
I can't find any documentation on how the variants/genotypes of the DRAGEN bgen/plink files relate to the original pVCFs- is it just a straight up conversion? Has there been any filtering of genotype calls on depth/allele balance before conversion to pgen? Would be great to know!
Ruby
Comments
1 comment
Hi Ruby,
We have two articles on the DRAGEN datasets released which may assist you in understanding the steps taken to prepare the data, the filters applied, and how best to use it in your current workflow.
The first article covers the gVCF, pVCF, and CRAM format files which were released in November 2023: Initial DRAGEN whole genome sequencing (WGS) data release
The second article covers the ML-corrected pVCF, BGEN and PLINK2 format files which were released in March 2025: ML-Corrected DRAGEN whole genome sequencing (WGS) release
Each article covers the respective pipeline and quality control steps which occurred to prepare the data for release.
Hope this helps!
Please sign in to leave a comment.