The full name of the reference genome in UK Biobank is GRCh38_full_analysis_set_plus_decoy_hla.fa, and the version number of the gene used is GCA_000001405.15, which is the initial version of GRCh38 at the end of 2013, and Iam now going to use the gff file corresponding to the reference genome, and I should choose Which one, the 2013 or the 2019 or 2023 gff file. Thank you!
Comments
2 comments
We can see the gff file https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/405/GCF_000001405.26_GRCh38/GCF_000001405.26_GRCh38_genomic.gff.gz there. Can you try this?
2. go to GCF_000001405.26_GRCh38_genomic.gff.gz
3. download and it says it is a gff3 file:
$ gunzip -c GCF_000001405.26_GRCh38_genomic.gff.gz|head -n6
##gff-version 3
#!gff-spec-version 1.20
#!processor NCBI annotwriter
#!genome-build GRCh38
#!genome-build-accession NCBI_Assembly:GCF_000001405.26
##sequence-region NC_000001.11 1 248956422
Thank you for your helpful answer! I will try it.?
Please sign in to leave a comment.