How to extract specific SNPs from data?

Jun Yuan Cheng

Hi! I'm looking for help to correct my understanding of the data available in Biobank. Please correct me if I'm wrong:

My question is then: how do I extract the genotypes for participants in the UK Biobank for further analysis in a Jupyter Notebook? I have already compiled a list of RSID I'm interested in, e.g., ["rs8176749", “rs8176746”, …]. I also understand that I can filter the participants to generate a smaller cohort (e.g. all males age 40 at time of study), and save this smaller cohort in a .dataset, e.g., “target_cohort.dataset”.

Now, how can I get the genotype at the RSIDs in my list for the participants in my target cohort? (I have access to the bulk data folder, I just have no clue how to access the data I'm interested in) Any help would be appreciated, thanks!

Comments

1 comment

Please sign in to leave a comment.