Filtering Null values in Cohorts
I have a question about cohorts in UKB RAP. Whenever we try to extract fields for a cohort, many of them contain a lot of null values. Since there are a large number of fields, it is tedious to go through them one by one. Is there a faster way to filter and retrieve only those fields that have fewer null values than some threshold, say 20?
Comments
It is not possible to do that in the Cohort Browser GUI. It should be possible to do it programmatically in a JupyterLab instance, which you can launch from the Tools tab. See the documentation: https://dnanexus.gitbook.io/uk-biobank-rap/working-on-the-research-analysis-platform/using-jupyterlab-on-the-research-analysis-platform
There are GitHub notebooks with tutorials on how to filter the data (notebooks 101-103 use Python, 104-105 use R): https://github.com/UK-Biobank/UKB-RAP-Notebooks/tree/main/NBs_Prelim
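As a rough illustration of the filtering step itself, here is a minimal pandas sketch. It assumes the cohort fields have already been loaded into a DataFrame (for example by following the notebooks above); the function name `fields_below_null_threshold` and the column names in the toy data are made up for the example and are not part of any RAP tooling.

```python
import pandas as pd


def fields_below_null_threshold(df: pd.DataFrame, max_nulls: int = 20) -> pd.DataFrame:
    """Return only the columns of df that have fewer than max_nulls missing values."""
    # Count missing values (NaN/None) per column.
    null_counts = df.isna().sum()
    # Keep the columns whose null count is strictly below the threshold.
    keep = null_counts[null_counts < max_nulls].index
    return df[keep]


# Toy stand-in for an extracted cohort table (column names are placeholders).
cohort = pd.DataFrame(
    {
        "eid": [1, 2, 3, 4],
        "field_a": ["x", "y", "y", "x"],      # 0 nulls
        "field_b": [54.0, None, 61.0, 47.0],  # 1 null
        "field_c": [None, None, None, 10.0],  # 3 nulls
    }
)

# With a threshold of 2, field_c (3 nulls) is dropped.
filtered = fields_below_null_threshold(cohort, max_nulls=2)
print(list(filtered.columns))
```

Note that `isna()` only counts true missing values; if the extracted data encodes missingness as empty strings or special codes, those would need to be converted to nulls first.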