What is the best way to filter samples to keep only unrelated individuals as well as only one individual from related pairs? I am working with WES pVCF files.

Permanently deleted user
I found this R function, which seems to do exactly what I want: https://www.rdocumentation.org/packages/ukbtools/versions/0.11.3/topics/ukb_gen_samples_to_remove. However, I have never used R before and am unsure how to run it. Is there an alternative way of doing something similar?

Comments

1 comment

  • Comment author
    Former User of DNAx Community_28

    The easiest way is to use UKB field 22021, "Genetic kinship to other participants": get the list of all eids where f22021==0.

     

    That will give you a total of 339516 subjects who are completely unrelated to one another.
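If you would rather not use R, the f22021==0 filter can be sketched in plain Python. This is only a minimal sketch: the column names `eid` and `f22021`, the function name `unrelated_eids`, and the toy table are assumptions for illustration, so adjust them to match however your phenotype table was exported.

```python
import csv
import io


def unrelated_eids(rows, kinship_col="f22021", id_col="eid"):
    """Return the IDs of rows whose kinship field equals 0.

    Column names are assumptions; rename them to match your own export.
    """
    keep = []
    for row in rows:
        value = (row.get(kinship_col) or "").strip()
        # Skip participants with a missing value; keep only the code 0.
        if value != "" and float(value) == 0:
            keep.append(row[id_col])
    return keep


# Toy stand-in for an exported phenotype table (IDs and values are made up).
toy_csv = io.StringIO(
    "eid,f22021\n"
    "1000001,0\n"
    "1000002,1\n"
    "1000003,\n"
    "1000004,10\n"
    "1000005,0\n"
)

kept = unrelated_eids(csv.DictReader(toy_csv))
print(kept)
```

With a real export you would pass `csv.DictReader(open("your_table.csv"))` instead of the toy table and write `kept` to a text file, one ID per line. A file like that can typically be given to a tool such as `bcftools view -S` to subset a pVCF, assuming the sample IDs in the VCF header match the eids.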

     

    Otherwise, you need to generate your own kinship matrix and apply a similar filter to it, removing one individual from each related pair.
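If you do have pairwise kinship estimates, keeping one individual per related pair can be sketched as a greedy removal: repeatedly drop the person with the most remaining relatives until no related pair is left. This is a hedged illustration, not the ukbtools implementation. The function name `samples_to_remove`, the `(id1, id2, kinship)` tuple layout, the toy pairs, and the 0.0884 cutoff (a commonly used threshold for second-degree relatedness) are all assumptions.

```python
from collections import defaultdict


def samples_to_remove(pairs, cutoff=0.0884):
    """Greedily pick samples to drop so that no related pair remains.

    pairs:  iterable of (id1, id2, kinship) tuples (assumed layout).
    cutoff: kinship coefficient at or above which a pair counts as related
            (assumed value; choose the one appropriate for your analysis).
    """
    # Build an undirected graph linking each sample to its relatives.
    neighbours = defaultdict(set)
    for id1, id2, kinship in pairs:
        if kinship >= cutoff and id1 != id2:
            neighbours[id1].add(id2)
            neighbours[id2].add(id1)

    removed = set()
    while True:
        # Samples that still have at least one relative in the kept set.
        candidates = [(len(rel), s) for s, rel in neighbours.items() if rel]
        if not candidates:
            break
        # Drop the sample with the most remaining relatives.
        _, worst = max(candidates)
        for other in neighbours[worst]:
            neighbours[other].discard(worst)
        neighbours[worst].clear()
        removed.add(worst)
    return removed


# Toy kinship pairs (made-up IDs and coefficients).
toy_pairs = [
    ("A", "B", 0.25),
    ("B", "C", 0.25),
    ("D", "E", 0.12),
    ("F", "G", 0.03),  # below the cutoff, so treated as unrelated
]

removed = samples_to_remove(toy_pairs)
print(sorted(removed))
```

Dropping the most-connected sample first tends to retain more people than removing one member of every pair at random, but it is a heuristic and is not guaranteed to give the largest possible unrelated set.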
