How do I load .p.vcf.gz files into the python environment on the spark cluster? Do I need to use dxdata?

I want to annotate the files but can't find the documentation on actually loading the file into the environment. This documentation gives a rough outline of how to annotate (not precise) but it does not specify how to load the files from the DNAnexus platform to the python environment: https://documentation.dnanexus.com/user/jupyter-notebooks/dxjupyterlab-spark-cluster

Comments

3 comments

Please sign in to leave a comment.