Former User of DNAx Community_47
Comments
Recent activity by Former User of DNAx Community_47
-
Thanks for your reply.I guess this means that if I produce a large file (many terabytes) then the local HD of a single node would need to be large enough to hold that file? It seems like there ough...
- View comment
- 0 votes
-
Hello - I'm just wondering if there has been any progress on this? It seems to be a major issue if the WGS data cannot be processed using Hail.
- View comment
- 0 votes
-
Thank you very much!I hope this can be resolved soon. It seems like no commands that actually operate on the data using Hail are working.
- View comment
- 0 votes
-
Ah, I didn't realise that dx-spark-submit needed to be called from within the app's script. I'll give that a try.Does this work to submit a pyspark script to be run? E.g. within script.sh, I would...
- View comment
- 0 votes
-
Thanks Ondrej.I can't for the life of me figure out how to get the dx-spark-submit utility though. Where is it?I've installed the dx toolkit and other commands work fine.I've tried to run it on Clo...
- View comment
- 0 votes
-
Interestingly, I found that this OOM error is only "masked" when the offending process is given as a process substitution file input.That is, when I do the command like the below, it may incorrectl...
- View comment
- 0 votes
-
I realised that passing --get-streams to dx watch restricts the output to the stdout and stderr streams of the app, whereas if I omit this parameter (just dx watch jobid) then I get more detail, an...
- View comment
- 0 votes
-
That's a good idea to try, thanks!
- View comment
- 0 votes
-
Thank you! I would be interested to hear any comments the team has on the subject.
- View comment
- 0 votes
-
Some people will probably want those formats, which will at least be smaller than VCF.For my own purposes parquet would be preferable, since it would enable more efficiently querying the data and/o...
- View comment
- 0 votes