-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ERROR ~ Error executing process > 'graph_align_reads (41)' #33
Comments
Seems like you are running out of time and the job gets killed.
You can try increasing the allocated time for GraphAligner, there is a parameter for that.
…-------- Original Message --------
On 7/12/24 5:06 AM, Yasin K. wrote:
Hi,
I'm attempting to run GraffiTE in GT-sv-GA mode, but I encounter an error during the execution of the graph_align_reads (41) process, as shown in the message below. (However, when I run this step manually using GraphAligner, it completes successfully but the job gets killed.) Do you have any idea what might be causing this issue?
> Tip: you can replicate the issue by changing to the process work dir and entering the command bash .command.run
>
> -- Check '.nextflow.log' file for details
>
> executor > local (507)
> [21/3da086] process > map_asm (1) [100%] 115 of 115 ✔
> [cb/7c0a52] process > svim_asm (115) [100%] 115 of 115 ✔
> [13/513669] process > survivor_merge [100%] 1 of 1 ✔
> [b8/15025a] process > repeatmask_VCF (1) [100%] 1 of 1 ✔
> [a8/d2e2df] process > tsd_prep (1) [100%] 1 of 1 ✔
> [94/e3d127] process > tsd_search (148) [100%] 157 of 157 ✔
> [60/be72ec] process > tsd_report (1) [100%] 1 of 1 ✔
> [77/79f066] process > make_graph (1) [100%] 1 of 1 ✔
> [1a/75fbbf] process > graph_align_reads (49) [100%] 115 of 115, failed: 89 ✘
> [- ] process > vg_call -
> [- ] process > merge_VCFs -
> ERROR ~ Error executing process > 'graph_align_reads (41)'
>
> Caused by:
> Process graph_align_reads (41) terminated with an error exit status (134)
>
> Command executed:
>
> GraphAligner -t 1 -x vg -g index/index.vg -f X.fastq.gz -a X.gam
>
> vg pack -x index/index.vg -g X.gam -o Cant-1.pack -Q 0
>
> Command exit status:
> 134
>
> Command output:
> GraphAligner bioconda 1.0.13-
> Load graph from index/index.vg
> Build minimizer seeder from the graph
> Minimizer seeds, length 15, window size 20, density 10
> Seed cluster size 1
> Alignment bandwidth 10
> Clip alignment ends with identity < 66%
> X-drop DP score cutoff 14705
> write alignments to Cant-1.gam
> Align
>
> Command error:
> INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred
> INFO: fuse2fs not found, will not be able to mount EXT3 filesystems
> GraphAligner bioconda 1.0.13-
> GraphAligner bioconda 1.0.13-
> Load graph from index/index.vg
> Build minimizer seeder from the graph
> Minimizer seeds, length 15, window size 20, density 10
> Seed cluster size 1
> Alignment bandwidth 10
> Clip alignment ends with identity < 66%
> X-drop DP score cutoff 14705
> write alignments to Cant-1.gam
> Align
> Signal 11. Read: 2a9030c3-0fd1-4219-8496-ad2d7f1fb33b runid=b8d4812b34a5a9fdf693319045b7ee7e26775c97 read=30068 ch=2021 start_time=2019-11-30T05:26:38Z flow_cell_id=PAD94007 protocol_group_id=191128-CLIMARES sample_id=MULTIPLEX6-2-M40-S
> 00-L14 barcode=barcode07. Seed: 34132+,3021,15,752
> .command.sh: line 2: 16 Aborted GraphAligner -t 1 -x vg -g index/index.vg -f X.fastq.gz -a X.gam
This is from .nextflow log file:
> Jul-12 03:21:25.913 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (35); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/58/afee1f76707eac941eba261f63cc7d
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (35) failed
> Jul-12 03:21:25.914 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 426; name: graph_align_reads (34); status: COMPLETED; exit: -; error: nextflow.exception.ProcessE
> xception: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/39/b2a0764669a366a56a5e598a98e656]
> Jul-12 03:21:25.914 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (34); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/39/b2a0764669a366a56a5e598a98e656
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (34) failed
> Jul-12 03:21:25.915 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 440; name: graph_align_reads (48); status: COMPLETED; exit: -; error: nextflow.exception.ProcessE
> xception: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/dd/a38cb1d0b16fac105668ec2bf9234a]
> Jul-12 03:21:25.915 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (48); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/dd/a38cb1d0b16fac105668ec2bf9234a
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (48) failed
> Jul-12 03:21:25.915 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 396; name: graph_align_reads (4); status: COMPLETED; exit: -; error: nextflow.exception.ProcessException: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/97/785f099c658f091c3b16c7b0abe50e]
> Jul-12 03:21:25.915 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (4); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/97/785f099c658f091c3b16c7b0abe50e
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (4) failed
> Jul-12 03:21:25.916 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 434; name: graph_align_reads (42); status: COMPLETED; exit: -; error: nextflow.exception.ProcessException: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/24/4cc79fecfa1f92029986601a22bd88]
> Jul-12 03:21:25.916 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (42); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/24/4cc79fecfa1f92029986601a22bd88
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (42) failed
> Jul-12 03:21:25.929 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 483; name: graph_align_reads (91); status: COMPLETED; exit: -; error: nextflow.exception.ProcessException: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/fd/a12ae6a90d2858d7514f4992c60e24]
> Jul-12 03:21:25.929 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (91); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/fd/a12ae6a90d2858d7514f4992c60e24
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (91) failed
> Jul-12 03:21:25.953 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 405; name: graph_align_reads (13); status: COMPLETED; exit: -; error: nextflow.exception.ProcessException: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/58/5a8dbb35508e8365cbef0793e7f1f1]
> Jul-12 03:21:25.954 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (13); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/58/5a8dbb35508e8365cbef0793e7f1f1
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (13) failed
> Jul-12 03:21:25.954 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 462; name: graph_align_reads (70); status: COMPLETED; exit: -; error: nextflow.exception.ProcessException: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/58/6143156afdcd9ead22067413ce58d3]
> Jul-12 03:21:25.954 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (70); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/58/6143156afdcd9ead22067413ce58d3
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (70) failed
> Jul-12 03:21:25.955 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 484; name: graph_align_reads (92); status: COMPLETED; exit: -; error: nextflow.exception.ProcessException: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/50/5f76f2b85046ea4410442f61413fb8]
> Jul-12 03:21:25.955 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (92); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/50/5f76f2b85046ea4410442f61413fb8
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (92) failed
> Jul-12 03:21:25.955 [Task monitor] DEBUG n.processor.TaskPollingMonitor - Task completed > TaskHandler[id: 441; name: graph_align_reads (49); status: COMPLETED; exit: -; error: nextflow.exception.ProcessException: Process exceeded running time limit (12h); workDir: /raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/1a/75fbbffbd47b5475851ec1121ee26c]
> Jul-12 03:21:25.956 [Task monitor] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
> task: name=graph_align_reads (49); work-dir=/raven/ptmp/ykaya/Pangenome_project/SV/TE_SV/work/1a/75fbbffbd47b5475851ec1121ee26c
> error [nextflow.exception.ProcessFailedException]: Process graph_align_reads (49) failed
> Jul-12 03:21:26.093 [main] DEBUG nextflow.Session - Session await > all barriers passed
> Jul-12 03:21:26.103 [main] DEBUG nextflow.util.ThreadPoolManager - Thread pool 'PublishDir' shutdown completed (hard=false)
> Jul-12 03:21:26.210 [main] DEBUG nextflow.trace.WorkflowStatsObserver - Workflow completed > WorkflowStats[succeededCount=418; failedCount=89; ignoredCount=0; cachedCount=0; pendingCount=0; submittedCount=0; runningCount=0; retriesCount=0; abortedCount=0; succeedDuration=9d 21h 44m 9s; failedDuration=43d 16h 36m 38s; cachedDuration=0ms;loadCpus=0; loadMemory=0; peakRunning=115; peakCpus=115; peakMemory=480 GB; ]
> Jul-12 03:21:26.517 [main] DEBUG nextflow.cache.CacheDB - Closing CacheDB done
> Jul-12 03:21:26.571 [main] DEBUG nextflow.util.ThreadPoolManager - Thread pool 'FileTransfer' shutdown completed (hard=false)
> Jul-12 03:21:26.571 [main] DEBUG nextflow.script.ScriptRunner - > Execution complete -- Goodbye
The command I tried:
/u/ykaya/nextflow run https://github.com/cgroza/GraffiTE --genotype true \
--assemblies assemblies.csv \
--TE_library TE.nonredun.fa \
--reference /ptmp/X.fasta \
--graph_method graphaligner \
--reads longreads.csv
—
Reply to this email directly, [view it on GitHub](#33), or [unsubscribe](https://github.com/notifications/unsubscribe-auth/AAE4SC7VED3Z7Y7US74MNPDZL6MCLAVCNFSM6AAAAABKYQ6XEOVHI2DSMVQWIX3LMV43ASLTON2WKOZSGQYDKMJQGY4DAMA).
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
|
I disagree that the issue is solely due to time limits. Despite attempting the task three times with different configurations and time limits, I consistently encounter the error "ERROR ~ Error executing process > 'graph_align_reads (105)'" for various accessions. My jobs run on HPC nodes equipped with GPUs, allowing for a 24-hour execution window. The tool typically runs for 18-19 hours before encountering these errors. Here are the updated commands I'm using (currently running for 115 samples):
|
I see this line in your logs coming from GraphAligner:
It indicates the name of the read that crashed GraphAligner. |
I had similar issue in the past, and I would lean onto the possibility that this is a bug in GraphAligner. I would suggest to perhaps remove this read from the dataset (though another one may be problematic). This error seem to be non-deterministic. What seemed to work in some case was changing the number of CPU. I had mentioned the problem here but did not get any answer. I will update my message there in case they can help. Best, Clément |
Despite multiple attempts to run with GraphAligner, the issue persists. I have now attempted the genotyping step using vg giraffie, but encountered the following error message. This issue appears to have been addressed in commit For reference, you can review the related issue
any comments and suggestions are welcome! |
Seems that your data contains unsupported IUPAC nucleotides. I had this break GraphAligner in the past and is consistent with the new warning messages in VG.
…-------- Original Message --------
On 7/19/24 8:34 AM, Yasin K. wrote:
Despite multiple attempts to run with GraphAligner, the issue persists.
I have now attempted the genotyping step using vg giraffie, but encountered the following error message. This issue appears to have been addressed in [commit](vgteam/vg#3866) For reference, you can review the related [issue](vgteam/vg#3838)
> executor > local (621)
> [d0/8f092f] process > map_longreads (76) [100%] 115 of 115 ✔
> [fd/0cc43b] process > sniffles_sample_call (115) [100%] 115 of 115 ✔
> [44/c4b4f6] process > sniffles_population_call (1) [100%] 1 of 1 ✔
> [ba/97c7eb] process > map_asm (105) [100%] 115 of 115 ✔
> [45/77378f] process > svim_asm (115) [100%] 115 of 115 ✔
> [b4/e3ced0] process > survivor_merge [100%] 1 of 1 ✔
> [2c/60a236] process > merge_svim_sniffles2 (1) [100%] 1 of 1 ✔
> [91/1b394e] process > repeatmask_VCF (1) [100%] 1 of 1 ✔
> [50/838199] process > tsd_prep (1) [100%] 1 of 1 ✔
> [c5/ed6b2e] process > tsd_search (3) [100%] 154 of 154 ✔
> [c7/9d28ed] process > tsd_report (1) [100%] 1 of 1 ✔
> [d5/b8a4b2] process > make_graph (1) [100%] 1 of 1, failed: 1 ✘
> [- ] process > graph_align_reads -
> [- ] process > vg_call -
> [- ] process > merge_VCFs -
> ERROR ~ Error executing process > 'make_graph (1)'
>
> Caused by:
> Process make_graph (1) terminated with an error exit status (134)
>
> Command executed:
>
> bcftools sort -Oz -o sorted.vcf.gz pangenome.vcf
> tabix sorted.vcf.gz
> mkdir index
>
> vg autoindex --tmp-dir $PWD -p index/index -w giraffe -v sorted.vcf.gz -r X.fasta
>
> vg snarls index/index.giraffe.gbz > index/index.pb
>
> Command exit status:
> 134
>
> Command output:
> (empty)
>
> Command error:
> INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred
> INFO: fuse2fs not found, will not be able to mount EXT3 filesystems
> Writing to /tmp/bcftools.OAKXDo
> [W::bcf_hrec_check] Invalid tag name: "svim-asm_SUPP"
> [W::bcf_hrec_check] Invalid tag name: "svim-asm_SVLEN"
> [W::bcf_hrec_check] Invalid tag name: "svim-asm_SVTYPE"
> [W::bcf_hrec_check] Invalid tag name: "svim-asm_ID"
> Merging 1 temporary files
> [W::bcf_hrec_check] Invalid tag name: "svim-asm_SUPP"
> [W::bcf_hrec_check] Invalid tag name: "svim-asm_SVLEN"
> [W::bcf_hrec_check] Invalid tag name: "svim-asm_SVTYPE"
> [W::bcf_hrec_check] Invalid tag name: "svim-asm_ID"
> Cleaning
> Done
> [vg autoindex] Executing command: vg autoindex --tmp-dir /raven/ptmp/ykaya/path/work/d5/b8a4b27a8b0aa20f39868bb276d9a5 -p index/index -w giraffe -v sorted.vcf.gz -r X.fasta
> [IndexRegistry]: Checking for phasing in VCF(s).
> [IndexRegistry]: Chunking inputs for parallelism.
> index file X.fasta.fai not found, generating...
> [IndexRegistry]: Chunking FASTA(s).
> [IndexRegistry]: Chunking VCF(s).
> [IndexRegistry]: Constructing VG graph from FASTA and VCF input.
> warning:[vg::Constructor] Unsupported IUPAC ambiguity codes found in Chr2; coercing to N.
> warning:[vg::Constructor] Unsupported IUPAC ambiguity codes found in Chr3; coercing to N.
> warning:[vg::Constructor] Skipping duplicate variant with hash 7a4a5efd14440d32a6d1bb6f91a992cd6178fcf7 at Chr3:3110404
> warning:[vg::Constructor] Skipping duplicate variant with hash d1c7edc8494207c7c32084674a18a25159078735 at Chr4:4150882
> warning:[vg::Constructor] Skipping duplicate variant with hash e37e77706134d082ccb2d9eb8ac109fa35b4f3a3 at Chr2:4593323
> warning:[vg::Constructor] Unsupported IUPAC ambiguity codes found in Chr4; coercing to N.
> vg: src/constructor.cpp:528: vg::ConstructedChunk vg::Constructor::construct_chunk(std::string, std::string, std::vectorvcflib::Variant, size_t) const: Assertion `!variant->isSymbolicSV()' failed.
> ERROR: Signal 6 occurred. VG has crashed. Visit https://github.com/vgteam/vg/issues/new/choose to report a bug.
> Stack trace path: /tmp/vg_crash_9IYtPC/stacktrace.txt
> Please include the stack trace file in your bug report!
any comments and suggestions are welcome!
—
Reply to this email directly, [view it on GitHub](#33 (comment)), or [unsubscribe](https://github.com/notifications/unsubscribe-auth/AAE4SC66Y4NFDLU5PGPSZMLZNEBUHAVCNFSM6AAAAABKYQ6XEOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMZZGA2DCMJYHA).
You are receiving this because you commented.Message ID: ***@***.***>
|
Also
This tells me that there are SVs in the VCF that are not fully sequence resolved. |
Hi,
I'm attempting to run GraffiTE in GT-sv-GA mode, but I encounter an error during the execution of the graph_align_reads (41) process, as shown in the message below. (However, when I run this step manually using GraphAligner, it completes successfully but the job gets killed.) Do you have any idea what might be causing this issue?
This is from .nextflow log file:
The command I tried:
The text was updated successfully, but these errors were encountered: