Resource requirement tweaks #107

Open
wants to merge 4 commits into base: dev

Conversation

@mvanniekerkHartwig mvanniekerkHartwig commented Oct 25, 2024

Solves #106 .

When testing oncoanalyser against some more difficult samples I ran into fairly erratic behaviour, with stages running out of memory.
In general this was caused by the Java process allocating 95% of the available memory as heap and then invoking an external (usually R) script. This pushed the container over the memory limits set in base.config, and the container was killed by the supervisor (in my case k8s).
This can happen even without an external (non-Java) application being invoked. The Xmx parameter only sets the size of the heap; the GC operates outside of it. So when the heap fills up, the GC kicks in, consumes the remaining memory, and the whole container goes OOM.
Also, some stages, such as purple and sage, are assigned a resource class that is too low for the amount of work they need to do, especially for more difficult samples.

Therefore we no longer set the heap size higher than 75% of the container memory limit when running hmftools (see pipeline5, where the second argument is the heap memory and the third is the total memory). After this change the stability of the pipeline improved greatly: I went from sage and purple pretty much always going OOM to the pipeline completing successfully even for difficult samples.
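For illustration, here's a minimal sketch of how a module script can derive the JVM heap from the Nextflow task allocation; the process name, memory figure, and tool invocation are placeholders, not the actual oncoanalyser module code:

```groovy
process SAGE_SOMATIC {
    memory 48.GB

    script:
    // Hypothetical sketch: give the JVM roughly 75% of the container allocation,
    // leaving headroom for GC/JVM overhead and any external (e.g. R) subprocesses.
    def max_heap_gb = Math.floor(task.memory.toGiga() * 0.75) as int
    """
    java -Xmx${max_heap_gb}g -jar sage.jar ...  # remaining tool arguments elided
    """
}
```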

I also bumped the process requirements for purple, sage, and orange, which brings them more in line with the resource requirements set in pipeline5. These extra resources are necessary to finish more difficult samples in a reasonable amount of time and without going OOM.
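As a rough illustration of what such bumps look like, per-process resources in a Nextflow pipeline are typically overridden in conf/base.config; the process selectors and figures below are placeholders rather than the exact values introduced in this PR:

```groovy
// conf/base.config - illustrative per-process overrides only
process {
    withName: 'PURPLE' {
        cpus   = 8
        memory = 64.GB
        time   = 12.h
    }
    withName: 'SAGE_SOMATIC' {
        cpus   = 16
        memory = 64.GB
        time   = 12.h
    }
}
```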

@nf-core-bot
Member

Warning

Newer version of the nf-core template is available.

Your pipeline is using an old version of the nf-core template: 2.14.1.
Please update your pipeline to the latest version.

For more documentation on how to update your pipeline, please see the nf-core documentation and Synchronisation documentation.

github-actions bot commented Oct 30, 2024

nf-core pipelines lint overall result: Passed ✅ ⚠️

Posted for pipeline commit 9caa5b0

✅ 194 tests passed
❔ 5 tests were ignored
❗ 14 tests had warnings

❗ Test warnings:

  • files_exist - File not found: assets/multiqc_config.yml
  • readme - README contains the placeholder zenodo.XXXXXXX. This should be replaced with the zenodo doi (after the first release).
  • pipeline_todos - TODO string in nextflow.config: Optionally, you can add a pipeline-specific nf-core config at https://github.com/nf-core/configs
  • pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
  • pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!
  • pipeline_todos - TODO string in awsfulltest.yml: You can customise AWS full pipeline tests as required
  • schema_params - Schema param panel not found from nextflow config
  • schema_params - Schema param genome_version not found from nextflow config
  • schema_params - Schema param genome_type not found from nextflow config
  • schema_params - Schema param ref_data_hmf_data_path not found from nextflow config
  • schema_params - Schema param ref_data_panel_data_path not found from nextflow config
  • schema_params - Schema param ref_data_virusbreakenddb_path not found from nextflow config
  • schema_params - Schema param ref_data_genome_gtf not found from nextflow config
  • schema_params - Schema param ref_data_hla_slice_bed not found from nextflow config

❔ Tests ignored:

✅ Tests passed:

Run details

  • nf-core/tools version 3.0.2
  • Run at 2024-10-30 00:28:14

@scwatts
Collaborator

scwatts commented Oct 30, 2024

I've made the modifier controlling the proportion of total available memory allocated to the JVM maximum heap configurable for users, with defaults defined at the module level.

For processes that require large amounts of memory I've increased the default proportion above the suggested 75%. Similarly, for processes with low memory allocations the default has also been increased, since the modifier doesn't scale well at the low end imo.

I'd rather not sacrifice usable memory to optimise where it's not strictly needed - please let me know if these defaults are a problem with your setup and we can adjust as necessary.

I'll run some tests to ensure these changes do indeed work; let's discuss any points you'd like reviewed in the meantime!
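As a hedged sketch of what a module-level, user-overridable heap fraction could look like (the ext.jvm_mem_fraction name and the values are hypothetical, not necessarily what this PR implements):

```groovy
// conf/modules.config - hypothetical per-process defaults that a user config could override
process {
    withName: 'PURPLE' {
        ext.jvm_mem_fraction = 0.80   // memory-hungry process: give the heap a larger share
    }
    withName: 'LILAC' {
        ext.jvm_mem_fraction = 0.90   // small allocation: a flat 75% would leave too little heap
    }
}
```

Inside a module the fraction would then be read when computing -Xmx, e.g. `task.ext.jvm_mem_fraction ?: 0.75`, falling back to the 75% suggested above when no override is set.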

@mvanniekerkHartwig
Author

Looks great, I think it's a good solution to make this setting configurable.
