Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix an error caused by system's /tmp being full when running checks in lbnl_hw.nhc #152

Open
wants to merge 2 commits into
base: dev
Choose a base branch
from

Conversation

RJukkara
Copy link

@RJukkara RJukkara commented Oct 2, 2024

The error in question can be easily reproduced by having for example check_hw_cpuinfo, check_hw_eth or check_hw_physmem configured in the system's nhc.conf and filling up /tmp.
Currently what happens is:

[root@c1104 ~]# df -h /tmp/
Filesystem      Size  Used Avail Use% Mounted on
tmpfs           512M  512M     0 100% /tmp
[root@c1104 ~]# nhc -ad
DEBUG:  Debugging activated via -d option.
ERROR:  nhc:  Health check failed:  check_hw_cpuinfo:  Actual CPU socket count (1) does not match expected (2).
ERROR:  nhc:  Health check failed:  check_hw_physmem:  Actual RAM size (0 kB) less than minimum allowed (263504064 kB).
ERROR:  nhc:  Health check failed:  check_hw_eth:  Ethernet device ib0 not detected.
ERROR:  nhc:  3 health checks failed.

After the changes in the PR:

[root@c1104 ~]# df -h /tmp/
Filesystem      Size  Used Avail Use% Mounted on
tmpfs           512M  512M     0 100% /tmp
[root@c1104 ~]# nhc -ad
DEBUG:  Debugging activated via -d option.
[root@c1104 ~]# echo $?
0

This commit fixes a
"cannot create temp file for here-document: No space left on device"
error when running NHC on a system where /tmp is full.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant