Automated HPC/AI compute node health-checks Integrated with the SLURM scheduler