Checking Cluster Health
IntelĀ® Cluster Checker
Using IntelĀ® Cluster Checker will ensure your system is intact and configured to run a parallel application, pinpoint trouble spots and get detailed diagnostic information, and resolve issues.
Learn more about Cluster Checker.
Run Cluster Checker
To run Cluster Checker with 2 compute nodes and the Intel HPC Platform Specification compat-hpc-2018.0, get a terminal on the login node. Then, submit the clck.sh
job script at /software/samplejobs/intel/
.
$ ccqsub /software/samplejobs/intel/clck.sh
The job has successfully been submitted to the scheduler sched and is currently being processed. The job id is: 244789 you can use this id to look up the job status using the ccqstat utility.
$ ccqstat
Id Name Username Scheduler Status
--------------------------------------------------------------------------------
244789 clck.sh ccqadmin sched Completed
$ cat clck.sh244789.*
Intel(R) Cluster Checker 2019 Update 10 (build 20200921)
Running Collect
..................................................................
Running Analyze
SUMMARY
Command-line: clck -F intel_hpc_platform_compat-hpc-2018.0
Tests Run: intel_hpc_platform_compat-hpc-2018.0
Overall Result: No issues found
--------------------------------------------------------------------------------
VALIDATION PASSED
Intel HPC Platform Specification compat-hpc-2018.0
--------------------------------------------------------------------------------
2 nodes tested: cctest-3a82ccauto-244789-cgas-[00001-00002]
2 nodes with no issues: cctest-3a82ccauto-244789-cgas-[00001-00002]
0 nodes with issues:
--------------------------------------------------------------------------------
FUNCTIONALITY
No issues detected.
HARDWARE UNIFORMITY
No issues detected.
PERFORMANCE
No issues detected.
SOFTWARE UNIFORMITY
No issues detected.
See the following files for more information: clck_results.log, clck_execution_warnings.log