Checking Cluster Health

IntelĀ® Cluster Checker

Using IntelĀ® Cluster Checker will ensure your system is intact and configured to run a parallel application, pinpoint trouble spots and get detailed diagnostic information, and resolve issues.

Learn more about Cluster Checker.

Run Cluster Checker

To run Cluster Checker with 2 compute nodes and the Intel HPC Platform Specification compat-hpc-2018.0, get a terminal on the login node. Then, submit the clck.sh job script at /software/samplejobs/intel/.

$ ccqsub /software/samplejobs/intel/clck.sh
The job has successfully been submitted to the scheduler sched and is currently being processed. The job id is: 244789 you can use this id to look up the job status using the ccqstat utility.

$ ccqstat
Id      Name                                   Username  Scheduler  Status
--------------------------------------------------------------------------------
244789  clck.sh                                ccqadmin  sched      Completed

$ cat clck.sh244789.*

Intel(R) Cluster Checker 2019 Update 10 (build 20200921)

Running Collect
..................................................................
Running Analyze

SUMMARY
  Command-line:   clck -F intel_hpc_platform_compat-hpc-2018.0
  Tests Run:      intel_hpc_platform_compat-hpc-2018.0
  Overall Result: No issues found
--------------------------------------------------------------------------------
VALIDATION PASSED
  Intel HPC Platform Specification compat-hpc-2018.0
--------------------------------------------------------------------------------
2 nodes tested:         cctest-3a82ccauto-244789-cgas-[00001-00002]
2 nodes with no issues: cctest-3a82ccauto-244789-cgas-[00001-00002]
0 nodes with issues:
--------------------------------------------------------------------------------
FUNCTIONALITY
No issues detected.

HARDWARE UNIFORMITY
No issues detected.

PERFORMANCE
No issues detected.

SOFTWARE UNIFORMITY
No issues detected.

See the following files for more information: clck_results.log, clck_execution_warnings.log