Note: This document describes the latest FlashGrid Health Checker tool. For information about older versions 20.04 or 20.02 see FlashGrid Health Checker article.
You can download the latest version from here.
FlashGrid Health Checker tool performs comprehensive checking of multiple points across storage, network, OS, and other components to identify any errors, misconfiguration, or risk items. The tool can be executed on any system and will perform checks on a FlashGrid server instance or all nodes of the FlashGrid cluster. Execution of the tool is non-disruptive and can be performed on a live system.
flashgrid-health-check -h
usage: flashgrid-health-check [-h] [--version] [command] ...
FlashGrid HealthCheck CLI
optional arguments:
-h, --help show this help message and exit
--version show program's version number and exit
Commands:
[command] Default: show
show Show cluster status
reset-rpm-list Reset rpm list
reset-cfg-list Reset list of cfg files
reset-services-list Reset services list
Health Checker performs the following checks:
memory_max_target
, memory_target
, sga_max_size
, pga_aggregate_target
, pga_aggregate_limit
(db v12.1 or higher). Also does the total database memory allocation check across all databases.flashgrid-node
status is goodflashgrid-cluster verify
command# flashgrid-health-check-health-check reset-cfg-list
resets list of cfg files# flashgrid-health-check-health-check reset-rpm-list
command will regenerate the list of installed RPMs# flashgrid-health-check-health-check reset-services-list
regenerates the list of enabled servicesSample report from a two-node cluster:
# flashgrid-health-check
HealthCheck 20.9.63.75654 #e965ed97d34c1c24a9e4938d46de9dfc31b9b026
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Check: ASM DiskGroup status
rac1: WARNING
---------------------------------------------------------------------------------------------------------
GroupName Status Mounted Type TotalMiB FreeMiB OfflineDisks LostDisks Resync ReadLocal Vote
---------------------------------------------------------------------------------------------------------
DATA Good AllNodes NORMAL 40960 27224 0 0 No Enabled None
FRA Warning AllNodes NORMAL 30720 30381 0 0 Yes Enabled None
GRID Good AllNodes NORMAL 10240 9288 0 0 No Enabled 3/3
---------------------------------------------------------------------------------------------------------
rac2: WARNING
---------------------------------------------------------------------------------------------------------
GroupName Status Mounted Type TotalMiB FreeMiB OfflineDisks LostDisks Resync ReadLocal Vote
---------------------------------------------------------------------------------------------------------
DATA Good AllNodes NORMAL 40960 27224 0 0 No Enabled None
FRA Warning AllNodes NORMAL 30720 30381 0 0 Yes Enabled None
GRID Good AllNodes NORMAL 10240 9288 0 0 No Enabled 3/3
---------------------------------------------------------------------------------------------------------
Check: Alerts in Storage Fabric logs in the last 7 days
rac1: WARNING : /opt/flashgrid/log/fg-cluster-error.log: 53 alerts
rac2: WARNING : /opt/flashgrid/log/fg-cluster-error.log: 54 alerts
racq: WARNING : /opt/flashgrid/log/fg-cluster-error.log: 53 alerts
Check: Available memory
rac1: WARNING : avail mem: 15.4%
rac2: OK : avail mem: 27.7%
racq: OK : avail mem: 75.5%
Check: Check db memory settings
rac1: WARNING
All DBs: sum(pga_aggregate_limit) + max(HugePages, sum(sga_max_size)) >= TotalMemory - 12 GiB
: sum(pga_aggregate_limit) = 4 GiB
: HugePages = 17 GiB
: sum(sga_max_size) = 0 GiB
: TotalMemory = 31 GiB
rac2: WARNING : Failed to query the db instance 'orcl2'. Check that it is running.
Check: Check local_listener for each db
rac1: OK
rac2: WARNING : Failed to query the db instance 'orcl2'. Check that it is running.
Check: Check tnsnames.ora
rac1: OK : Warning: Multiple listener endpoints detected. Skipping tnsnames.ora check.
rac2: OK : Warning: Multiple listener endpoints detected. Skipping tnsnames.ora check.
Check: Flashgrid CLAN check
rac1: OK
rac2: OK
racq: OK
Check: Free system disk space
rac1: OK : /u01: avail 59%, /: avail 85%
rac2: OK : /u01: avail 60%, /: avail 89%
racq: OK : /: avail 89%
Check: Kernel taint check
rac1: OK
rac2: OK
racq: OK
Check: SF node status
rac1: OK
rac2: OK
racq: OK
Check: Storage Fabric cluster verification status
rac1: OK
rac2: OK
racq: OK
Check: Swap disabled
rac1: OK : Swap disabled
rac2: OK : Swap disabled
racq: OK : Swap disabled
Check: System config file modifications
rac1: WARNING
Checksum file not found, using fg_setup.log modification time instead.
/etc/dnsmasq.conf modified since install
rac2: WARNING
Checksum file not found, using fg_setup.log modification time instead.
/etc/sysconfig/iptables modified since install
racq: WARNING
Checksum file not found, using fg_setup.log modification time instead.
/etc/ssh/sshd_config modified since install
Check: System services
rac1: OK
rac2: OK
racq: OK
Check: Unexpected or 3rd party RPMs installed
rac1: OK
rac2: OK
racq: WARNING : telnet
Check: Unexpected or 3rd party services enabled
rac1: OK
rac2: OK
racq: OK