Next Steps min read


Command-Line Parameters

Refer to sdb-report send-alert for more information on the available command-line arguments.

Alerts Reference and Default Thresholds

Check + Description Warn default Fail default Configurable?
leavesNotOnline

Offline leaf nodes
Fail if >=1 Leaf Offline Yes
offlineAggregators

Offline aggregator nodes
Fail if >= 1 Aggregator offline Yes
explainRebalancePartitionsChecker

Identifies if partitions are not balanced across the cluster
Any output Yes

Configure to switch this to Warning
orphanDatabases

Identifies if any orphan databases are found. Orphan databases should be examined and dropped
Any output Yes

Configure to switch this to Failure
pendingDatabases

Identifies databases that are in a pending state. Pending databases are not available for read/write queries.
Any output Yes

Configure to switch this to Warning
unrecoverableDatabases

Identifies databases that are unrecoverable.
Any output Yes

Configure to switch to Warning
userDatabaseRedundancy

Determines if a database is redundant
Any output Yes

Configure to Switch to Warning
clusterMemoryUsage

Checks free memory against total available
Less than 15% of the memory available Less than 10% memory available Yes
userDatabaseRedundancy High availability not enabled (not configurable) Master Partition missing its replica partition Yes
secondaryDatabases

Checks for the presence of secondary replicating databases
Any output Yes

Configure to switch to Warning

System Checks Thresholds

Check + Description Warn default Fail default Configurable?
cpuIdle

Checks the percentage of CPU idle time
25.0% 5.0% Yes
diskLatencyRead

Determines the average time taken by the device to complete read requests
10 ms 25 ms Yes
diskLatencyWrite

Determines the average time taken by the device to complete write requests
10 ms 25 ms Yes
diskUsage

Checks free disk space and identifies if the disk is approaching its capacity limits
70% 80% Yes
diskInodesUsage

Checks free disk inodes
70% 85% Yes
majorPageFaults

Checks the number of major page faults generated by the system per second
10 majftl/s 20 majflt/s Yes
swapUsage

Checks the percentage of swap space used
5% 10% Yes
memoryCommitted

Determines the percentage of memory required for a given workload
70% 90% Yes