The Matrix

A complete overview of all checks that are part of Check NetAppPRO, grouped by bundle.

Would you like to get a free trial license for the checks? Just drop us an email.

This table includes all checks from:

  • Check NetApp-ZAPI
  • Check NetApp-REST
  • Check NetApp-SnapCenter

Base Bundle

The Base Bundle includes all the checks needed for base monitoring such as hardware, shelves, disk status or usage of aggregates and volumes. This bundle also includes a snapshot check and one that provides an overview of active network interfaces.
Check and description, followed by the status columns: ZAPI (7m), ZAPI (cdot), REST (cdot), SnapCenter
check_netapp7_head.pl monitors the hardware objects of 7m heads (fans, NVRAM, power supplies and temperature sensors) stable - - -
check_netapp_health monitors the health state of the system and its subsystems. Sends an alarm if the system health status does not match a given pattern like 'ok'. [help] - - stable -
check_netapp_health.pl monitors the system health. Sends an alarm if the system health status is anything other than 'ok'. stable stable - -
check_netapp_netport checks if the network-interfaces are *enabled* and/or *up*. [help] - - stable -
check_netapp_node checks the uptime (time since last reboot) and the ONTAP version. [help] - - stable -
check_netapp_shelfenv checks the shelf status, power supplies, temperature, fans, voltage and current sensors, and also the coin batteries on the shelves. [help] - - stable -
Disk checks for failed, offline or unassigned disks on the filer. [help] stable stable - -
Head monitors the head's hardware objects (fans, NVRAM, power supplies, health state, temperature sensors) [help] - stable - -
NetPort checks if the network-interfaces are enabled or not [help] - stable - -
ShelfEnvironment checks the shelf status, power supplies, temperature, fans, voltage sensor and current sensor on the shelves. [help] stable stable - -
Snapshots checks if the snap-reserve is still sufficient. Thresholds are set in percent; performance-data can be either in percent or absolute (bytes). Additional criteria are the age or name of the snapshot. This can be used to monitor snapshot-backups and whether they are up to date. It can also be used to find snapshots related to a specific application like SNMV and to check all volumes for left-over snapshots. [help] stable stable - -
Uptime checks the seconds since last reboot. [help] - stable - -
Usage checks the used space in volumes and aggregates. Thresholds can be set in GB or percent. [help] stable stable - -
check_netapp_aggregate checks the aggregate object stores to see if they are available. [help] - - stable -
check_netapp_disk checks if disks are in a specific container (e.g. aggregate or spare) [help] - - stable -
check_netapp_volume and its `aggregated` command allow you to do some math on the aggregated values of all (filtered) volumes. Both the sum and the average across several volumes can be calculated. [help] - - stable -
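
The sum and average that the `aggregated` command of check_netapp_volume is described as calculating can be illustrated with a minimal sketch. This is not the plugin's code: the volume names, the values, the `aggregate_volumes` function and its substring filter are hypothetical and only show the arithmetic.

```python
from statistics import mean

# Hypothetical sample data: volume name -> used space (arbitrary units).
# These values are made up for illustration and are not plugin output.
volumes = {
    "vol_db01": 400,
    "vol_db02": 600,
    "vol_home": 250,
}


def aggregate_volumes(values, name_filter=""):
    """Return (sum, average) over all volumes whose name contains name_filter."""
    selected = [value for name, value in values.items() if name_filter in name]
    if not selected:
        raise ValueError("no volume matches the filter")
    return sum(selected), mean(selected)


total, average = aggregate_volumes(volumes, name_filter="vol_db")
print(f"sum={total} average={average}")
```

Thresholds could then be applied to the sum or the average instead of to each volume individually.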

Advanced Bundle

The Advanced Bundle includes additional status checks for clusters, aggregates, volumes, LUNs, V-Servers and SnapMirrors/SnapVaults. Furthermore, this bundle contains the necessary tools to verify the redundancy of disk paths, RAIDs and interface groups. We give you the means to react to possible storage shortages ahead of time by monitoring the overcommitments of aggregates as well as providing usage predictions using trend interpolation.
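
As a rough illustration of what monitoring overcommitment means, the sketch below computes it as the total size of all volumes relative to the size of the aggregate, in percent. This is an assumption about the formula based on the OvercommitAggr description in this matrix, not the plugin's implementation; the function name and the sample sizes are hypothetical.

```python
def overcommit_percent(aggregate_size, volume_sizes):
    """Return the total provisioned volume size as a percentage of the aggregate size.

    Assumption: a result above 100 means that more space has been promised
    to (thin provisioned) volumes than the aggregate can physically hold.
    """
    if aggregate_size <= 0:
        raise ValueError("aggregate_size must be positive")
    return 100.0 * sum(volume_sizes) / aggregate_size


# Hypothetical example: a 1000 GB aggregate hosting three thin provisioned volumes.
print(overcommit_percent(1000, [400, 500, 600]))
```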
Check and description, followed by the status columns: ZAPI (7m), ZAPI (cdot), REST (cdot), SnapCenter
AggregateState checks the aggregate states. Alarms if they are not online (configurable). [help] stable stable - -
AutosizeMode checks whether the autosize-mode of all autosized volumes is set to a given value (grow, grow_shrink, ...) [help] - stable - -
check_netapp7_cluster.pl checks the status of the high availability service (connected, taken over, takeover failed, ...). stable - - -
check_netapp7_fcpstats.pl monitors the FCP adapters for crc-errors and other values. stable - - -
check_netapp7_snapvault.pl monitors the status and lag-time of SnapVault relations only on 7m filers. (Cdot filers are checked with the SnapMirror checks). stable - - -
check_netapp7_vfiler.pl monitors the status of a vFiler (if the vfiler is running and if the network resources are configured) stable - - -
check_netapp_asup.pl monitors the ASUP-log and alarms if failed transmissions or collections were found. - stable - -
check_netapp_certificate checks SVM (and possibly also other) certificates for their expiration time. Will trigger an alarm if certificates expire soon (configurable thresholds). [help] - - stable -
check_netapp_ems checks the ems-log for the number of specific events per time-unit (rate). Alarm if e.g. too many autogrow-events took place within the last hour or day. [help] - - stable -
check_netapp_license.pl checks the filer for expiring (demo-)licenses. stable stable - -
check_netapp_nfs-persist.pl checks for non-persistent NFS shares. - pre-alpha - -
check_netapp_process.pl checks for runaway processes on a filer (as shown with the ps command). - stable - -
check_netapp_quotas.pl monitors quotas on a NetApp-filer (cluster mode only). - stable - -
check_netapp_scrub.pl sends an alarm if the timestamp of an aggregate's last scrub is older than a certain age. stable stable - -
check_netapp_snapcenter checks the SnapCenter database for failed or missing jobs. This way alarms are sent immediately if backups do not run as expected. [help] - - - stable
check_netapp_spare counts the number of available spare-disks or -partitions and sends an alarm if this number falls below a given threshold. Also considers the type of the disk (or partition) and its location. [help] - - stable -
check_netapp_takeover.pl sends an alarm if the storage failover facility is disabled or otherwise not active. - stable - -
check_netapp_time.pl checks the filer's NTP configuration (at least one NTP server must be configured) and measures the drift between the filer's system-time and the monitoring server. Can alarm if that drift gets too high. - stable - -
check_netapp_unused_lun.pl checks for LUNs which are online but do not have an initiator connected. - stable - -
DiskCount counts the number of disks matching definable criteria (disk-type, container (spare, ...), storage-pool). Mostly used to monitor the number of spare-disks of a certain type. [help] - stable - -
DiskPathQuality checks disk path qualities, reports i/o-error percentages and raises a CRITICAL error whenever an error percentage is above zero. [help] beta stable - -
DiskPaths checks if each disk has a given number and pattern of paths (A/B, B/A, ABAB, ABBA, ...). [help] beta stable - -
FCPAdapter checks the operational status of all fcp adapters. [help] - stable - -
IfGrp checks if an interface-group has enough links in up-state to still be redundant. [help] stable stable - -
Job checks for failed jobs. [help] - stable - -
LunAlignment searches for misaligned LUNs. Alarms if a certain number of misaligned LUNs is reached. [help] - stable - -
LunSize checks the unused but allocated blocks inside of a LUN. Notifies the admin if they exceed a certain number (the admin may then run an unmap procedure on VMware). [help] stable stable - -
LunState checks the LUN-states. Alarms if they are offline or not mapped to an initiator. [help] stable stable - -
NetInterface checks if a network interface's current-port is not equal to its home-port (output of the CLI command `network interface show -is-home false`). Can also check its operational mode (up/down). [help] - stable - -
OvercommitAggr returns a list of aggregates together with their overcommitment in percent. Overcommitment is the relation between the aggregate's size and the total size of all its (thin provisioned) volumes. [help] stable stable - -
Raidstatus alarms if one of the RAIDs is degraded. [help] stable stable - -
ReportIOPS reports how many iops are consumed by a given tenant. [help] - stable - -
ReportSpace reports how much space in bytes is consumed by a given tenant. [help] - stable - -
ServiceProcessor checks the status of the nodes' service-processors and whether they are correctly configured (autoupdate, IP-address). [help] - stable - -
ShelfBay checks the shelf- and disk-port status. Can alarm on disks in BYP-status. [help] stable stable - -
Sis checks dedup-values (stale-fingerprint-percentage, run-time of last successful operation). [help] - stable - -
SisStatus finds volumes whose compression or deduplication is not enabled. [help] - stable - -
SnapMirrorMetrics checks and logs SnapMirrors (including type Vault): lag-time, last-transfer-duration, last-transfer-size [help] - stable - -
SnapMirrorState checks and logs for SnapMirror (including type Vault): health, mirror-state [help] - stable - -
SnapshotLessVolume searches for volumes which do not have snapshots. [help] - stable - -
StorageUtilization Storage Utilization answers the question, “Am I effectively using the storage capacity available to my applications?” [help] stable stable - -
UnprotectedVolume checks for volumes not protected by SnapMirror. [help] - alpha - -
UsageTrend estimates how long it would take until an aggregate or volume is full if the trend of the last 48h (configurable) continued. Checks both bytes and inodes. [help] stable stable - -
VolumeAge searches for and flags volumes that were created more than a (configurable) time ago. An old age may be an indication of a forgotten and unused volume-clone. The logic can also be inverted to search for volumes with an exceptionally short age (which have been created within the last day or so). [help] beta stable stable -
VolumeAutosize checks a volume's total-size and alerts when the volume is close to being full relative to the autosize maximum. [help] stable stable stable -
VolumeState checks the volume-states. Alarms if they are not online (configurable). [help] stable stable stable -
Vserver monitors the admin-state or the operational-status of a Vserver (running, stopped, inconsistent or defunct) [help] - stable - -
check_netapp_storageport checks for degraded or otherwise non-ok storage-ports. [help] - - stable -
check_netapp_snapshot checks the space used by volume-snapshots. [help] - - stable -
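
The idea behind a usage-trend check such as UsageTrend can be sketched as a linear extrapolation: fit a straight line through recent usage samples and compute when it would reach the capacity. The sketch below is only an illustration under that assumption. The function name, the sample values and the use of an ordinary least-squares fit are hypothetical; the source does not state which method the plugin uses.

```python
def hours_until_full(samples, capacity):
    """Estimate the hours until `capacity` is reached.

    samples: list of (hour, used) pairs, ordered by time.
    Returns None if usage is not growing (no fill-up predicted).
    """
    n = len(samples)
    if n < 2:
        raise ValueError("at least two samples are required")
    mean_t = sum(t for t, _ in samples) / n
    mean_u = sum(u for _, u in samples) / n
    spread = sum((t - mean_t) ** 2 for t, _ in samples)
    if spread == 0:
        raise ValueError("samples must cover more than one point in time")
    # Least-squares slope: growth in used space per hour.
    slope = sum((t - mean_t) * (u - mean_u) for t, u in samples) / spread
    if slope <= 0:
        return None
    last_used = samples[-1][1]
    return (capacity - last_used) / slope


# Hypothetical samples over 48 hours: usage grows by 1 unit per hour.
history = [(0, 100), (24, 124), (48, 148)]
print(hours_until_full(history, capacity=196))
```

The same calculation would apply to inodes by passing inode counts instead of bytes.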

Performance Bundle

The Performance Bundle includes all the checks needed for monitoring and trend analysis of performance indicators. NetApp recommends monitoring “per-volume-latency” as a primary indicator for performance bottlenecks - the PerfVolume check makes this possible.
Check and description, followed by the status columns: ZAPI (7m), ZAPI (cdot), REST (cdot), SnapCenter
BadlyPerformingDisks checks all disks in a NetApp system or in a specific raid-group. If a certain number of them perform badly (= have a high utilization), an alarm is sent. [help] stable stable - -
BufferCache checks several metrics of the system buffer cache (=system memory) like Buffers being read, Buffers being written, Empty (unused) buffers, Buffers with modified data, Buffers associated with CP IO, ... [help] stable stable - -
FlashCache checks several metrics of the external FlashCache (PAM II) like External cache hit rate, Average latency of read I/Os, Number of wafl buffers served off the external cache, ... [help] stable stable - -
LunLatency checks the 'latency' and 'operations per second' (ops) per LUN. Shows details for total, read, write and other. NetApp recommends monitoring latency as the primary performance indicator. [help] stable stable - -
NVRAM checks data-rates and latency of the NVRAM. [help] stable stable - -
PerfAggregate checks the 'latency', 'transfer-rate' and other performance counters per aggregate. Shows details for total, read, write and other. Averages and totals over all aggregates of the filer can also be measured and monitored, which allows monitoring of the aggregate-latency and aggregate-transfer-rate on the filer level. [help] beta stable - -
PerfCpu checks one or all processors in a NetApp system for their utilization. [help] stable stable - -
PerfDisk checks all disks in a NetApp system for their utilization (percentage of time there was at least one outstanding request to the disk). Optionally, the check can be limited to the disks of a single aggregate. [help] stable stable - -
PerfHostadapter checks and counts rates per host adapter (Fibre Channel, Serial Attached SCSI, and parallel SCSI). [help] stable stable - -
PerfIf checks and counts transfer-rates and errors per network-interface (ifnet). Especially useful for monitoring 10GbE-ports. [help] stable stable - -
PerfLif checks and counts transfer-rates and errors per network-interface (lif) for DataONTAP 8.2.x or higher. [help] stable stable - -
PerfNic checks various performance counters of a NetApp's *physical* network interface (NIC). Among them are crc/transmit-error-counters which can be used to detect errors on the physical network-layer. [help] - stable - -
PerfQtree checks some ops-counters per q-tree (nfs-ops, cifs-ops, ...). [help] alpha stable - -
PerfSys checks various performance counters of the NetApp-system (mostly operations/second and transfer-rates). Counters supported: net_data_sent, dafs_ops, total_ops, disk_data_written, net_data_recv, cifs_ops, streaming_pkts, http_ops, nfs_ops, fcp_ops, disk_data_read, iscsi_ops [help] stable stable - -
PerfSysNode checks various performance counters of the NetApp-system (mostly operations/second and transfer-rates). Counters supported: net_data_sent, dafs_ops, total_ops, disk_data_written, net_data_recv, cifs_ops, streaming_pkts, http_ops, nfs_ops, fcp_ops, disk_data_read, iscsi_ops. The check evaluates these counters per Node and works only for DataONTAP 8.3 or later. [help] beta stable - -
PerfTcpIp checks CRC errors and packets sent/received for both the IP and TCP layer. [help] - stable - -
PerfVolume checks the 'latency' and 'operations per second' (ops) per volume. Shows details for total, read, write and other. NetApp recommends monitoring latency as the primary performance indicator. [help] stable stable - -
Wafl reads WAFL performance-counters like cp_count twice and calculates the rate of CPs per second. Different types of consistency-points (wafl-timer, back-to-back, ...) can be checked. The information gathered from this plugin corresponds to the CPty-column of 'sysstat -x 1'. [help] stable stable - -
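
Reading a counter twice and deriving a rate, as described for the Wafl check, follows a common pattern for monotonically increasing counters. The sketch below shows that pattern only; the function name and the sample values are hypothetical, and it does not query a NetApp system.

```python
def counter_rate(first, second):
    """Return the per-second rate between two (timestamp_seconds, counter) samples."""
    (t1, c1), (t2, c2) = first, second
    elapsed = t2 - t1
    if elapsed <= 0:
        raise ValueError("second sample must be taken after the first")
    if c2 < c1:
        # A decreasing counter usually means it was reset between the reads.
        raise ValueError("counter decreased between samples")
    return (c2 - c1) / elapsed


# Hypothetical samples: cp_count rises from 1200 to 1230 within 10 seconds.
print(counter_rate((0.0, 1200), (10.0, 1230)))
```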

MetroCluster Bundle

Checks exclusively for MetroCluster: configuration-status, ping-status (icmp, data), cluster-health, node-availability, rdb-health and mirror-status of the cluster-aggregates.
Check and description, followed by the status columns: ZAPI (7m), ZAPI (cdot), REST (cdot), SnapCenter
ClusterPeerHealth checks the health of cluster peer relationships by evaluating several ping and health statuses. [help] - stable - -
MetroClusterVserver sends an alarm if the configuration state of a MetroCluster vserver changes to unhealthy. [help] - stable - -
SyncMirror checks the mirror-status on MetroCluster aggregates. [help] stable stable - -

Status Descriptions

What do the statuses above mean?
Status Description
- no status
alpha First versions available for testing. Both program- and documentation-errors are likely. Not recommended for production.
beta Unstable version, but somewhat usable in production.
deprecated No more development.
on_road_map Planned with a specified release date.
pre-alpha Developer pre-design, mostly just documentation without code.
stable Stable, fully tested and documented version.
unsupported Check must not be offered or distributed any more.