Predictive Analytics & Forecasting

Predictive Analytics

Predictive Analytics is the Avantra Enterprise Edition approach to “spikey”, “bursty”, or “flapping” monitoring situations for checks based on time series data. It combines Machine Learning algorithms to predict future trends with classic threshold based monitoring.

This function is only available with the Avantra Enterprise Edition. For more information about editions please see the Avantra Editions Matrix.

There are situations where thresholds are too static, or become too strict. The intention of thresholds is to define a long term separator between good and not so good. Usually a situation only becomes critical if a threshold is exceeded over a longer period of time. Whenever resource usage is spikey, or there are usage bursts, thresholds can be exceeded for a short period of time, although this does not constitute a critical situation.

From a technical perspective it tricky is to decide: is a certain situation only a usage spike, or is it the beginning of a longer term problem?

Predictive Analytics tries to remediate this dilemma. The Avantra agent applies ML based algorithms to understand usage patterns and to predict - at every given point in time - how the usage will evolve in the near future. These predictions are used to determine a trend whenever the agent evaluates a check. And this trend is taken into account, in addition to the defined thresholds.

If a threshold is exceeded and the predictive analytics engine projects the situation clears again within a short period of time, the check status will not be changed. Only if the situation is projected to get worse the check status is changed immediately.

Staring with Avantra 21.11, the Predictive Analytics Engine is enabled in the CPULOAD and the HDB_CPULoad checks. It will successively be rolled out to other checks based on time series data.

Forecasting

Avantra includes a feature called forecasting which is implemented for many Avantra Checks. In fact, it is implemented wherever it is possible. A check requires a resource to be monitored, i.e. something where usage in percent of the maximum available resource space can be calculated.

An example of such a resource-related check is the FILESYSTEMS check. A file system has a size and an amount of space that is used. Therefore, the usage in percent is calculated. Now if the used space of a file system grows fast, but is still below the usage thresholds, the used space may be filled very quickly so it may be hard to react in time before the usage threshold related check will give you a Warning status.

Forecasting takes the job to inform you of heavy resource consumption before the resource is full. It will use the previous usage values to interpolate into the future, how long it will take – assuming the usage will continue to grow uniformly – until the resource space available is exhausted. Similar to the two Warning and Critical usage thresholds, each check has two thresholds for forecasting: a Warning (…ExWarn) threshold with a default value of 12 hours, and a Critical (…ExCrit) threshold with a default of one hour. To interpolate into the future, forecasting will always take the usage values of half of the exhaustion threshold past from now, so per default, it will look 30 minutes back for the Critical threshold.

An issue with forecasting is to prevent false alerts, so a growing resource needs to be growing for a certain time, otherwise, a false alert will be reported too often. Additionally, forecasting will be inactive if the usage of a resource is below 50 %.

Forecasting is also available with Performance data for those resources where it is useful. Whenever you select the time range of a Performance chart to extend into the future, the data is extrapolated accordingly. It is also available in the Service Level Reports unless opted-out during generation.

Predictive Resource Planning

For the following resources, there is an option to predict their usage over time:

Servers

Used Disk Space, Total Disk Size, Used File System Space

SAP Systems

Used Database Space, Total Database Size

Stand-alone Databases

Used Database Space, Total Database Size

Perform the following steps:

  1. Select Reporting  Predictive Resource Planning from the top-level menu.

  2. Choose one of the items listed above from the Resource drop-down list.

  3. By default, data is analyzed three months in the past and extrapolated three months into the future. You can change these settings in Historical Data and Predict Usage for.

  4. By default, data is displayed for All Systems providing the resource selected in Resource. But you can restrict this to Systems with Monitoring on or to an Ad-Hoc or Predefined Selectors. Choose the appropriate value from the drop-down list.

    The System Type returned by Selectors eventually applied has to correspond to the type of the Resource. Otherwise, the list will be empty.

The tab displays the following data:

  • For every Resource, there is the System Status, the System Name, the System Role, and the Customer displayed.

  • The column Used Space contains the current value of the resource consumption.

  • The column Predicted Space contains the extrapolated value of the resource at the date chosen in Predict Usage for.

  • The column Pred. Growth displays the difference between Predicted Space and Used Space in absolute figures.

  • The column Pred. Growth % displayed the difference between Predicted Space and Used Space in percent of the Used Space.

  • The Analysis column may contain additional data, like when a resource will be exhausted.

If you click on the icon (available in the Predicted Space and the Analysis columns), a chart opens that displays a graphical representation of the analyzed and predicted data.