Splunk health

StackState Self-hosted v4.5.x

This page describes StackState v4.5.x. The StackState 4.5 version range is End of Life (EOL) and no longer supported. We encourage customers still running the 4.5 version range to upgrade to a more recent release.

Go to the documentation for the latest StackState release.

Overview

The StackState Splunk health integration collects health from Splunk by executing Splunk saved searches from StackState Agent V2. In order to receive Splunk health data in StackState, configuration needs to be added to both Splunk and StackState Agent V2:

  • In Splunk - there should be at least one saved search that generates the health data you want to retrieve.

  • In StackState Agent V2 - a Splunk health check should be configured to connect to your Splunk instance and execute the relevant Splunk saved searches.

The Splunk health check on StackState Agent V2 will execute all configured Splunk saved searches periodically to retrieve a snapshot of the health at the current time.

Splunk is a StackState core integration.

Fields used

StackState Agent V2 executes the Splunk saved searches configured in the Splunk health Agent check and pushes retrieved data to StackState. The following fields from the results of a saved search are sent to StackState:

FieldTypeRequired?Description

check_state_id

string

The unique identifier for the check state.

name

string

Display name for the check state.

health

string

The health value of the check state. Can be clear, deviating or critical.

topology_element_identifier

string

The identifier of the component/relation this check state belongs to.

message

string

-

Extended message associated with the check state, supports markdown.

Example Splunk query

| loadjob savedsearch=:disks
| search OrganizationPart="*" OrgGrp="*" company="*"
| table host disk available_pct
| eval check_state_id = strcat host "_" disk
| eval name = disk
| eval health = case(available_pct == 0, "critical", true, "clear") 
| eval topology_element_identifier = host
| table check_state_id name health topology_element_identifier

Agent check

Configure the Splunk health check

To enable the Splunk health integration and begin collecting health data from your Splunk instance, the Splunk health check must be configured on StackState Agent V2. The check configuration provides all details required for the Agent to connect to your Splunk instance and execute a Splunk saved search.

Example Splunk health Agent check configuration file: splunk_health/conf.yaml.example (github.com)

To configure the Splunk health Agent check:

  1. Edit the StackState Agent V2 configuration file /etc/stackstate-agent/conf.d/splunk_health.d/conf.yaml.

  2. Under instances, add details of your Splunk instance:

    • url - The URL of your Splunk instance.

    • authentication - How the Agent should authenticate with your Splunk instance. Choose either token-based (recommended) or basic authentication. For details, see authentication configuration details.

    • ignore_saved_search_errors - Set to false to return an error if one of the configured saved searches does not exist. Default true.

    • collection_interval - The interval at which the check is scheduled to run.

  3. Under saved_searches, add details of each Splunk saved search that the check should execute to retrieve health information:

    • name - The name of the Splunk saved search to execute.

      • match - Regex used for selecting Splunk saved search queries. Default "health.*".

      • app - The Splunk app in which the saved searches are located. Default "search".

      • request_timeout_seconds - Default 10.

      • search_max_retry_count - Default 5.

      • search_seconds_between_retries - Default 1.

      • batch_size - Default 1000.

      • parameters - Used in the Splunk API request. The default parameters provided make sure the Splunk saved search query refreshes. Default force_dispatch: true and dispatch.now: true.

  4. More advanced options can be found in the example configuration (github.com).

  5. Save the configuration file.

  6. Restart StackState Agent V2 to apply the configuration changes.

  7. Incoming health data will be mapped to associated components and relations in the StackState UI as check states.

  8. To more closely inspect what the synchronization is doing, use the StackState CLI

The configured collection_interval will be used as the repeat_interval for the health synchronization. Make sure that the value set for the the collection_interval matches the time that the check will take to run.

Disable the Agent check

To disable the Splunk health Agent check:

  1. Remove or rename the Agent integration configuration file, for example:

     mv /etc/stackstate-agent/conf.d/splunk_health.d/conf.yaml /etc/stackstate-agent/conf.d/splunk_health.d/conf.yaml.bak
  2. Restart StackState Agent V2 to apply the configuration changes.

See also

Last updated