AWS Health Check Dashboards
The Health Check dashboards in the Splunk Add-on for AWS let you monitor deployment performance and make it easier to troubleshoot and mitigate issues faster. They provide the following insights from your AWS add-on configuration and deployment.
Dashboard | Panels | Description |
---|---|---|
Health Overview (Provides information for all the errors and warnings generated from the inputs configured in the AWS add-on) | Error count by categories | Shows the error count by category, for example, configuration errors and network errors. The error count panels include drilldowns that redirect to the Error Details dashboard, which provides information on potential causes and solutions for the errors. Click the error count to identify and mitigate issues faster. |
| | Warning count | Shows the warning count by category. The warning count panels include drilldowns that redirect to the Warning Details dashboard, which provides information on possible reasons and resolutions for the warnings. Click the warning count to identify and mitigate issues faster. |
| | Error count timechart | These timecharts display the count of errors over time based on hosts, input types, input names, and error categories. |
Resource Utilization (Provides information about the resources used by the different types of inputs configured in the AWS add-on) | CPU and Memory utilization | Displays the CPU and memory utilization over time for single-instance and multi-instance inputs configured in the AWS add-on. Single-instance inputs are inputs for which Splunk spawns a single process for all inputs, whereas multi-instance inputs are inputs for which Splunk spawns an individual process for each input. You can use this panel to identify over-utilization of resources affecting your Splunk platform environment. |
| | Inputs count (single-instance and multi-instance) | Displays the number of enabled and disabled inputs configured in the AWS add-on. The number of inputs helps you gauge resource utilization, and you can scale the number of inputs up or down based on your requirements. |
| | KV Store calls count | Clicking the KV store calls count panel for a specific time range redirects you to the KV Store Utilization dashboard for that period, where you can analyze the load by comparing collections used by the AWS add-on to those used by other apps and add-ons. Select a collection name under the AWS add-on to display a timechart showing the average time taken by KV store calls for that collection. |
S3 Inputs Health Details (Focuses on the Generic S3, Incremental S3, and SQS-based S3 input types) | Time lapse (delay) and throughput | Displays the delay (time taken) in fetching the data and the throughput (size of data) over time. Use it to identify network latency or other delay-related issues. |
| | Error Message Details | Displays details of the errors encountered during input execution, along with possible reasons and resolutions. |
In the Splunk Web UI, open the Splunk Add-on for AWS and click the Health Check tab. From the dropdown, select the dashboard that you want to monitor.
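
To make the error count panels and the error count timechart more concrete, the following Python sketch shows the kind of aggregation they represent: a total per category, and a count per time bucket and category. It is an illustration only, not the add-on's implementation. The `EVENTS` records, the category strings, and the function names `count_by_category` and `timechart` are all hypothetical.

```python
from collections import Counter
from datetime import datetime, timezone

# Hypothetical error events as (ISO 8601 timestamp, category) pairs.
# Illustrative values only; this is not the add-on's actual log format.
EVENTS = [
    ("2024-01-01T00:05:00+00:00", "configuration error"),
    ("2024-01-01T00:40:00+00:00", "network error"),
    ("2024-01-01T00:55:00+00:00", "network error"),
    ("2024-01-01T01:10:00+00:00", "configuration error"),
]


def count_by_category(events):
    """Return the total number of errors per category."""
    return Counter(category for _, category in events)


def timechart(events, span_seconds=3600):
    """Return error counts keyed by (bucket start time, category)."""
    counts = Counter()
    for stamp, category in events:
        epoch = int(datetime.fromisoformat(stamp).timestamp())
        # Round the timestamp down to the start of its time bucket.
        bucket_start = epoch - epoch % span_seconds
        bucket = datetime.fromtimestamp(bucket_start, tz=timezone.utc)
        counts[(bucket.isoformat(), category)] += 1
    return counts


if __name__ == "__main__":
    print(dict(count_by_category(EVENTS)))
    for (bucket, category), n in sorted(timechart(EVENTS).items()):
        print(bucket, category, n)
```

A spike in a single bucket for one category (here, two network errors in the first hour) is the kind of pattern the timechart is meant to surface before you drill down into the error details.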