APM AI Assistant and Intelligent Troubleshooting

5 min

What is the APM AI Assistant?

The APM AI Assistant is an intelligent feature that helps you troubleshoot application performance issues by providing contextual guidance, analyzing traces, and suggesting next steps during investigations. It acts as a virtual expert that understands your APM data and guides you toward solutions.

Note

AI Assistant features may vary by Splunk Observability Cloud version and entitlement. Some capabilities described here may be in preview or require specific licensing.

Key Capabilities

1. Trace Analysis

2. Guided Troubleshooting

3. Contextual Insights

How AI Assistant Helps

Scenario 1: Investigating a Slow Trace

Traditional approach:

  1. Open trace waterfall
  2. Manually scan all spans
  3. Calculate durations
  4. Identify slowest operations
  5. Cross-reference with other traces
  6. Form hypothesis about root cause

With AI Assistant:

  1. Open trace
  2. AI highlights: “Database query in checkout-service took 2.3s (95th percentile: 45ms)”
  3. Suggests: “Check database index on orders table”
  4. Links to similar traces with same pattern
  5. Shows when pattern started

Scenario 2: Understanding Error Patterns

AI Assistant provides:

Hands-On Exercise: Using AI-Powered APM Features

Exercise

Step 1: Explore Service Insights

  1. Navigate to APMServices
  2. Select a service with performance data
  3. Look for AI-generated insights or summaries:
    • Service health score
    • Performance trends
    • Anomaly indicators
    • Top issues or bottlenecks

Step 2: Analyze a Trace with AI Assistance

  1. Go to APMTraces
  2. Filter for slow or error traces
  3. Open a trace waterfall view
  4. Look for AI-powered features:
    • Highlighted problematic spans
    • Automatic critical path identification
    • Comparison with baseline traces
    • Suggested root cause

Step 3: Leverage Automatic Root Cause Detection

  1. In the trace view, find the Root Cause or Insights panel
  2. Review AI suggestions:
    • Which span is the bottleneck?
    • What changed compared to normal behavior?
    • Which tags or attributes correlate with the issue?
  3. Follow the suggested investigation path
  4. Drill down into the identified component

Step 4: Use Trace Comparison

  1. Select a problematic trace
  2. Look for Compare or Similar Traces feature
  3. AI will show:
    • Similar normal traces (baseline)
    • What’s different in the slow trace
    • Statistical comparison
  4. Identify the anomalous component

Intelligent Trace Features

Critical Path Highlighting

AI automatically identifies the critical path through a distributed trace:

Span Anomaly Detection

AI detects unusual spans by considering:

Service Dependency Intelligence

AI understands your service architecture:

AI-Powered APM Alerts

Smart Alert Prioritization

AI helps prioritize alerts by:

Adaptive Thresholds

For APM-based detectors:

Natural Language Capabilities

Asking Questions (Where Available)

Some AI Assistant implementations allow natural language queries:

Example questions:

AI provides:

Best Practices for AI Assistant

1. Provide Rich Context

Help AI help you:

2. Trust but Verify

3. Learn from AI Patterns

4. Provide Feedback

If your AI Assistant supports feedback:

Combining AI Assistant with Other AI Features

Integrated Workflow

  1. Alert fires (AutoDetect ML detector)
  2. Tag Spotlight narrows down the problem
  3. APM AI Assistant analyzes affected traces
  4. Related Content surfaces relevant dashboards
  5. Log Observer AI shows correlated log patterns
  6. Resolution with full context

Example Investigation Flow

text
Alert: "Latency increased on payment-service"
Tag Spotlight: "Region: us-west-1 (87% contribution)"
APM AI: "Database span duration increased 450%"
Trace Analysis: "Connection pool exhausted"
Log Observer AI: Pattern "Connection pool timeout" increased
Related Content: Dashboard "Database Connection Health"
Root Cause: Recent traffic spike exceeded DB connection limits

Limitations and Considerations

Tip

The APM AI Assistant is most effective when your applications are well-instrumented with comprehensive tags and attributes. The richer your trace data, the better the AI insights.

Next Steps

Let’s wrap up the workshop with a summary and additional resources.