Background

Separation between ingestion and indexing services within Splunk Operator for Kubernetes enables the operator to independently manage the ingestion service while maintaining seamless integration with the indexing service.

This separation enables:

Independent scaling: Match resource allocation to ingestion or indexing workload.
Data durability: Off‑load buffer management and retry logic to a durable message queue.
Operational clarity: Separate monitoring dashboards for ingestion throughput vs indexing latency.

Splunk Support

These features are supported for Splunk 10.2 and above versions.

Important Note

[!WARNING] For customers deploying SmartBus on CMP, the Splunk Operator for Kubernetes (SOK) manages the configuration and lifecycle of the ingestor tier. The following SOK guide provides implementation details for setting up ingestion separation and integrating with existing indexers. This reference is primarily intended for CMP users leveraging SOK-managed ingestors.

Document Variables

SPLUNK_IMAGE_VERSION: Splunk Enterprise Docker Image version

Queue

Queue is introduced to store message queue information to be shared among IngestorCluster and IndexerCluster.

Spec

Queue inputs can be found in the table below. As of now, only SQS provider of message queue is supported.

Key	Type	Description
provider	string	[Required] Provider of message queue (Allowed values: sqs, sqs_cp)
sqs	SQS	[Required if provider=sqs or provider=sqs_cp] SQS message queue inputs

SQS message queue inputs can be found in the table below.

Key	Type	Description
name	string	[Required] Name of the queue
authRegion	string	[Required] Region where the queue is located
endpoint	string	[Optional, if not provided formed based on authRegion] AWS SQS Service endpoint
dlq	string	[Required] Name of the dead letter queue
volumes	[]VolumeSpec	[Optional] List of remote storage volumes used to mount the credentials for queue and bucket access (must contain s3_access_key and s3_secret_key)

SOK doesn’t support update of any of the Queue inputs except from the volumes which allow the change of secrets.

Example

apiVersion: enterprise.splunk.com/v4
kind: Queue
metadata:
  name: queue
spec:
  provider: sqs
  sqs:
    name: sqs-test
    authRegion: us-west-2
    endpoint: https://sqs.us-west-2.amazonaws.com
    dlq: sqs-dlq-test
    volumes:
      - name: s3-sqs-volume
        secretRef: s3-secret

ObjectStorage

ObjectStorage is introduced to store large messages (messages that exceed the size of messages that can be stored in SQS) to be shared among IngestorCluster and IndexerCluster.

Spec

ObjectStorage inputs can be found in the table below. As of now, only S3 provider of object storage is supported.

Key	Type	Description
provider	string	[Required] Provider of object storage (Allowed values: s3)
s3	S3	[Required if provider=s3] S3 object storage inputs

S3 object storage inputs can be found in the table below.

Key	Type	Description
path	string	[Required] Remote storage location for messages that are larger than the underlying maximum message size
endpoint	string	[Optional, if not provided formed based on authRegion] S3-compatible service endpoint

SOK doesn’t support update of any of the ObjectStorage inputs.

Example

apiVersion: enterprise.splunk.com/v4
kind: ObjectStorage
metadata:
  name: os
spec:
  provider: s3
  s3:
    path: ingestion/smartbus-test
    endpoint: https://s3.us-west-2.amazonaws.com

IngestorCluster

IngestorCluster is introduced for high‑throughput data ingestion into a durable message queue. Its Splunk pods are configured to receive events (outputs.conf) and publish them to a message queue.

Spec

In addition to common spec inputs, the IngestorCluster resource provides the following Spec configuration parameters.

Key	Type	Description
replicas	integer	The number of replicas (defaults to 3)
queueRef	corev1.ObjectReference	Message queue reference
objectStorageRef	corev1.ObjectReference	Object storage reference

SOK doesn’t support update of queueRef and objectStorageRef.

First provisioning or scaling up the number of replicas requires Ingestor Cluster Splunkd restart, but this restart is implemented automatically and done by SOK.

Example

The example presented below configures IngestorCluster named ingestor with Splunk ${SPLUNK_IMAGE_VERSION} image that resides in a default namespace and is scaled to 3 replicas that serve the ingestion traffic. This IngestorCluster custom resource is set up with the s3-secret credentials allowing it to perform SQS and S3 operations. Queue and ObjectStorage references allow the user to specify queue and bucket settings for the ingestion process.

In this case, the setup uses the SQS and S3 based configuration where the messages are stored in sqs-test queue in us-west-2 region with dead letter queue set to sqs-dlq-test queue. The object storage is set to ingestion bucket in smartbus-test directory. Based on these inputs, default-mode.conf and outputs.conf files are configured accordingly.

apiVersion: enterprise.splunk.com/v4
kind: IngestorCluster
metadata:
  name: ingestor
  finalizers:
    - enterprise.splunk.com/delete-pvc
spec:
  serviceAccount: ingestor-sa 
  replicas: 3
  image: splunk/splunk:${SPLUNK_IMAGE_VERSION}
  queueRef:
    name: queue
  objectStorageRef:
    name: os

IndexerCluster

IndexerCluster is enhanced to support index‑only mode enabling independent scaling, loss‑safe buffering, and simplified day‑0/day‑n management via Kubernetes CRDs. Its Splunk pods are configured to pull events from the queue (inputs.conf) and index them.

Spec

In addition to common spec inputs, the IndexerCluster resource provides the following Spec configuration parameters.

Key	Type	Description
replicas	integer	The number of replicas (defaults to 3)
queueRef	corev1.ObjectReference	Message queue reference
objectStorageRef	corev1.ObjectReference	Object storage reference

SOK doesn’t support update of queueRef and objectStorageRef.

First provisioning or scaling up the number of replicas requires Indexer Cluster Splunkd restart, but this restart is implemented automatically and done by SOK.

Example

The example presented below configures IndexerCluster named indexer with Splunk ${SPLUNK_IMAGE_VERSION} image that resides in a default namespace and is scaled to 3 replicas that serve the indexing traffic. This IndexerCluster custom resource is set up with the s3-secret credentials allowing it to perform SQS and S3 operations. Queue and ObjectStorage references allow the user to specify queue and bucket settings for the indexing process.

In this case, the setup uses the SQS and S3 based configuration where the messages are stored in and retrieved from sqs-test queue in us-west-2 region with dead letter queue set to sqs-dlq-test queue. The object storage is set to ingestion bucket in smartbus-test directory. Based on these inputs, default-mode.conf, inputs.conf and outputs.conf files are configured accordingly.

apiVersion: enterprise.splunk.com/v4
kind: ClusterManager
metadata:
  name: cm
  finalizers:
    - enterprise.splunk.com/delete-pvc
spec:
  serviceAccount: ingestor-sa 
  image: splunk/splunk:${SPLUNK_IMAGE_VERSION}
---
apiVersion: enterprise.splunk.com/v4
kind: IndexerCluster
metadata:
  name: indexer
  finalizers:
    - enterprise.splunk.com/delete-pvc
spec:
  clusterManagerRef:
    name: cm
  serviceAccount: ingestor-sa
  replicas: 3 
  image: splunk/splunk:${SPLUNK_IMAGE_VERSION}
  queueRef:
    name: queue
  objectStorageRef:
    name: os

Common Spec

Common spec values for all SOK Custom Resources can be found in CustomResources doc.

Helm Charts

Queue, ObjectStorage and IngestorCluster have been added to the splunk/splunk-enterprise Helm chart. IndexerCluster has also been enhanced to support new inputs.

Example

Below examples describe how to define values for Queue, ObjectStorage, IngestorCluster and IndexerCluster similarly to the above yaml files specifications.

queue:
  enabled: true
  name: queue
  provider: sqs
  sqs:
    name: sqs-test
    authRegion: us-west-2
    endpoint: https://sqs.us-west-2.amazonaws.com
    dlq: sqs-dlq-test
    volumes:
        - name: s3-sqs-volume
          secretRef: s3-secret

objectStorage:
  enabled: true
  name: os
  provider: s3
  s3:
    endpoint: https://s3.us-west-2.amazonaws.com
    path: ingestion/smartbus-test

ingestorCluster:
  enabled: true
  name: ingestor
  replicaCount: 3
  serviceAccount: ingestor-sa 
  queueRef:
    name: queue
  objectStorageRef:
    name: os

clusterManager:
  enabled: true
  name: cm
  replicaCount: 1
  serviceAccount: ingestor-sa 

indexerCluster:
  enabled: true
  name: indexer
  replicaCount: 3
  serviceAccount: ingestor-sa 
  clusterManagerRef:
    name: cm
  queueRef:
    name: queue
  objectStorageRef:
    name: os

Service Account

To be able to configure ingestion and indexing resources correctly in a secure manner, it is required to provide these resources with the service account that is configured with a minimum set of permissions to complete required operations. With this provided, the right credentials are used by Splunk to peform its tasks.

Example

The example presented below configures the ingestor-sa service account by using eksctl utility. It sets up the service account for cluster-name cluster in region us-west-2 with AmazonS3FullAccess and AmazonSQSFullAccess access policies.

eksctl create iamserviceaccount \                                                                                                                                          
  --name ingestor-sa \
  --cluster ind-ing-sep-demo \
  --region us-west-2 \
  --attach-policy-arn arn:aws:iam::aws:policy/AmazonS3FullAccess \
  --attach-policy-arn arn:aws:iam::aws:policy/AmazonSQSFullAccess \
  --approve \
  --override-existing-serviceaccounts

$ kubectl describe sa ingestor-sa                                                                                                                      
Name:                ingestor-sa
Namespace:           default
Labels:              app.kubernetes.io/managed-by=eksctl
Annotations:         eks.amazonaws.com/role-arn: arn:aws:iam::111111111111:role/eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123
Image pull secrets:  <none>
Mountable secrets:   <none>
Tokens:              <none>
Events:              <none>

$ aws iam get-role --role-name eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123
{
    "Role": {
        "Path": "/",
        "RoleName": "eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123",
        "RoleId": "123456789012345678901",
        "Arn": "arn:aws:iam::111111111111:role/eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123",
        "CreateDate": "2025-08-07T12:03:31+00:00",
        "AssumeRolePolicyDocument": {
            "Version": "2012-10-17",
            "Statement": [
                {
                    "Effect": "Allow",
                    "Principal": {
                        "Federated": "arn:aws:iam::111111111111:oidc-provider/oidc.eks.us-west-2.amazonaws.com/id/1234567890123456789012345678901"
                    },
                    "Action": "sts:AssumeRoleWithWebIdentity",
                    "Condition": {
                        "StringEquals": {
                            "oidc.eks.us-west-2.amazonaws.com/id/1234567890123456789012345678901:aud": "sts.amazonaws.com",
                            "oidc.eks.us-west-2.amazonaws.com/id/1234567890123456789012345678901:sub": "system:serviceaccount:default:ingestor-sa"
                        }
                    }
                }
            ]
        },
        "Description": "",
        "MaxSessionDuration": 3600,
        "Tags": [
            {
                "Key": "alpha.eksctl.io/cluster-name",
                "Value": "ind-ing-sep-demo"
            },
            {
                "Key": "alpha.eksctl.io/iamserviceaccount-name",
                "Value": "default/ingestor-sa"
            },
            {
                "Key": "alpha.eksctl.io/eksctl-version",
                "Value": "0.211.0"
            },
            {
                "Key": "eksctl.cluster.k8s.io/v1alpha1/cluster-name",
                "Value": "ind-ing-sep-demo"
            }
        ],
        "RoleLastUsed": {
            "LastUsedDate": "2025-08-18T08:47:27+00:00",
            "Region": "us-west-2"
        }
    }
}

$ aws iam list-attached-role-policies --role-name eksctl-cluster-name-addon-iamserviceac-Role1-123456789123
{
    "AttachedPolicies": [
        {
            "PolicyName": "AmazonSQSFullAccess",
            "PolicyArn": "arn:aws:iam::aws:policy/AmazonSQSFullAccess"
        },
        {
            "PolicyName": "AmazonS3FullAccess",
            "PolicyArn": "arn:aws:iam::aws:policy/AmazonS3FullAccess"
        }
    ]
}

Documentation References

IAM Roles for Service Accounts on eksctl Docs

Horizontal Pod Autoscaler

To automatically adjust the number of replicas to serve the ingestion traffic effectively, it is recommended to use Horizontal Pod Autoscaler which scales the workload based on the actual demand. It enables the user to provide the metrics which are used to make decisions on removing unwanted replicas if there is not too much traffic or setting up the new ones if the traffic is too big to be handled by currently running resources.

Example

The exmaple presented below configures HorizontalPodAutoscaler named ingestor-hpa that resides in a default namespace (same namespace as resources it is managing) to scale IngestorCluster custom resource named ingestor. With average utilization set to 50, the HorizontalPodAutoscaler resource will try to keep the average utilization of the pods in the scaling target at 50%. It will be able to scale the replicas starting from the minimum number of 3 with the maximum number of 10 replicas.

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: ingestor-hpa
spec:
  scaleTargetRef:
    apiVersion: enterprise.splunk.com/v4
    kind: IngestorCluster
    name: ingestor
  minReplicas: 3
  maxReplicas: 10
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 50

Documentation References

Horizontal Pod Autoscaling on Kubernetes Docs

Grafana

In order to monitor the resources, Grafana could be installed and configured on the cluster to present the setup on a dashabord in a series of useful diagrams and metrics.

Example

In the following example, the dashboard presents ingestion and indexing data in the form of useful diagrams and metrics such as number of replicas or resource consumption.

{
  "id": null,
  "uid": "splunk-autoscale",
  "title": "Splunk Ingestion & Indexer Autoscaling with I/O & PV",
  "schemaVersion": 27,
  "version": 12,
  "refresh": "5s",
  "time": { "from": "now-30m", "to": "now" },
  "timezone": "browser",
  "style": "dark",
  "tags": ["splunk","autoscale","ingestion","indexer","io","pv"],
  "graphTooltip": 1,
  "panels": [
    { "id": 1,  "type": "stat",       "title": "Ingestion Replicas",       "gridPos": {"x":0,"y":0,"w":4,"h":4}, "targets":[{"expr":"kube_statefulset_replicas{namespace=\"default\",statefulset=\"splunk-ingestor-ingestor\"}"}], "options": {"reduceOptions":{"calcs":["last"]},"orientation":"horizontal","colorMode":"value","graphMode":"none","textMode":"value","thresholds":{"mode":"absolute","steps":[{"value":null,"color":"#73BF69"},{"value":5,"color":"#EAB839"},{"value":8,"color":"#BF1B00"}]}}},
    { "id": 2,  "type": "stat",       "title": "Indexer Replicas",       "gridPos": {"x":4,"y":0,"w":4,"h":4}, "targets":[{"expr":"kube_statefulset_replicas{namespace=\"default\",statefulset=\"splunk-indexer-indexer\"}"}], "options": {"reduceOptions":{"calcs":["last"]},"orientation":"horizontal","colorMode":"value","graphMode":"none","textMode":"value","thresholds":{"mode":"absolute","steps":[{"value":null,"color":"#73BF69"},{"value":5,"color":"#EAB839"},{"value":8,"color":"#BF1B00"}]}}},
    { "id": 3,  "type": "timeseries","title": "Ingestion CPU (cores)","gridPos": {"x":8,"y":0,"w":8,"h":4},"targets":[{"expr":"sum(rate(container_cpu_usage_seconds_total{namespace=\"default\",pod=~\"splunk-ingestor-ingestor-.*\"}[1m]))","legendFormat":"CPU (cores)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#FFA600"}}},
    { "id": 4,  "type": "timeseries","title": "Ingestion Memory (MiB)","gridPos": {"x":16,"y":0,"w":8,"h":4},"targets":[{"expr":"sum(container_memory_usage_bytes{namespace=\"default\",pod=~\"splunk-ingestor-ingestor-.*\"}) / 1024 / 1024","legendFormat":"Memory (MiB)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#00AF91"}}},
    { "id": 5,  "type": "timeseries","title": "Ingestion Network In (KB/s)","gridPos": {"x":0,"y":8,"w":8,"h":4},"targets":[{"expr":"sum(rate(container_network_receive_bytes_total{namespace=\"default\",pod=~\"splunk-ingestor-ingestor-.*\"}[1m])) / 1024","legendFormat":"Net In (KB/s)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#59A14F"}}},
    { "id": 6,  "type": "timeseries","title": "Ingestion Network Out (KB/s)","gridPos": {"x":8,"y":8,"w":8,"h":4},"targets":[{"expr":"sum(rate(container_network_transmit_bytes_total{namespace=\"default\",pod=~\"splunk-ingestor-ingestor-.*\"}[1m])) / 1024","legendFormat":"Net Out (KB/s)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#E15759"}}},
    { "id": 7,  "type": "timeseries","title": "Indexer CPU (cores)","gridPos": {"x":16,"y":4,"w":8,"h":4},"targets":[{"expr":"sum(rate(container_cpu_usage_seconds_total{namespace=\"default\",pod=~\"splunk-indexer-indexer-.*\"}[1m]))","legendFormat":"CPU (cores)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#7D4E57"}}},
    { "id":8,  "type": "timeseries","title": "Indexer Memory (MiB)","gridPos": {"x":0,"y":12,"w":8,"h":4},"targets":[{"expr":"sum(container_memory_usage_bytes{namespace=\"default\",pod=~\"splunk-indexer-indexer-.*\"}) / 1024 / 1024","legendFormat":"Memory (MiB)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#4E79A7"}}},
    { "id":9,  "type": "timeseries","title": "Indexer Network In (KB/s)","gridPos": {"x":8,"y":12,"w":8,"h":4},"targets":[{"expr":"sum(rate(container_network_receive_bytes_total{namespace=\"default\",pod=~\"splunk-indexer-indexer-.*\"}[1m])) / 1024","legendFormat":"Net In (KB/s)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#9467BD"}}},
    { "id":10,  "type": "timeseries","title": "Indexer Network Out (KB/s)","gridPos": {"x":16,"y":12,"w":8,"h":4},"targets":[{"expr":"sum(rate(container_network_transmit_bytes_total{namespace=\"default\",pod=~\"splunk-indexer-indexer-.*\"}[1m])) / 1024","legendFormat":"Net Out (KB/s)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#8C564B"}}},
    { "id":11,  "type": "timeseries","title": "Ingestion Disk Read (KB/s)","gridPos": {"x":0,"y":16,"w":8,"h":4},"targets":[{"expr":"sum(rate(container_fs_reads_bytes_total{namespace=\"default\",pod=~\"splunk-ingestor-ingestor-.*\"}[1m])) / 1024","legendFormat":"Disk Read (KB/s)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#1F77B4"}}},
    { "id":12,  "type": "timeseries","title": "Ingestion Disk Write (KB/s)","gridPos": {"x":8,"y":16,"w":8,"h":4},"targets":[{"expr":"sum(rate(container_fs_writes_bytes_total{namespace=\"default\",pod=~\"splunk-ingestor-ingestor-.*\"}[1m])) / 1024","legendFormat":"Disk Write (KB/s)"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"},"color":{"mode":"fixed","fixedColor":"#FF7F0E"}}},
    { "id":13,  "type": "timeseries","title": "Indexer PV Usage (GiB)","gridPos": {"x":0,"y":20,"w":8,"h":4},"targets":[{"expr":"kubelet_volume_stats_used_bytes{namespace=\"default\",persistentvolumeclaim=~\".*-indexer-.*\"} / 1024 / 1024 / 1024","legendFormat":"Used GiB"},{"expr":"kubelet_volume_stats_capacity_bytes{namespace=\"default\",persistentvolumeclaim=~\".*-indexer-.*\"} / 1024 / 1024 / 1024","legendFormat":"Capacity GiB"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"}}},
    { "id":14,  "type": "timeseries","title": "Ingestion PV Usage (GiB)","gridPos": {"x":8,"y":20,"w":8,"h":4},"targets":[{"expr":"kubelet_volume_stats_used_bytes{namespace=\"default\",persistentvolumeclaim=~\".*-ingestor-.*\"} / 1024 / 1024 / 1024","legendFormat":"Used GiB"},{"expr":"kubelet_volume_stats_capacity_bytes{namespace=\"default\",persistentvolumeclaim=~\".*-ingestor-.*\"} / 1024 / 1024 / 1024","legendFormat":"Capacity GiB"}],"options":{"legend":{"displayMode":"list","placement":"bottom"},"yAxis":{"mode":"auto"}}}
  ]
}

Documentation References

kube-prometheus-stack

App Installation for Ingestor Cluster Instances

Application installation is supported for Ingestor Cluster instances. However, as of now, applications are installed using local scope and if any application requires Splunk restart, there is no automated way to detect it and trigger automatically via Splunk Operator.

Therefore, to be able to enforce Splunk restart for each of the Ingestor Cluster pods, it is recommended to add/update IngestorCluster CR annotations/labels and apply the new configuration which will trigger the rolling restart of Splunk pods for Ingestor Cluster.

Ideally, update of annotations and labels should not trigger pod restart at all and it is under the investigation on how to stop this from happening and handle restart automatically.

Example

Install CRDs and Splunk Operator for Kubernetes.

SOK_IMAGE_VERSION: version of the image for Splunk Operator for Kubernetes

$ make install

$ kubectl apply -f ${SOK_IMAGE_VERSION}/splunk-operator-cluster.yaml --server-side

$ kubectl get po -n splunk-operator                          
NAME                                                  READY   STATUS    RESTARTS   AGE
splunk-operator-controller-manager-785b89d45c-dwfkd   2/2     Running   0          4d3h

Create a service account.

$ eksctl create iamserviceaccount \                                                                                                                                          
  --name ingestor-sa \
  --cluster ind-ing-sep-demo \
  --region us-west-2 \
  --attach-policy-arn arn:aws:iam::aws:policy/AmazonS3FullAccess \
  --attach-policy-arn arn:aws:iam::aws:policy/AmazonSQSFullAccess \
  --approve \
  --override-existing-serviceaccounts

$ kubectl describe sa ingestor-sa                                                                                                                      
Name:                ingestor-sa
Namespace:           default
Labels:              app.kubernetes.io/managed-by=eksctl
Annotations:         eks.amazonaws.com/role-arn: arn:aws:iam::111111111111:role/eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123
Image pull secrets:  <none>
Mountable secrets:   <none>
Tokens:              <none>
Events:              <none>

$ aws iam get-role --role-name eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123
{
    "Role": {
        "Path": "/",
        "RoleName": "eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123",
        "RoleId": "123456789012345678901",
        "Arn": "arn:aws:iam::111111111111:role/eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123",
        "CreateDate": "2025-08-07T12:03:31+00:00",
        "AssumeRolePolicyDocument": {
            "Version": "2012-10-17",
            "Statement": [
                {
                    "Effect": "Allow",
                    "Principal": {
                        "Federated": "arn:aws:iam::111111111111:oidc-provider/oidc.eks.us-west-2.amazonaws.com/id/1234567890123456789012345678901"
                    },
                    "Action": "sts:AssumeRoleWithWebIdentity",
                    "Condition": {
                        "StringEquals": {
                            "oidc.eks.us-west-2.amazonaws.com/id/1234567890123456789012345678901:aud": "sts.amazonaws.com",
                            "oidc.eks.us-west-2.amazonaws.com/id/1234567890123456789012345678901:sub": "system:serviceaccount:default:ingestor-sa"
                        }
                    }
                }
            ]
        },
        "Description": "",
        "MaxSessionDuration": 3600,
        "Tags": [
            {
                "Key": "alpha.eksctl.io/cluster-name",
                "Value": "ind-ing-sep-demo"
            },
            {
                "Key": "alpha.eksctl.io/iamserviceaccount-name",
                "Value": "default/ingestor-sa"
            },
            {
                "Key": "alpha.eksctl.io/eksctl-version",
                "Value": "0.211.0"
            },
            {
                "Key": "eksctl.cluster.k8s.io/v1alpha1/cluster-name",
                "Value": "ind-ing-sep-demo"
            }
        ],
        "RoleLastUsed": {
            "LastUsedDate": "2025-08-18T08:47:27+00:00",
            "Region": "us-west-2"
        }
    }
}

$ aws iam list-attached-role-policies --role-name eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123
{
    "AttachedPolicies": [
        {
            "PolicyName": "AmazonSQSFullAccess",
            "PolicyArn": "arn:aws:iam::aws:policy/AmazonSQSFullAccess"
        },
        {
            "PolicyName": "AmazonS3FullAccess",
            "PolicyArn": "arn:aws:iam::aws:policy/AmazonS3FullAccess"
        }
    ]
}

Install Queue resource.

$ cat queue.yaml          
apiVersion: enterprise.splunk.com/v4
kind: Queue
metadata:
  name: queue
  finalizers:
    - enterprise.splunk.com/delete-pvc
spec:
  provider: sqs
  sqs:
    name: sqs-test
    authRegion: us-west-2
    endpoint: https://sqs.us-west-2.amazonaws.com
    dlq: sqs-dlq-test

$ kubectl apply -f queue.yaml

$ kubectl get queue                        
NAME   PHASE   AGE   MESSAGE
queue  Ready   20s  

kubectl describe queue                               
Name:         queue
Namespace:    default
Labels:       <none>
Annotations:  <none>
API Version:  enterprise.splunk.com/v4
Kind:         Queue
Metadata:
  Creation Timestamp:  2025-10-27T10:25:53Z
  Finalizers:
    enterprise.splunk.com/delete-pvc
  Generation:        1
  Resource Version:  12345678
  UID:               12345678-1234-5678-1234-012345678911
Spec:
  Sqs:
    Auth Region:                        us-west-2
    DLQ:                           sqs-dlq-test
    Endpoint:                      https://sqs.us-west-2.amazonaws.com
    Name:                          sqs-test
  Provider:                        sqs
Status:
  Message:  
  Phase:    Ready
  Resource Rev Map:
Events:  <none>

Install ObjectStorage resource.

$ cat os.yaml          
apiVersion: enterprise.splunk.com/v4
kind: ObjectStorage
metadata:
  name: os
  finalizers:
    - enterprise.splunk.com/delete-pvc
spec:
  provider: s3
  s3:
    endpoint: https://s3.us-west-2.amazonaws.com
    path: ingestion/smartbus-test

$ kubectl apply -f os.yaml

$ kubectl get os                        
NAME   PHASE   AGE   MESSAGE
os    Ready   20s  

kubectl describe os                               
Name:         os
Namespace:    default
Labels:       <none>
Annotations:  <none>
API Version:  enterprise.splunk.com/v4
Kind:         ObjectStorage
Metadata:
  Creation Timestamp:  2025-10-27T10:25:53Z
  Finalizers:
    enterprise.splunk.com/delete-pvc
  Generation:        1
  Resource Version:  12345678
  UID:               12345678-1234-5678-1234-012345678911
Spec:
  S3:
    Endpoint:  https://s3.us-west-2.amazonaws.com
    Path:      ingestion/smartbus-test
  Provider:    s3
Status:
  Message:  
  Phase:    Ready
  Resource Rev Map:
Events:  <none>

Install IngestorCluster resource.

$ cat ingestor.yaml          
apiVersion: enterprise.splunk.com/v4
kind: IngestorCluster
metadata:
  name: ingestor
  finalizers:
    - enterprise.splunk.com/delete-pvc
spec:
  serviceAccount: ingestor-sa 
  replicas: 3
  image: splunk/splunk:${SPLUNK_IMAGE_VERSION}
  queueRef:
    name: queue
  objectStorageRef:
    name: os

$ kubectl apply -f ingestor.yaml

$ kubectl get po 
NAME                         READY   STATUS    RESTARTS   AGE
splunk-ingestor-ingestor-0   1/1     Running   0          2m12s
splunk-ingestor-ingestor-1   1/1     Running   0          2m12s
splunk-ingestor-ingestor-2   1/1     Running   0          2m12s

$ kubectl describe ingestorcluster ingestor
Name:         ingestor
Namespace:    default
Labels:       <none>
Annotations:  <none>
API Version:  enterprise.splunk.com/v4
Kind:         IngestorCluster
Metadata:
  Creation Timestamp:  2025-08-18T09:49:45Z
  Generation:          1
  Resource Version:    12345678
  UID:                 12345678-1234-1234-1234-1234567890123
Spec:
  Queue Ref:
    Name:           queue
    Namespace:      default
  Image:  splunk/splunk:${SPLUNK_IMAGE_VERSION}
  Object Storage Ref:
    Name:           os
    Namespace:      default
  Replicas:                          3
  Service Account:                   ingestor-sa
Status:
  App Context:
    App Repo:
      App Install Period Seconds:  90
      Defaults:
        Premium Apps Props:
          Es Defaults:
      Install Max Retries:  2
    Bundle Push Status:
    Is Deployment In Progress:  false
    Last App Info Check Time:   0
    Version:                    0
  Credential Secret Version:  33744270
  Message:                      
  Phase:                        Ready
  Ready Replicas:               3
  Replicas:                     3
  Resource Rev Map:
  Selector:           app.kubernetes.io/instance=splunk-ingestor-ingestor
  Tel App Installed:  true
Events:               <none>

$ kubectl exec -it splunk-ingestor-ingestor-0 -- sh
$ kubectl exec -it splunk-ingestor-ingestor-1 -- sh
$ kubectl exec -it splunk-ingestor-ingestor-2 -- sh
sh-4.4$ env | grep AWS
AWS_DEFAULT_REGION=us-west-2
AWS_WEB_IDENTITY_TOKEN_FILE=/var/run/secrets/eks.amazonaws.com/serviceaccount/token
AWS_REGION=us-west-2
AWS_ROLE_ARN=arn:aws:iam::111111111111:role/eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123
AWS_STS_REGIONAL_ENDPOINTS=regional
sh-4.4$ cat /opt/splunk/etc/system/local/default-mode.conf 
[pipeline:remotequeueruleset]
disabled = false

[pipeline:ruleset]
disabled = true

[pipeline:remotequeuetyping]
disabled = false

[pipeline:remotequeueoutput]
disabled = false

[pipeline:typing]
disabled = true

[pipeline:indexerPipe]
disabled = true
    
sh-4.4$ cat /opt/splunk/etc/system/local/outputs.conf 
[remote_queue:sqs-test]
remote_queue.sqs_smartbus.max_count.max_retries_per_part = 4
remote_queue.sqs_smartbus.auth_region = us-west-2
remote_queue.sqs_smartbus.dead_letter_queue.name = sqs-dlq-test
remote_queue.sqs_smartbus.encoding_format = s2s
remote_queue.sqs_smartbus.endpoint = https://sqs.us-west-2.amazonaws.com
remote_queue.sqs_smartbus.large_message_store.endpoint = https://s3.us-west-2.amazonaws.com
remote_queue.sqs_smartbus.large_message_store.path = s3://ingestion/smartbus-test
remote_queue.sqs_smartbus.retry_policy = max_count
remote_queue.sqs_smartbus.send_interval = 5s
remote_queue.type = sqs_smartbus

Install IndexerCluster resource.

$ cat idxc.yaml 
apiVersion: enterprise.splunk.com/v4
kind: ClusterManager
metadata:
  name: cm
  finalizers:
    - enterprise.splunk.com/delete-pvc
spec:
  image: splunk/splunk:${SPLUNK_IMAGE_VERSION}
  serviceAccount: ingestor-sa 
---
apiVersion: enterprise.splunk.com/v4
kind: IndexerCluster
metadata:
  name: indexer
  finalizers:
    - enterprise.splunk.com/delete-pvc
spec:
  image: splunk/splunk:${SPLUNK_IMAGE_VERSION}
  replicas: 3
  clusterManagerRef:
    name: cm
  serviceAccount: ingestor-sa 
  queueRef:
    name: queue
  objectStorageRef:
    name: os

$ kubectl apply -f idxc.yaml

$ kubectl get po
NAME                          READY   STATUS    RESTARTS   AGE
splunk-cm-cluster-manager-0   1/1     Running   0          15m
splunk-indexer-indexer-0      1/1     Running   0          12m
splunk-indexer-indexer-1      1/1     Running   0          12m
splunk-indexer-indexer-2      1/1     Running   0          12m
splunk-ingestor-ingestor-0    1/1     Running   0          27m
splunk-ingestor-ingestor-1    1/1     Running   0          29m
splunk-ingestor-ingestor-2    1/1     Running   0          31m

$ kubectl exec -it splunk-indexer-indexer-0  -- sh 
$ kubectl exec -it splunk-indexer-indexer-1  -- sh 
$ kubectl exec -it splunk-indexer-indexer-2  -- sh 
sh-4.4$ env | grep AWS
AWS_DEFAULT_REGION=us-west-2
AWS_WEB_IDENTITY_TOKEN_FILE=/var/run/secrets/eks.amazonaws.com/serviceaccount/token
AWS_REGION=us-west-2
AWS_ROLE_ARN=arn:aws:iam::111111111111:role/eksctl-ind-ing-sep-demo-addon-iamserviceac-Role1-123456789123
AWS_STS_REGIONAL_ENDPOINTS=regional
sh-4.4$ cat /opt/splunk/etc/system/local/inputs.conf 

[splunktcp://9997]
disabled = 0

[remote_queue:sqs-test]
remote_queue.sqs_smartbus.max_count.max_retries_per_part = 4
remote_queue.sqs_smartbus.auth_region = us-west-2
remote_queue.sqs_smartbus.dead_letter_queue.name = sqs-dlq-test
remote_queue.sqs_smartbus.endpoint = https://sqs.us-west-2.amazonaws.com
remote_queue.sqs_smartbus.large_message_store.endpoint = https://s3.us-west-2.amazonaws.com
remote_queue.sqs_smartbus.large_message_store.path = s3://ingestion/smartbus-test
remote_queue.sqs_smartbus.retry_policy = max_count
remote_queue.type = sqs_smartbus
sh-4.4$ cat /opt/splunk/etc/system/local/outputs.conf 
[remote_queue:sqs-test]
remote_queue.sqs_smartbus.max_count.max_retries_per_part = 4
remote_queue.sqs_smartbus.auth_region = us-west-2
remote_queue.sqs_smartbus.dead_letter_queue.name = sqs-dlq-test
remote_queue.sqs_smartbus.encoding_format = s2s
remote_queue.sqs_smartbus.endpoint = https://sqs.us-west-2.amazonaws.com
remote_queue.sqs_smartbus.large_message_store.endpoint = https://s3.us-west-2.amazonaws.com
remote_queue.sqs_smartbus.large_message_store.path = s3://ingestion/smartbus-test
remote_queue.sqs_smartbus.retry_policy = max_count
remote_queue.sqs_smartbus.send_interval = 5s
remote_queue.type = sqs_smartbus
sh-4.4$ cat /opt/splunk/etc/system/local/default-mode.conf 
[pipeline:remotequeueruleset]
disabled = false

[pipeline:ruleset]
disabled = true

[pipeline:remotequeuetyping]
disabled = false

[pipeline:remotequeueoutput]
disabled = false

[pipeline:typing]
disabled = true

Install Horizontal Pod Autoscaler for IngestorCluster.

$ cat hpa-ing.yaml 
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: ing-hpa
spec:
  scaleTargetRef:
    apiVersion: enterprise.splunk.com/v4
    kind: IngestorCluster
    name: ingestor
  minReplicas: 3
  maxReplicas: 10
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 50

$ kubectl apply -f hpa-ing.yaml

$ kubectl get hpa              
NAME      REFERENCE                  TARGETS              MINPODS   MAXPODS   REPLICAS   AGE
ing-hpa   IngestorCluster/ingestor   cpu: <unknown>/50%   3         10        0          10s

kubectl top pod
NAME                             CPU(cores)   MEMORY(bytes)   
hec-locust-load-29270124-f86gj   790m         221Mi           
splunk-cm-cluster-manager-0      154m         1696Mi          
splunk-indexer-indexer-0         107m         1339Mi          
splunk-indexer-indexer-1         187m         1052Mi          
splunk-indexer-indexer-2         203m         1703Mi          
splunk-ingestor-ingestor-0       97m          517Mi           
splunk-ingestor-ingestor-1       64m          585Mi           
splunk-ingestor-ingestor-2       57m          565Mi  

$ kubectl get po   
NAME                             READY   STATUS    RESTARTS   AGE
hec-locust-load-29270126-szgv2   1/1     Running   0          30s
splunk-cm-cluster-manager-0      1/1     Running   0          41m
splunk-indexer-indexer-0         1/1     Running   0          38m
splunk-indexer-indexer-1         1/1     Running   0          38m
splunk-indexer-indexer-2         1/1     Running   0          38m
splunk-ingestor-ingestor-0       1/1     Running   0          53m
splunk-ingestor-ingestor-1       1/1     Running   0          55m
splunk-ingestor-ingestor-2       1/1     Running   0          57m
splunk-ingestor-ingestor-3       0/1     Running   0          116s
splunk-ingestor-ingestor-4       0/1     Running   0          116s

kubectl top pod
NAME                             CPU(cores)   MEMORY(bytes)   
hec-locust-load-29270126-szgv2   532m         72Mi            
splunk-cm-cluster-manager-0      91m          1260Mi          
splunk-indexer-indexer-0         112m         865Mi           
splunk-indexer-indexer-1         115m         855Mi           
splunk-indexer-indexer-2         152m         1696Mi          
splunk-ingestor-ingestor-0       115m         482Mi           
splunk-ingestor-ingestor-1       76m          496Mi           
splunk-ingestor-ingestor-2       156m         553Mi           
splunk-ingestor-ingestor-3       355m         846Mi           
splunk-ingestor-ingestor-4       1036m        979Mi   

kubectl get hpa
NAME      REFERENCE                  TARGETS         MINPODS   MAXPODS   REPLICAS   AGE
ing-hpa   IngestorCluster/ingestor   cpu: 115%/50%   3         10        10         8m54s

Generate fake load.

HEC_TOKEN: HEC token for making fake calls

$ kubectl get secret splunk-default-secret -o yaml
apiVersion: v1
data:
  hec_token: HEC_TOKEN
  idxc_secret: YWJjZGVmMTIzNDU2Cg==
  pass4SymmKey: YWJjZGVmMTIzNDU2Cg==
  password: YWJjZGVmMTIzNDU2Cg==
  shc_secret: YWJjZGVmMTIzNDU2Cg==
kind: Secret
metadata:
  creationTimestamp: "2025-08-26T10:15:11Z"
  name: splunk-default-secret
  namespace: default
  ownerReferences:
  - apiVersion: enterprise.splunk.com/v4
    controller: false
    kind: IngestorCluster
    name: ingestor
    uid: 12345678-1234-1234-1234-1234567890123
  - apiVersion: enterprise.splunk.com/v4
    controller: false
    kind: ClusterManager
    name: cm
    uid: 12345678-1234-1234-1234-1234567890125
  - apiVersion: enterprise.splunk.com/v4
    controller: false
    kind: IndexerCluster
    name: indexer
    uid: 12345678-1234-1234-1234-1234567890124
  resourceVersion: "123456"
  uid: 12345678-1234-1234-1234-1234567890126
type: Opaque 

$ echo HEC_TOKEN | base64 -d         
HEC_TOKEN

cat loadgen.yaml 
apiVersion: v1
kind: ConfigMap
metadata:
  name: hec-locust-config
data:
  requirements.txt: |
    locust
    requests
    urllib3

  locustfile.py: |
    import urllib3
    from locust import HttpUser, task, between

    # disable insecure‐ssl warnings
    urllib3.disable_warnings(urllib3.exceptions.InsecureRequestWarning)

    class HECUser(HttpUser):
        wait_time = between(1, 2)
        # use HTTPS and explicit port
        host = "https://splunk-ingestor-ingestor-service:8088"

        def on_start(self):
            # turn off SSL cert verification
            self.client.verify = False

        @task
        def send_event(self):
            token = "HEC_TOKEN"
            headers = {
                "Authorization": f"Splunk {token}",
                "Content-Type": "application/json"
            }
            payload = {"event": {"message": "load test", "value": 123}}
            # this will POST to https://…:8088/services/collector/event
            self.client.post(
                "/services/collector/event",
                json=payload,
                headers=headers,
                name="HEC POST"
            )
---
apiVersion: batch/v1
kind: CronJob
metadata:
  name: hec-locust-load
spec:
  schedule: "*/2 * * * *"
  concurrencyPolicy: Replace
  startingDeadlineSeconds: 60
  jobTemplate:
    spec:
      backoffLimit: 1
      template:
        spec:
          containers:
          - name: locust
            image: python:3.9-slim
            command:
              - sh
              - -c
              - |
                pip install --no-cache-dir -r /app/requirements.txt \
                  && exec locust \
                     -f /app/locustfile.py \
                     --headless \
                     -u 200 \
                     -r 50 \
                     --run-time 1m50s
            volumeMounts:
            - name: app
              mountPath: /app
          restartPolicy: OnFailure
          volumes:
          - name: app
            configMap:
              name: hec-locust-config
              defaultMode: 0755

kubectl apply -f loadgen.yaml

$ kubectl get cm                                  
NAME                                  DATA   AGE
hec-locust-config                     2      10s
kube-root-ca.crt                      1      5d2h
splunk-cluster-manager-cm-configmap   1      28m
splunk-default-probe-configmap        3      58m
splunk-indexer-indexer-configmap      1      28m
splunk-ingestor-ingestor-configmap    1      48m

$ kubectl get cj
NAME              SCHEDULE      TIMEZONE   SUSPEND   ACTIVE   LAST SCHEDULE   AGE
hec-locust-load   */2 * * * *   <none>     False     1        2s              26s

$ kubectl get po
NAME                             READY   STATUS    RESTARTS   AGE
hec-locust-load-29270114-zq7zz   1/1     Running   0          15s
splunk-cm-cluster-manager-0      1/1     Running   0          29m
splunk-indexer-indexer-0         1/1     Running   0          26m
splunk-indexer-indexer-1         1/1     Running   0          26m
splunk-indexer-indexer-2         1/1     Running   0          26m
splunk-ingestor-ingestor-0       1/1     Running   0          41m
splunk-ingestor-ingestor-1       1/1     Running   0          43m
splunk-ingestor-ingestor-2       1/1     Running   0          45m

$ aws s3 ls s3://ingestion/smartbus-test/
                           PRE 29DDC1B4-D43E-47D1-AC04-C87AC7298201/
                           PRE 43E16731-7146-4397-8553-D68B5C2C8634/
                           PRE C8A4D060-DE0D-4DCB-9690-01D8902825DC/