Wrap-Up

5 minutes

Wrap-Up

We hope you enjoyed this workshop, which provided hands-on experience deploying and working with several of the technologies that are used to monitor Cisco AI PODs with Splunk Observability Cloud. Specifically, you had the opportunity to:

Work with a RedHat OpenShift cluster with GPU-based worker nodes.
Work with the NVIDIA NIM Operator and NVIDIA GPU Operator.
Work with Large Language Models (LLMs) deployed using NVIDIA NIM to the cluster.
Deploy the OpenTelemetry Collector in the Red Hat OpenShift cluster.
Add Prometheus receivers to the collector to ingest infrastructure metrics.
Monitor the Weaviate vector database in the cluster.
Configure monitoring for Pure Storage metrics using Prometheus.
Instrument Python services that interact with Large Language Models (LLMs) with OpenTelemetry.
Understand which details which OpenTelemetry captures in the trace from applications that interact with LLMs.