/ 
Nvidia GPU Monitoring

Nvidia GPU Monitoring

Monitor NVidia GPUs performance and state.

What's included?

dashboards
1
Nvidia GPU Monitoring quickstart contains 1 dashboard. These interactive visualizations let you easily explore your data, understand context, and resolve problems faster.
alerts
1
Nvidia GPU Monitoring observability quickstart contains 1 alert. These alerts detect changes in key performance metrics. Integrate these alerts with your favorite tools (like Slack, PagerDuty, etc.) and New Relic will let you know when something needs your attention.
High GPU Memory Utilization
This alert is triggered when the Nvidia GPU memory utilization is above 90%.
documentation
1
Nvidia GPU Monitoring observability quickstart contains 1 documentation reference. This is how you'll get your data into New Relic.

Our NVIDIA GPU integration assists you in monitoring the status of GPUs. This integration leverages our infrastructure agent and the Flex integration, which is seamlessly integrated with NVIDIA's SMI utility. It provides you with a pre-built dashboard containing crucial GPU metrics, including GPU utilization, ECC error counts, active compute processes, clock and performance states, temperature, fan speed, as well as dynamic and static information about each supported device.

How to use this quickstart

  • Sign Up for a free New Relic account or Log In to your existing account.
  • Click the install button.
  • Install the quickstart to get started or improve how you monitor your environment. They're filled with pre-built resources like dashboards, instrumentation, and alerts.
Authors
New Relic
Support
BUILT BY COMMUNITY
Need help? Visit our Support Center or check out our community forum, the Explorers Hub.