NVML

Monitor and analyze your NVML infrastructure with New Relic.

What's included?

dashboards
1
NVML quickstart contains 1 dashboard. These interactive visualizations let you easily explore your data, understand context, and resolve problems faster.
alerts
1
NVML observability quickstart contains 1 alert. These alerts detect changes in key performance metrics. Integrate these alerts with your favorite tools (like Slack, PagerDuty, etc.) and New Relic will let you know when something needs your attention.
High GPU Temperature
This alert is triggered when the GPU temperature exceeds 85 degrees Celsius for 5 minutes.
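To make the "above 85 degrees Celsius for 5 minutes" condition concrete, here is a minimal Python sketch of a sustained-threshold check over timestamped temperature samples. It only illustrates the idea and is not how New Relic evaluates alert conditions; the function name `sustained_breach`, its parameters, and the sample format are all hypothetical.

```python
from typing import Iterable, Tuple


def sustained_breach(
    samples: Iterable[Tuple[float, float]],
    threshold_c: float = 85.0,
    duration_s: float = 300.0,
) -> bool:
    """Return True if the temperature stays strictly above threshold_c
    for at least duration_s seconds without interruption.

    samples: (timestamp_in_seconds, temperature_in_celsius) pairs,
             assumed to be sorted by timestamp (hypothetical format).
    """
    breach_start = None  # timestamp where the current run of hot samples began
    for ts, temp_c in samples:
        if temp_c > threshold_c:
            if breach_start is None:
                breach_start = ts
            # The run has lasted long enough to count as sustained.
            if ts - breach_start >= duration_s:
                return True
        else:
            # Any sample at or below the threshold resets the run.
            breach_start = None
    return False


if __name__ == "__main__":
    # One sample per minute, all at 90 degrees Celsius, covering 0 s to 300 s.
    hot = [(t * 60.0, 90.0) for t in range(6)]
    print(sustained_breach(hot))
```

A single reading at or below the threshold resets the timer in this sketch, so short spikes do not trigger it; only an uninterrupted hot period does.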
documentation
1
NVML observability quickstart contains 1 documentation reference. This is how you'll get your data into New Relic.

Why monitor NVML?

Monitoring your NVML metrics is crucial to ensure optimal performance, reliability, and efficiency of your GPU-based systems.

Comprehensive monitoring quickstart for NVML

Monitoring NVML will allow you to effectively track the health and performance of your GPUs, leveraging the capabilities of New Relic for data visualization, alerting, and analysis.

What’s included in this quickstart?

The New Relic NVML monitoring quickstart provides out-of-the-box reporting:

  • Dashboards (power usage, GPU utilisation, clocks, etc.)
  • Alerts for NVML (GPU temperature, power usage)

How to use this quickstart

  • Sign up for a free New Relic account or log in to your existing account.
  • Click the install button.
  • Install the quickstart to get started or improve how you monitor your environment. Quickstarts are filled with pre-built resources like dashboards, instrumentation, and alerts.
Authors
New Relic
Ramana Reddy
Support
BUILT BY NEW RELIC
Need help? Visit our Support Center or check out our community forum, the Explorers Hub.