What's included?
dashboards
1
NVML quickstart contains 1 dashboard. These interactive visualizations let you easily explore your data, understand context, and resolve problems faster.
NVML
alerts
1
NVML observability quickstart contains 1 alert. These alerts detect changes in key performance metrics. Integrate these alerts with your favorite tools (like Slack, PagerDuty, etc.) and New Relic will let you know when something needs your attention.
High GPU Temperature
This alert is triggered when the GPU Temperature is exceeds 85 degrees Celsius for 5 minutes.
documentation
1
NVML observability quickstart contains 1 documentation reference. This is how you'll get your data into New Relic.
Why monitor NVML?
Monitoring your NVML metrics is crucial to ensure optimal performance, reliability, and efficiency of your GPU-based systems.
Comprehensive monitoring quickstart for NVML
Monitoring NVML will allow you to effectively track the health and performance of your GPUs, leveraging the capabilities of New Relic for data visualization, alerting, and analysis.
What’s included in this quickstart?
New Relic NVML monitoring quickstart provides quality out-of-the-box reporting:
- Dashboards (power usage, GPU utilisation, clocks, etc)
- Alerts for NVML (GPU temperature, power usage)