What's included?
dashboards
1
NVIDIA Triton observability quickstart contains 1 dashboard. These interactive visualizations let you easily explore your data, understand context, and resolve problems faster.
NVIDIA-Triton
alerts
3
NVIDIA Triton observability quickstart contains 3 alerts. These alerts detect changes in key performance metrics. Integrate these alerts with your favorite tools (like Slack, PagerDuty, etc.) and New Relic will let you know when something needs your attention.
CPU Utilization (%)
This alert is triggered when the CPU utilization exceeds 85% for 5 minutes.
Storage Utilization (%)
This alert is triggered when the storage utilization exceeds 85% for 5 minutes.
HTTP Request Failures
This alert is triggered when more than 1 HTTP request failure occurs within 5 minutes.
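To make the "exceeds 85% for 5 minutes" wording concrete, here is a minimal sketch of a sustained-threshold check. It is not how New Relic evaluates alert conditions internally. The function name `breaches_for_duration` and the `(timestamp, value)` sample layout are assumptions made for illustration.

```python
from typing import Sequence, Tuple


def breaches_for_duration(
    samples: Sequence[Tuple[float, float]],
    threshold: float,
    duration_s: float,
) -> bool:
    """Return True if the value stays above `threshold` for at least
    `duration_s` seconds without interruption.

    `samples` is a hypothetical list of (timestamp_seconds, value) pairs,
    sorted by timestamp in ascending order.
    """
    breach_start = None  # timestamp at which the current breach began
    for ts, value in samples:
        if value > threshold:
            if breach_start is None:
                breach_start = ts
            if ts - breach_start >= duration_s:
                return True
        else:
            # Any sample at or below the threshold resets the window.
            breach_start = None
    return False


# One CPU sample per minute, all at 90%, spanning 0 s to 300 s.
cpu_samples = [(minute * 60.0, 90.0) for minute in range(6)]
print(breaches_for_duration(cpu_samples, threshold=85.0, duration_s=300.0))
```

A single sample at or below the threshold resets the window, which is why a brief spike does not trigger this kind of condition.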
documentation
1
NVIDIA Triton observability quickstart contains 1 documentation reference. This is how you'll get your data into New Relic.
Why monitor NVIDIA Triton?
Monitoring ensures optimal performance of your Triton server by tracking metrics such as GPU utilization, memory usage, and inference latency. This allows you to identify bottlenecks and refine your server configuration to enhance performance.
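Triton exposes its metrics in Prometheus text format (by default on port 8002 at `/metrics`). As a rough illustration of what such data looks like before it reaches a dashboard, the sketch below parses a small hand-written sample. The sample values, the `demo` model name, and the helper `parse_metrics` are hypothetical. The metric names are assumed from Triton's metrics documentation and may differ between versions.

```python
import re
from collections import defaultdict
from typing import Dict, List

# Hand-written sample in Prometheus text format; values are illustrative only.
SAMPLE = """\
# HELP nv_gpu_utilization GPU utilization rate [0.0 - 1.0)
# TYPE nv_gpu_utilization gauge
nv_gpu_utilization{gpu_uuid="GPU-example-0"} 0.35
nv_inference_request_failure{model="demo",version="1"} 2
nv_inference_request_success{model="demo",version="1"} 98
"""

# metric name, optional {labels}, then the numeric value
LINE_RE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')


def parse_metrics(text: str) -> Dict[str, List[float]]:
    """Collect every sample value under its metric name, ignoring labels."""
    values: Dict[str, List[float]] = defaultdict(list)
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        match = LINE_RE.match(line)
        if match is None:
            continue  # skip anything this simple pattern cannot read
        name, _labels, value = match.groups()
        values[name].append(float(value))
    return dict(values)


metrics = parse_metrics(SAMPLE)
failures = sum(metrics["nv_inference_request_failure"])
successes = sum(metrics["nv_inference_request_success"])
print(f"failure ratio: {failures / (failures + successes):.2f}")
```

In practice the New Relic integration collects these metrics for you. The parser here only shows the shape of the data and skips details such as timestamps and escaped label values.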
Comprehensive monitoring quickstart for NVIDIA Triton
This quickstart gives a thorough view of Triton server and NVIDIA GPU health, covering memory usage, temperature, HTTP request errors, CPU utilization, and power consumption.
What’s included in this quickstart?
The New Relic NVIDIA Triton monitoring quickstart provides quality out-of-the-box reporting:
- Dashboards (HTTP requests, CPU utilization, pinned memory pool total and used)
- Alerts (CPU used percentage, storage used percentage, and more)