Summary
Retailers, banks, streaming giants, and countless other businesses depend on New Relic as their essential "eyes and ears" to deliver reliable experiences at scale. They turn to us during their most critical moments—new product launches or major streaming events—watching traffic surge on New Relic dashboards and relying on alerts to flag abnormal behavior or errors. We understand the utmost importance of observability at scale because, as engineers, we live and breathe these challenges daily.
This paper will illustrate, through concrete examples, how our engineering organization uses our own product daily to achieve a broad set of critical business objectives, from significant cloud cost savings and boosting developer productivity to continuously improving the customer experience and maintaining high uptime.
New Relic's engineering organization relies exclusively on its own observability platform to maintain unparalleled uptime and low latency. This self-instrumentation, or New Relic on New Relic, is crucial for managing the platform at immense scale, collecting over a trillion data points and executing more than 20 million queries daily, all while significantly reducing operational costs.
As creators and users of the New Relic platform, we develop solutions for our complex observability needs, and these learnings directly refine customer-facing product features. This "inside-out" approach ensures New Relic meets the demands of modern, distributed systems.
We will explore several use cases detailing how our internal teams—SRE, front-end, back-end, platform engineering, network, and others—achieve operational excellence and innovation through comprehensive observability.
We'll explore three key pillars of our reliability strategy:
- Measuring What Matters: Understanding and tracking the metrics crucial for reliable performance and efficient troubleshooting to achieve business objectives.
- Self-Healing Systems: Reducing engineering toil and improving reliability by using automation to respond to and prevent potential issues.
- Mitigating Issues Quickly: Equipping our teams with the tools and insights to rapidly diagnose and resolve problems when they do arise.
Through this overview, you'll gain a deeper understanding of New Relic's commitment to self-observability which not only ensures the reliable performance of our customers' environments but also continuously refines and validates the very platform they rely on. This is our story of innovation, engineering excellence, and the unwavering confidence—backed by our own daily experience—that comes from running on New Relic.