Por el momento, esta página sólo está disponible en inglés.

AI has enormous potential to improve how teams operate systems. It can identify patterns humans miss, process large volumes of data quickly, and surface insights faster than manual investigation.

But production systems impose real constraints.

Reliability, safety, and trust matter more than novelty. Any AI operating in this environment must respect operational boundaries and human judgment.

New Relic SRE Agent, now in preview, is designed to bring agentic AI into operations without breaking that contract.

The growing gap between complexity and human response

Modern systems generate massive amounts of telemetry across distributed services, dependencies, and environments.

During incidents, engineers must process metrics, logs, traces, changes, and alerts simultaneously. Even highly experienced teams struggle to keep up, especially under time pressure.

AI has the potential to help close that gap. But only if it operates with real system context and understands what matters operationally.

Why generic AI fails in production

General-purpose AI tools are not built for operating live systems.

They lack awareness of service ownership, dependency topology, and reliability constraints. They do not understand escalation paths, blast radius, or acceptable risk.

In production, incomplete or misleading assistance is worse than no assistance at all.

SRE teams need AI that is grounded in observability and designed for operational use.

New Relic SRE Agent

SRE Agent is a capability built directly into the New Relic Intelligent Obseservability platform.

It operates on live telemetry and system context to help engineers understand what is happening during incidents and high-pressure situations.

Rather than acting autonomously, SRE Agent supports humans by:

  • Continuously observing system behavior
  • Surfacing relevant signals and patterns
  • Helping engineers focus investigation
  • Reducing time spent searching for context

Its role is to accelerate understanding, not to take action on its own.

Designed with explicit boundaries

Trust is critical in operations.

SRE Agent is intentionally constrained to ensure it enhances reliability rather than introducing risk.

It does not make changes to production systems. It does not bypass approval workflows. It does not override human decisions.

Every insight it provides is grounded in observable data and designed to support human judgment.

How SRE teams use SRE Agent

Teams adopt SRE Agent to reduce cognitive load and improve consistency across on-call rotations.

By guiding engineers toward relevant signals and patterns, SRE Agent helps teams move from reactive troubleshooting to structured investigation.

Over time, this leads to:

  • Faster diagnosis
  • More consistent response outcomes
  • Reduced fatigue during incidents
  • Improved confidence in decision-making

SRE Agent acts as a force multiplier for experienced teams rather than a replacement.

Working alongside existing workflows

SRE Agent is designed to fit naturally into existing operational practices.

It respects current alerting, ownership, and escalation models. It complements, rather than replaces, existing incident response workflows.

This ensures teams can adopt it incrementally without disrupting established processes.

Why this matters now

As systems grow more complex and AI-driven workloads increase, the cost of slow or uncertain response rises.

SRE Agent helps teams safely introduce AI into operations while preserving the control and trust that reliability depends on.
 

Learn more about New Relic SRE Agent.