|
| 1 | +--- |
| 2 | +title: SRE Agent overview (preview) |
| 3 | +description: Learn how AI-enabled agents help solve problems and support resilient and self-healing systems on your behalf. |
| 4 | +services: container-apps |
| 5 | +author: craigshoemaker |
| 6 | +ms.service: azure-container-apps |
| 7 | +ms.topic: conceptual |
| 8 | +ms.date: 05/02/2025 |
| 9 | +ms.author: cshoe |
| 10 | +--- |
| 11 | + |
| 12 | +# Azure SRE Agent overview |
| 13 | + |
| 14 | +An SRE Agent is AI-enabled service that works on your behalf as you build resilient and self-healing systems. Available as a chatbot, you can ask questions and give natural language commands to maintain your applications and services. |
| 15 | + |
| 16 | +Agents have access to every resource inside the resource groups associated to the agent. Therefore, agents: |
| 17 | + |
| 18 | +- Continuously evaluate resource activity, and monitor active resources |
| 19 | + |
| 20 | +- Send proactive notifications about unhealthy or unstable apps |
| 21 | + |
| 22 | +- Provide a natural language interface to issue commands |
| 23 | + |
| 24 | +An Azure SRE Agent also integrates with [PagerDuty](https://www.pagerduty.com/) to support advanced notification solutions. |
| 25 | + |
| 26 | +> [!NOTE] |
| 27 | +> The SRE Agent feature is in limited preview. To sign up for access, fill out the [SRE Agent application](https://go.microsoft.com/fwlink/?linkid=2319540). |
| 28 | +
|
| 29 | +## Key features |
| 30 | + |
| 31 | +The Azure SRE Agent offers several key features that enhance the reliability and performance of your Azure resources: |
| 32 | + |
| 33 | +- **Proactive monitoring**: Continuous resource monitoring with real-time alerts for potential issues and daily resource reports. |
| 34 | + |
| 35 | +- **Automated mitigation:** Automatic detection and mitigation of common issues, reducing downtime and improving resource health. |
| 36 | + |
| 37 | +- **Resource visualization**: Comprehensive views of your resource dependencies and health status |
| 38 | + |
| 39 | +## Scenarios |
| 40 | + |
| 41 | +| Scenario | Possible cause | Agent mitigation | |
| 42 | +|---|---|---| |
| 43 | +| Application down | ▪ **Application code issues**: Bugs or errors in the application code can lead to crashes or unresponsiveness.<br><br>▪ **Bad deployment**: Incorrect configurations or failed deployments can cause the application to go down.<br><br>▪ **High CPU/memory/thread issues**: Resource exhaustion due to high CPU, memory, or thread usage can affect application performance. | The Azure SRE Agent can detect these issues and provide actionable insights or automated fixes. For example, it can identify high CPU usage and recommend scaling up the resources or suggest code optimizations. | |
| 44 | +| Virtual machine RDP issues | ▪ **Network configuration problems**: Incorrect network settings can prevent Remote Desktop Protocol (RDP) access to virtual machines.<br><br> ▪ **Firewall rules**: Misconfigured firewall rules can block RDP access.<br><br> ▪ **Resource health**: Virtual machine health issues can affect RDP connectivity. | The Azure SRE Agent can monitor virtual machine health and network configurations, providing alerts and recommendations to resolve RDP issues. Agents can also automate the application of correct firewall rules to restore access. | |
| 45 | +| Container image pull failures | ▪ **Registry authentication issues**: Problems with authentication to the container registry can prevent image pulls.<br><br> ▪ **Network connectivity**: Network issues can disrupt the connection to the container registry.<br><br>▪ **Image availability**: The requested image might not be available or could be missing. | The Azure SRE Agent can detect container image pull failures and provide detailed diagnostics. It can recommend solutions such as verifying registry credentials, checking network connectivity, or ensuring the image is available. | |
| 46 | + |
| 47 | +## Security context |
| 48 | + |
| 49 | +An SRE Agent works on your behalf to evaluate and make changes to your Azure resources. Before you can create an agent, you need to make sure you're using an account that has the appropriate security context. |
| 50 | + |
| 51 | +The user account used to create an agent needs `Microsoft.Authorization/roleAssignments/write` permissions using either [Role Based Access Control Administrator](/azure/role-based-access-control/built-in-roles) or [User Access Administrator](/azure/role-based-access-control/built-in-roles). |
| 52 | + |
| 53 | +## Preview access |
| 54 | + |
| 55 | +Access to an SRE Agent is only available as a limited preview. To sign up for access, fill out the [SRE Agent application](https://go.microsoft.com/fwlink/?linkid=2319540). |
0 commit comments