Policy automations: add retries for scripts & software

## Goal

| User story  |
|:---------------------------------------------------------------------------|
| As an IT admin,
| I want Fleet to retry script runs and software installs up to 3 times by default
| so that I don't have to manually retry these scripts/software.

## Roadmap item

None.

## Original requests

- #24032

## Resources
  
None.

## Changes

### Product
- [x] By default, all script runs and software installs triggered by policy automation are retried up to 3 times. 
- [x] UI changes: No changes
- [x] CLI (fleetctl) usage changes: No changes
- [x] YAML changes: No changes
- [x] REST API changes: No changes
- [x] Fleet's agent (fleetd) changes: No changes
- [x] GitOps mode UI changes: No changes
- [x] GitOps generation changes: No changes
- [x] Activity changes: No changes
- [x] Permissions changes: No changes
- [x] Changes to paid features or tiers: Fleet Premium only. Script and software automations are Fleet Premium only.
- [x] My device and fleetdm.com/better changes: No changes
- [x] Usage statistics: No changes
- [x] Other reference documentation changes: https://github.com/fleetdm/fleet/pull/37120
- [x] First draft of test plan added
- [ ] Once shipped, requester has been notified
- [ ] Once shipped, dogfooding issue has been filed

### Engineering
- [ ] Test plan is finalized
- [ ] Feature guide changes: https://github.com/fleetdm/fleet/pull/37120
- [ ] Database schema migrations: Yes 
- [ ] This is a premium only feature: Yes / No  

> ℹ️  Please read this issue carefully and understand it.  Pay [special attention](https://fleetdm.com/handbook/company/development-groups#developing-from-wireframes) to UI wireframes, especially "dev notes".

## QA

### Risk assessment

- Risk level: Low

### Test plan


> Make sure to go through [the list](https://github.com/fleetdm/fleet/blob/main/docs/Contributing/guides/ui/design-qa-considerations.md) and consider all events that might be related to this story, so we catch edge cases earlier.

Here’s a shortened, flat, checkbox-style version:

---

## ✔️ Policy automation retry test plan (scripts + software)

* [x] Software installs retry up to 3 times when triggered by a policy automation

  * [x] Set up failing software policy (e.g., 1Password not installed).
  * [x] Trigger policy → confirm fail + install attempt # 1 + install attempt # 2 + install attempt # 3.
  * [x] Confirm that the software install stays pending until the third failure. At that point the software is marked as failed.

* [x] Script run retries up to 3 times when triggered by a policy automation

  * [x] Set up failing script policy (e.g., file missing).
  * [x] Trigger policy → confirm fail + script attempt # 1 + attempt # 2 + attempt # 3.
  * [x] Confirm that the script stays pending until the third failure. At that point the script is marked as failed.

* [x] Software stops retrying when it's successful

  * [x] Trigger fail → attempt # 1 is successful
  * [x] Confirm no retries

* [x] Script stops retrying when it's successful

  * [x] Trigger fail → attempt # 1 is successful
  * [x] Fix condition so policy passes.
  * [x] Confirm no retries

* [x] **Regression check: other automations unaffected**

  * [x] Trigger unrelated policy automation.
  * [x] Confirm its behavior is unchanged (not retries).

* [x] Modify policy and make sure pass/fail counts are reset and retries start over again
* [x] Modify policy automations and make sure pass/fail counts are reset and retries start over again




### Testing notes


### Confirmation


1. [ ] Engineer: Added comment to user story confirming successful completion of test plan.
2. [ ] QA: Added comment to user story confirming successful completion of test plan.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Policy automations: add retries for scripts & software #31916

Goal

Roadmap item

Original requests

Resources

Changes

Product

Engineering

QA

Risk assessment

Test plan

✔️ Policy automation retry test plan (scripts + software)

Testing notes

Confirmation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Policy automations: add retries for scripts & software #31916

Description

Goal

Roadmap item

Original requests

Resources

Changes

Product

Engineering

QA

Risk assessment

Test plan

✔️ Policy automation retry test plan (scripts + software)

Testing notes

Confirmation

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions