[BUG] LiteLLM reports wrong output token count (max_tokens vs max_output_tokens) #8454

@fabb

Description

Problem

When using Sonnet 4.5 through LiteLLM on Google Vertex, the following error appears:

LiteLLM streaming error: 400 litellm.BadRequestError: VertexAIException BadRequestError - b'{"type":"error","error":{"type":"invalid_request_error","message":"max_tokens: 200000 > 64000, which is the maximum allowed number of output tokens for claude-sonnet-4-5-20250929"},"request_id":"req_vrtx_011CTeGWyomNL2s6LacBN6w5"}'. Received Model Group=claude-sonnet-4-5

The issue seems to be a confusion between max_tokens and max_output_tokens.

The problem is most likely in this line:

maxTokens: modelInfo.max_tokens || 8192,

I think this line should use max_output_tokens when it is available, and fall back to max_tokens only when it is not.
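
A minimal sketch of the proposed precedence, assuming the modelInfo fields mirror what LiteLLM reports (max_output_tokens alongside max_tokens); the interface and helper name here are made up for illustration and are not the actual Roo Code source:

```typescript
// Hypothetical helper illustrating the proposed fallback order.
interface LiteLLMModelInfo {
	// For some models this reflects the context window, not the output cap.
	max_tokens?: number
	// The per-request output-token ceiling, when the backend reports one.
	max_output_tokens?: number
}

function resolveMaxTokens(modelInfo: LiteLLMModelInfo): number {
	// Prefer the explicit output limit; fall back to max_tokens, then a safe default.
	return modelInfo.max_output_tokens ?? modelInfo.max_tokens ?? 8192
}

// Example: a model reporting a 200k context window but a 64k output cap.
console.log(resolveMaxTokens({ max_tokens: 200000, max_output_tokens: 64000 })) // 64000
```

With this ordering, a Vertex Sonnet 4.5 entry reporting max_tokens: 200000 and max_output_tokens: 64000 would cap requests at 64000 instead of tripping the provider's limit.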

Context

Anyone using Sonnet 4.5 via LiteLLM on Google Vertex.

Reproduction steps

  1. Add Sonnet 4.5 via LiteLLM backed by Google Vertex (a sample proxy config is sketched below)
  2. Send any prompt to the model
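
For reference, a minimal LiteLLM proxy config sketch for step 1; the project and location values are placeholders, not taken from the original report:

```yaml
model_list:
  - model_name: claude-sonnet-4-5
    litellm_params:
      model: vertex_ai/claude-sonnet-4-5
      vertex_project: my-gcp-project # placeholder
      vertex_location: us-east5      # placeholder
```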

Expected result

Prompts should complete successfully

Actual result

Prompts fail because the requests ask for 200k output tokens, while the model's maximum is 64k

App Version

3.28.14

API Provider

LiteLLM

Model Used

Sonnet 4.5 via Google Vertex

Metadata

Labels: Issue - In Progress (someone is actively working on this; should link to a PR soon), bug (something isn't working)

Status: Done