You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat(consult,debate): update Gemini 3.1 and Codex model defaults (#234) (#244)
* feat(consult,debate): update Gemini 3.1 as default for high effort tier (#234)
Update the Gemini model default for the `high` effort tier from
`gemini-3-pro-preview` to `gemini-3.1-pro-preview` across all consult
and debate configuration files. The `max` tier already uses
`gemini-3.1-pro-preview` and is unchanged.
Updated across 3 platforms (Claude Code plugins, OpenCode adapter,
Codex adapter) in skill files, command files, and README.
* fix(consult,debate): address review findings from gemini-3.1 update
- Update Copilot picker labels from gemini-3-pro to gemini-3.1-pro
in plugins/consult/commands/consult.md,
adapters/opencode/commands/consult.md,
adapters/codex/skills/consult/SKILL.md
- Add gemini-3.1-pro-preview to expectedModels assertion in
debate-command.test.js to catch regressions
- Add gemini high-effort model assertion in debate-command.test.js
for consult skill adapter sync
- Update docs/consult-command-test-strategy.md stale model references
* fix(consult,debate): update stale Codex and Gemini low-tier model defaults
- Codex: replace o4-mini/o3 with gpt-5.3-codex across all effort tiers
in consult and debate skill files, command files, and adapters
- Gemini low tier: replace gemini-2.5-flash with gemini-3-flash-preview
(now consistent: low=gemini-3-flash-preview, medium=gemini-3-flash-preview,
high/max=gemini-3.1-pro-preview)
- Update model picker label for Gemini flash in consult command files
- Update README, top picks, and test strategy doc
- Fix debate-command.test.js expectedModels and consult adapter sync assertions
to reflect current model names (remove o4-mini/o3/gemini-2.5-flash,
add gpt-5.3-codex/gemini-3-flash-preview/gemini-3.1-pro-preview)
* fix(consult,debate): use full gemini-3.1-pro-preview API name consistently
- Update picker labels and example invocations from 'gemini-3.1-pro'
to 'gemini-3.1-pro-preview' to match the effort table API model name
(plugins/consult/commands, adapters/opencode/commands/consult,
adapters/codex/skills/consult)
- Fix debate state-schema JSON examples in plugins/debate/skills and
adapters/opencode/skills/debate to use 'gemini-3.1-pro-preview'
- Update docs/consult-command-test-strategy.md to use full preview name
- Strengthen test regression guard to cover both high and max rows
* docs: add CHANGELOG entry for gemini-3.1 model defaults update (#234)
Copy file name to clipboardExpand all lines: CHANGELOG.md
+2Lines changed: 2 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,6 +21,8 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
21
21
22
22
-**`/debate` External Tool Quick Reference** — Added a "External Tool Quick Reference" section to all copies of the debate skill (`plugins/debate/skills/debate/SKILL.md`, OpenCode and Codex adapters) with safe command patterns, effort-to-model mapping tables, and output parsing expressions. The section includes a canonical-source pointer to `plugins/consult/skills/consult/SKILL.md` so the debate orchestrator doesn't duplicate provider logic. Added pointer notes in `debate-orchestrator` agents. Fixes issue #232.
23
23
24
+
-**`/consult` and `/debate` model defaults update** — Gemini high/max effort now uses `gemini-3.1-pro-preview`; Gemini low/medium uses `gemini-3-flash-preview`. Codex uses `gpt-5.3-codex` for all effort tiers. Updated across all platforms: Claude Code plugin, OpenCode adapter, and Codex adapter for both consult and debate skills and commands. Fixes issue #234.
25
+
24
26
-**`/consult` model name updates** — Updated stale model names in the consult skill: Codex models are now `o4-mini` (low/medium) and `o3` (high/max); Gemini models include `gemini-3-flash-preview`, `gemini-3-pro-preview`, and `gemini-3.1-pro-preview`. Synced to OpenCode adapter consult skill. Fixes issue #232.
25
27
26
28
-**`/next-task` Phase 12 ship invocation** — Phase 12 now invokes `ship:ship` via `await Skill({ name: "ship:ship", args: ... })` instead of `Task({ subagent_type: "ship:ship", ... })`. `ship:ship` is a skill, not an agent; the previous `Task()` call silently failed, leaving the workflow stuck after delivery validation with no PR created. The Codex adapter is updated in parity and regression tests are added. Fixes issue #230.
Example: "Is this the right approach?" --tool=gemini --effort=high --model=gemini-3-pro
236
+
Example: "Is this the right approach?" --tool=gemini --effort=high --model=gemini-3.1-pro-preview
237
237
```
238
238
239
239
The skill handles the full consultation lifecycle: model resolution, command building, context packaging, execution with 120s timeout, and returns a plain JSON result.
Example: "Is this the right approach?" --tool=gemini --effort=high --model=gemini-3-pro
244
+
Example: "Is this the right approach?" --tool=gemini --effort=high --model=gemini-3.1-pro-preview
245
245
```
246
246
247
247
The skill handles the full consultation lifecycle: model resolution, command building, context packaging, execution with 120s timeout, and returns a plain JSON result.
Note: `codex exec` is the non-interactive/headless mode. There is no `-q` flag. The TUI mode is `codex` (no subcommand).
91
91
92
-
Models: o4-mini, o3
92
+
Models: gpt-5.3-codex
93
93
94
94
| Effort | Model | Reasoning |
95
95
|--------|-------|-----------|
96
-
| low |o4-mini| low |
97
-
| medium |o4-mini| medium |
98
-
| high |o3| high |
99
-
| max |o3| high |
96
+
| low |gpt-5.3-codex| low |
97
+
| medium |gpt-5.3-codex| medium |
98
+
| high |gpt-5.3-codex| high |
99
+
| max |gpt-5.3-codex| high |
100
100
101
101
**Parse output**: `JSON.parse(stdout).message` or raw text
102
102
**Session ID**: Codex prints a resume hint at session end (e.g., `codex resume SESSION_ID`). Extract the session ID from stdout or from `JSON.parse(stdout).session_id` if available.
0 commit comments