
Conversation

@tawnymanticore (Collaborator) commented Jan 12, 2026

What does this PR do?

bug: GLM 4.7 erroneously below GLM 4.6V

Checklists

  • Tests have been run locally and passed
  • New tests have been added to any work in /lib

Summary by CodeRabbit

  • New Features
    • GLM-4.7 model now available through three providers: OpenRouter, SiliconFlow CN, and Cerebras
    • Supports structured output formatting and advanced reasoning capabilities
    • Expands AI model options for enhanced task processing


coderabbitai bot (Contributor) commented Jan 12, 2026

Walkthrough

The pull request relocates the GLM-4.7 model configuration within the model registry file so that it sits above GLM 4.6V, and updates its provider definitions (OpenRouter, SiliconFlow CN, and Cerebras) with adjusted model IDs and structured output modes.

Changes

GLM-4.7 Model Registry Relocation (libs/core/kiln_ai/adapters/ml_model_list.py):
Removes the existing GLM-4.7 model entry and re-adds it at its new position with updated provider configurations: OpenRouter (json_instructions mode), SiliconFlow CN (json_instructions mode with reasoning_optional_for_structured_output), and Cerebras (json_schema mode). All three providers are configured with reasoning capabilities enabled.
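For orientation, here is a minimal sketch of what the relocated entry could look like. The shape is inferred from this thread: KilnModel, KilnModelProvider, ModelName, ModelProviderName, the two structured output modes, and the three model IDs are all named elsewhere on this page, but the exact field names, enum members, and import paths are assumptions, not code copied from the diff.

```python
# Assumed imports; the actual locations of these symbols may differ.
from kiln_ai.adapters.ml_model_list import (
    KilnModel,
    KilnModelProvider,
    ModelName,
    ModelProviderName,
    StructuredOutputMode,
)

# Sketch of the relocated entry; field names and enum members are
# assumptions based on this review thread, not copied from the diff.
glm_4_7 = KilnModel(
    name=ModelName.glm_4_7,  # hypothetical enum member
    friendly_name="GLM 4.7",
    providers=[
        KilnModelProvider(
            name=ModelProviderName.openrouter,
            model_id="z-ai/glm-4.7",
            structured_output_mode=StructuredOutputMode.json_instructions,
            reasoning_capable=True,
        ),
        KilnModelProvider(
            name=ModelProviderName.siliconflow_cn,
            model_id="Pro/zai-org/GLM-4.7",
            structured_output_mode=StructuredOutputMode.json_instructions,
            reasoning_capable=True,
            # SiliconFlow may omit reasoning under structured output.
            reasoning_optional_for_structured_output=True,
        ),
        KilnModelProvider(
            name=ModelProviderName.cerebras,
            model_id="zai-glm-4.7",
            structured_output_mode=StructuredOutputMode.json_schema,
            reasoning_capable=True,
        ),
    ],
)
```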

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~8 minutes

Suggested reviewers

  • scosman
  • leonardmq

Poem

🐰 The GLM-4.7 hops to a brighter place,
New providers dance with structured grace,
Cerebras, OpenRouter, Silicon's gleam—
A reorganized model registry dream! 🌟

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 inconclusive)

  • Description check (❓ Inconclusive): The description is missing the 'Related Issues' section and lacks detail about the actual change, though the required checklists are completed. Resolution: add a 'Related Issues' section referencing issue #931 and expand 'What does this PR do?' to explain the specific reordering change.

✅ Passed checks (2 passed)

  • Title check (✅ Passed): The title clearly describes the main change: relocating GLM 4.7 to its correct position above GLM 4.6V in the model list.
  • Docstring Coverage (✅ Passed): No functions found in the changed files to evaluate docstring coverage; skipping the docstring coverage check.



📜 Recent review details

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 811194a and 133e01b.

📒 Files selected for processing (1)
  • libs/core/kiln_ai/adapters/ml_model_list.py
🧰 Additional context used
📓 Path-based instructions (2)
libs/core/**/*.py

📄 CodeRabbit inference engine (.cursor/rules/project.mdc)

Use Python 3.10+ for the core library (libs/core) and Python 3.13 for the desktop app

Files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
**/*.py

📄 CodeRabbit inference engine (.cursor/rules/project.mdc)

**/*.py: Use asyncio for asynchronous operations in Python
Use Pydantic v2 (not v1) for data validation

Files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
🧠 Learnings (8)
📓 Common learnings
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 418
File: libs/core/kiln_ai/adapters/ml_model_list.py:1979-1983
Timestamp: 2025-08-08T16:14:54.346Z
Learning: In libs/core/kiln_ai/adapters/ml_model_list.py, for ModelName.deepseek_r1_distill_qwen_32b on provider siliconflow_cn (model_id "deepseek-ai/DeepSeek-R1-Distill-Qwen-32B"), do not set a parser (r1_thinking/optional_r1_thinking). Verified via testing that SiliconFlow returns final outputs that parse without an R1 parser, and reasoning may be omitted under structured output; rely on reasoning_optional_for_structured_output=True.
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 418
File: libs/core/kiln_ai/adapters/ml_model_list.py:0-0
Timestamp: 2025-07-16T09:37:39.816Z
Learning: The `glm_z1_rumination_32b_0414` model was intentionally removed from the built_in_models list due to output formatting issues: output was duplicated in both `output` and `reasoning` fields, and contained random internal JSON in the output. This model should not be re-added without addressing these formatting problems.
📚 Learning: 2025-08-08T16:13:26.526Z
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 418
File: libs/core/kiln_ai/adapters/ml_model_list.py:1875-1890
Timestamp: 2025-08-08T16:13:26.526Z
Learning: In libs/core/kiln_ai/adapters/ml_model_list.py (Python), do not blanket-add r1_thinking/optional_r1_thinking parsers for R1-style models. Parser usage is provider-specific and must be based on observed responses in tests. For PR Kiln-AI/Kiln#418, deepseek_r1_0528_distill_qwen3_8b providers were validated without needing a parser.

Applied to files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
📚 Learning: 2025-08-08T16:14:54.346Z
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 418
File: libs/core/kiln_ai/adapters/ml_model_list.py:1979-1983
Timestamp: 2025-08-08T16:14:54.346Z
Learning: In libs/core/kiln_ai/adapters/ml_model_list.py, for ModelName.deepseek_r1_distill_qwen_32b on provider siliconflow_cn (model_id "deepseek-ai/DeepSeek-R1-Distill-Qwen-32B"), do not set a parser (r1_thinking/optional_r1_thinking). Verified via testing that SiliconFlow returns final outputs that parse without an R1 parser, and reasoning may be omitted under structured output; rely on reasoning_optional_for_structured_output=True.

Applied to files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
📚 Learning: 2025-08-08T16:19:20.074Z
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 418
File: libs/core/kiln_ai/adapters/ml_model_list.py:1736-1741
Timestamp: 2025-08-08T16:19:20.074Z
Learning: In libs/core/kiln_ai/adapters/ml_model_list.py (Python), for ModelName.qwq_32b with ModelProviderName.siliconflow_cn (model_id "Qwen/QwQ-32B"), do not set a parser (r1_thinking/optional_r1_thinking). Verified via tests/responses that SiliconFlow returns outputs that parse correctly without an R1 parser. Parser usage remains provider-specific and should only be added when validated.

Applied to files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
📚 Learning: 2025-07-16T09:37:39.816Z
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 418
File: libs/core/kiln_ai/adapters/ml_model_list.py:0-0
Timestamp: 2025-07-16T09:37:39.816Z
Learning: The `glm_z1_rumination_32b_0414` model was intentionally removed from the built_in_models list due to output formatting issues: output was duplicated in both `output` and `reasoning` fields, and contained random internal JSON in the output. This model should not be re-added without addressing these formatting problems.

Applied to files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
📚 Learning: 2025-08-08T15:50:45.334Z
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 418
File: libs/core/kiln_ai/adapters/ml_model_list.py:221-228
Timestamp: 2025-08-08T15:50:45.334Z
Learning: In libs/core/kiln_ai/adapters/ml_model_list.KilnModelProvider, naming policy: keep cross-provider feature flags unprefixed (e.g., reasoning_optional_for_structured_output) to allow reuse across providers; prefix provider-specific toggles with the provider name (e.g., siliconflow_enable_thinking) when the implementation is specific and potentially unsafe to generalize.

Applied to files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
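
The naming policy in that learning is easier to see as code. A hypothetical sketch, assuming Pydantic v2 per the path-based instructions above; the two field names come from the learning itself, while the surrounding class body is illustrative:

```python
from pydantic import BaseModel

class KilnModelProvider(BaseModel):
    # Cross-provider feature flag: unprefixed so any provider can reuse it.
    reasoning_optional_for_structured_output: bool = False
    # Provider-specific toggle: prefixed, because the implementation is
    # specific to SiliconFlow and potentially unsafe to generalize.
    siliconflow_enable_thinking: bool | None = None
```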
📚 Learning: 2025-08-08T16:15:20.796Z
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 418
File: libs/core/kiln_ai/adapters/ml_model_list.py:2434-2439
Timestamp: 2025-08-08T16:15:20.796Z
Learning: In libs/core/kiln_ai/adapters/ml_model_list.py (Python), for ModelName.qwen_3_8b_no_thinking with ModelProviderName.siliconflow_cn (model_id "Qwen/Qwen3-8B"), the qwen3_style_no_think formatter is not required: SiliconFlow’s non‑thinking Qwen3‑8B returns clean output without <answer> wrappers. Formatter/parser usage is provider-specific and should be added only when verified by tests/responses.

Applied to files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
📚 Learning: 2025-08-22T11:15:53.584Z
Learnt from: leonardmq
Repo: Kiln-AI/Kiln PR: 413
File: libs/core/kiln_ai/adapters/embedding/embedding_registry.py:14-20
Timestamp: 2025-08-22T11:15:53.584Z
Learning: In the Kiln AI codebase, model_provider_name fields in datamodels like EmbeddingConfig are typed as ModelProviderName enum (which inherits from str, Enum), not as plain strings. Therefore, accessing .value on these fields is valid and returns the string representation of the enum value.

Applied to files:

  • libs/core/kiln_ai/adapters/ml_model_list.py
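
The last learning is straightforward to demonstrate: an enum that mixes in str compares equal to its plain string value, so both direct comparison and .value access are valid. A self-contained sketch with hypothetical members:

```python
from enum import Enum

class ModelProviderName(str, Enum):
    # Hypothetical members for illustration.
    openrouter = "openrouter"
    siliconflow_cn = "siliconflow_cn"

# Because the enum inherits from str, both of these hold:
assert ModelProviderName.openrouter == "openrouter"
assert ModelProviderName.openrouter.value == "openrouter"
```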
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (6)
  • GitHub Check: Generate Coverage Report
  • GitHub Check: Build Desktop Apps (windows-latest)
  • GitHub Check: Build Desktop Apps (ubuntu-22.04-arm)
  • GitHub Check: Build Desktop Apps (macos-latest)
  • GitHub Check: Build Desktop Apps (ubuntu-22.04)
  • GitHub Check: Build Desktop Apps (macos-15-intel)
🔇 Additional comments (1)
libs/core/kiln_ai/adapters/ml_model_list.py (1)

5188-5214: Model ID patterns need verification before approval.

The GLM 4.7 ordering fix is correct, but the provider configuration has inconsistencies that require verification:

  1. SiliconFlow model_id: GLM 4.7 uses "Pro/zai-org/GLM-4.7" (with Pro/ prefix), while GLM 4.6 and GLM 4.6V both use "zai-org/..." without the prefix. Verify this prefix is intentional and correct for the GLM 4.7 endpoint on SiliconFlow.

  2. OpenRouter model_id: GLM 4.6 uses "z-ai/glm-4.6:exacto" (with :exacto suffix), while GLM 4.7 uses "z-ai/glm-4.7" without the suffix. Confirm whether OpenRouter dropped the suffix for newer versions or if GLM 4.7 should include it.

  3. reasoning_optional_for_structured_output: Present on GLM 4.7 and GLM 4.6V SiliconFlow, but absent on GLM 4.6 SiliconFlow. Verify this is intentional based on model capabilities.

Cerebras model_ids follow the consistent zai-glm-X pattern.
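
One way to verify point 2 without guessing: OpenRouter exposes an unauthenticated model listing at /api/v1/models, so the candidate slugs can be checked directly. A rough sketch; whether :exacto variants appear as separate entries in that listing is an assumption worth confirming against OpenRouter's docs:

```python
import requests

# Fetch OpenRouter's public model catalog (no API key needed for listing).
resp = requests.get("https://openrouter.ai/api/v1/models", timeout=30)
resp.raise_for_status()
available = {model["id"] for model in resp.json()["data"]}

# Candidate slugs from the review points above.
for slug in ("z-ai/glm-4.7", "z-ai/glm-4.7:exacto", "z-ai/glm-4.6:exacto"):
    print(f"{slug}: {'listed' if slug in available else 'not listed'}")
```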



@gemini-code-assist

Summary of Changes

Hello @tawnymanticore, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses a minor bug where the GLM 4.7 model definition was incorrectly positioned after GLM 4.6V in the model list. The change reorders these entries to maintain logical versioning, ensuring the model list is consistently and correctly structured.

Highlights

  • Model Definition Ordering: Corrected the order of the GLM 4.7 model definition within the ml_model_list.py file.
  • Logical Consistency: Ensured that GLM 4.7 is now listed before GLM 4.6V, resolving an erroneous ordering.



Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.


@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request addresses a bug where the GLM 4.7 model was incorrectly positioned after GLM 4.6V in the model list. The change moves the GLM 4.7 definition to its correct place, ensuring the models are listed in a logical, version-descending order. The fix is simple, accurate, and improves the maintainability of the model list. I've reviewed the changes and found no issues. The code is ready to be merged.

@github-actions

📊 Coverage Report

Overall Coverage: 91%

Diff: origin/remote_config...HEAD

No lines with coverage information in this diff.


@tawnymanticore tawnymanticore merged commit 2909d08 into remote_config Jan 12, 2026
18 checks passed
