fix: resolve three architectural gaps in wrapper layer by praisonai-triage-agent[bot] · Pull Request #1867 · MervinPraison/PraisonAI

praisonai-triage-agent · 2026-06-08T08:24:32Z

This PR addresses the three validated architectural gaps in the wrapper layer as detailed in issue #1866:

Gap 1: OpenAI-only credential resolver

Problem: resolve_llm_endpoint() always returned OPENAI_API_KEY, causing multi-provider model selection to use wrong credentials.

Fix: Added provider mapping that correctly routes model prefixes (anthropic/, groq/, google/, etc.) to their respective API key environment variables while maintaining backward compatibility.

Gap 2: SubprocessSandbox ignores SecurityPolicy/ResourceLimits

Problem: SubprocessSandbox was not actually a sandbox - it inherited all host environment variables and ignored security policies and most resource limits.

Fix:

Build child environment based on SecurityPolicy instead of copying host environment
Apply resource limits via POSIX setrlimit where supported
Enforce max_output_size to prevent OOM attacks
Use process groups for proper timeout cleanup
Added warnings when isolation is limited (Windows, missing modules)

Gap 3: inc/models.py mutates root logger at import

Problem: inc/models.py called logging.basicConfig() at module import, violating the project's logging policy and hijacking embedders' root logger configuration.

Fix: Removed the logging.basicConfig() call. CLI logging configuration remains through _logging.py as intended.

Testing

All fixes have been validated:

Credential resolver now correctly maps anthropic/claude-3-5-sonnet to ANTHROPIC_API_KEY
SubprocessSandbox with SecurityPolicy.strict() no longer leaks host environment variables
Root logger is no longer mutated during import

🤖 Generated with Claude Code

Summary by CodeRabbit

New Features
- Added support for multiple LLM providers including Anthropic, Groq, and Ollama with provider-specific configuration handling.
Bug Fixes
- Improved subprocess sandbox security with resource limiting and enhanced timeout handling for better reliability.

Gap 1: resolve_llm_endpoint() now maps model prefixes to correct provider API keys - Added provider mapping for anthropic/, groq/, google/, openrouter/, etc. - Stops hardcoding OPENAI_API_KEY for all providers - Maintains fallback to OPENAI_API_KEY for default OpenAI models Gap 2: SubprocessSandbox now enforces SecurityPolicy and ResourceLimits - Builds child environment from policy instead of copying host environment - Applies resource limits via POSIX setrlimit where supported - Enforces max_output_size to prevent OOM attacks - Uses process groups for proper timeout cleanup - Provides warnings when isolation is limited (Windows, missing modules) Gap 3: Remove root logger mutation from inc/models.py - Removed logging.basicConfig() call that violated project logging policy - CLI logging configuration remains through _logging.py as intended - Embedders no longer have their root logger hijacked at import time Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-06-08T08:24:41Z

@coderabbitai review

MervinPraison · 2026-06-08T08:24:42Z

/review

qodo-code-review · 2026-06-08T08:24:46Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

coderabbitai · 2026-06-08T08:24:47Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai · 2026-06-08T08:24:56Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 60625051-11e9-45b6-bf18-f4fd9a10567e

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

📝 Walkthrough

Walkthrough

This PR addresses three security and provider-support gaps: removes module-level logging configuration that mutated the root logger, implements provider-aware LLM endpoint resolution (api_key and base_url) instead of hardcoding OpenAI, and enforces SecurityPolicy in SubprocessSandbox through environment isolation, resource limits, and process group timeout management.

Changes

Security enforcement and multi-provider LLM configuration

Layer / File(s)	Summary
Remove module-level logging configuration `src/praisonai/praisonai/inc/models.py`	Deleted `logging.basicConfig()` call at import time, eliminating side effects on the root logger and deferring logging setup to the application entry point.
Provider-aware LLM endpoint resolution `src/praisonai/praisonai/llm/env.py`	Added `_PROVIDER_MAP` to map model prefixes (`anthropic/`, `groq/`, `ollama/`, etc.) to provider-specific API key env vars and default base URLs. Refactored `resolve_llm_endpoint` and introduced `_provider_from_model` helper to resolve api_key and base_url from the provider-specific mapping instead of always using `OPENAI_API_KEY`.
SubprocessSandbox security infrastructure `src/praisonai/praisonai/sandbox/subprocess.py`	Introduced `_build_child_env` to construct child process environment from SecurityPolicy with conditional proxy passthrough when `allow_network=True`, and `_apply_rlimits` to enforce POSIX resource limits (memory, max processes, max open files) via `preexec_fn`. Imported `SecurityPolicy` to support policy-driven isolation.
SubprocessSandbox execute method enforcement `src/praisonai/praisonai/sandbox/subprocess.py`	Updated `execute` to use SecurityPolicy-based child environment, spawn subprocesses with `start_new_session=True` and resource limits, truncate stdout/stderr to `security_policy.max_output_size`, and kill entire process group on timeout via `os.killpg(SIGKILL)` with fallback to `proc.kill()`.
SubprocessSandbox run_command method enforcement `src/praisonai/praisonai/sandbox/subprocess.py`	Applied the same SecurityPolicy environment isolation, new process session, resource limits, output truncation, and process group termination to `run_command` as in `execute`.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

MervinPraison/PraisonAI#1621: Reworks LLM endpoint resolution in src/praisonai/praisonai/llm/env.py for provider-aware model/base_url/api_key derivation.
MervinPraison/PraisonAI#785: Removes OpenAI-key/provider handling constraints so non-OpenAI models work without requiring OPENAI_API_KEY, complementing the provider-specific API key resolution in this PR.
MervinPraison/PraisonAI#833: Removes import-time logging.basicConfig configuration, directly aligned with the logging cleanup in this PR.

Suggested labels

Review effort 2/5

Suggested reviewers

MervinPraison

Poem

🐰 Hops through security gates with glee,
Provider-aware, from anthropic to ollama we see,
No more OpenAI's stranglehold tight,
Process groups reaped just right,
Sandboxes sing with limits so free! ✨

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately describes the PR's primary objective: resolving three architectural gaps in the wrapper layer (credential resolution, SecurityPolicy enforcement, and root logger mutation).
Linked Issues check	✅ Passed	All three gaps from issue `#1866` are addressed: provider-aware credential resolver in llm/env.py, SubprocessSandbox security policy enforcement with resource limits and process group handling, and removal of module-level logging.basicConfig() from inc/models.py.
Out of Scope Changes check	✅ Passed	All changes are directly aligned with the three gaps defined in issue `#1866`; no unrelated modifications detected across the three modified files.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch claude/issue-1866-20260608-0820

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

MervinPraison · 2026-06-08T08:25:04Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

greptile-apps · 2026-06-08T08:28:23Z

Greptile Summary

This PR addresses three architectural gaps in the wrapper layer: removing a root-logger mutation at import time in inc/models.py, adding a multi-provider credential resolver in llm/env.py, and tightening SubprocessSandbox to build a policy-driven child environment and apply POSIX rlimits instead of inheriting the host environment.

inc/models.py: The two-line logging.basicConfig() call is removed; no other logic is touched.
llm/env.py: _PROVIDER_MAP maps model prefixes (e.g. anthropic/, groq/) to their respective API-key env vars and default base URLs; resolve_llm_endpoint now selects the correct credential per provider. The or (os.environ.get(\"OPENAI_API_KEY\") if key_var == \"OPENAI_API_KEY\" else None) fallback clause is unreachable dead code — when key_var is already \"OPENAI_API_KEY\", the first os.environ.get(key_var) already covers it.
sandbox/subprocess.py: _build_child_env replaces the os.environ.copy() pattern with a minimal, policy-scoped environment; _apply_rlimits applies POSIX setrlimit calls via preexec_fn; timeout handling now kills the entire process group via os.killpg on POSIX, guarded by os.name == \"posix\" to avoid AttributeError on Windows.

Confidence Score: 3/5

The credential-resolver and logging fixes are safe, but the sandbox still buffers the full subprocess output into memory before any size check, and the new preexec_fn path calls Python's lock-backed logging in a post-fork context.

The proc.communicate() call in both execute and run_command accumulates the entire subprocess output before the max_output_size truncation runs — a gigabyte-producing child process can exhaust host memory before the check fires. This was raised in the prior review round and remains unaddressed. Additionally, _apply_rlimits calls logger.warning() inside preexec_fn; Python's logging module holds internal locks, and if a lock was held at fork time (e.g. from a ThreadPoolExecutor or concurrent log emission), the child deadlocks. The llm/env.py and inc/models.py changes are clean and low-risk.

src/praisonai/praisonai/sandbox/subprocess.py — the output-buffering path and the preexec_fn logging calls both need attention before this is safe to run against untrusted code.

Important Files Changed

Filename	Overview
src/praisonai/praisonai/inc/models.py	Removes the erroneous logging.basicConfig() call at import time; straightforward and safe cleanup with no side-effects.
src/praisonai/praisonai/llm/env.py	Adds _PROVIDER_MAP and _provider_from_model for multi-provider credential routing; key_var == OPENAI_API_KEY fallback condition is now correct but the or (...) branch is unreachable dead code since the first os.environ.get(key_var) already handles the OpenAI case; file still lacks a trailing newline.
src/praisonai/praisonai/sandbox/subprocess.py	Adds policy-driven child environment and rlimit enforcement; _apply_rlimits uses logger.warning() inside preexec_fn (fork-unsafe when threads are present), and proc.communicate() still buffers unlimited output before the max_output_size check runs (OOM protection is not truly enforced).

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[resolve_llm_endpoint called] --> B[Read MODEL_NAME / OPENAI_MODEL_NAME]
    B --> C{model prefix in _PROVIDER_MAP?}
    C -- yes --> D[key_var = provider key e.g. ANTHROPIC_API_KEY]
    C -- no --> E[key_var = OPENAI_API_KEY]
    D --> F[base_url = OPENAI_BASE_URL or provider_base or default]
    E --> F
    D --> G[api_key = os.environ.get provider key or None]
    E --> H[api_key = os.environ.get OPENAI_API_KEY]
    F --> I[Return LLMEndpoint]
    G --> I
    H --> I

    subgraph SubprocessSandbox
        J[_build_child_env] --> K{policy.allow_network?}
        K -- yes --> L[include proxy vars]
        K -- no --> M[skip proxy vars]
        L --> N[set minimal PATH and HOME]
        M --> N
        N --> O[asyncio.create_subprocess_exec]
        O -- POSIX --> P[start_new_session + preexec_fn=_apply_rlimits]
        O -- Windows --> Q[no isolation]
        P --> R[proc.communicate with timeout]
        Q --> R
        R -- timeout --> S{os.name == posix?}
        S -- yes --> T[os.killpg SIGKILL]
        S -- no --> U[proc.kill]
        R -- success --> W[truncate to max_output_size]
        W --> X[return COMPLETED]
    end

_{Reviews (2): Last reviewed commit: "fix: resolve critical bugs in architectu..." | Re-trigger Greptile}

greptile-apps · 2026-06-08T08:28:29Z

            try:
+                # Enforce max_output_size by reading incrementally
+                max_output_size = self.config.security_policy.max_output_size
+                stdout_data = b""
+                stderr_data = b""
+
                stdout, stderr = await asyncio.wait_for(
                    proc.communicate(),
                    timeout=limits.timeout_seconds,
                )

+                # Truncate output if it exceeds max_output_size
+                if max_output_size and max_output_size > 0:
+                    if len(stdout) > max_output_size:
+                        stdout = stdout[:max_output_size] + b"\n[OUTPUT TRUNCATED]"
+                    if len(stderr) > max_output_size:
+                        stderr = stderr[:max_output_size] + b"\n[OUTPUT TRUNCATED]"


OOM protection is not actually enforced — full output is buffered before truncation

The comment "Enforce max_output_size by reading incrementally" and the two initialised-but-unused variables stdout_data/stderr_data indicate an incomplete refactoring. proc.communicate() still collects the entire subprocess output into memory before the length check runs, so a malicious or runaway script generating gigabytes of output can exhaust heap memory before any truncation occurs. Incremental reading via asyncio.StreamReader.read in a loop would be needed to honour max_output_size as a true safety bound.

coderabbitai

Actionable comments posted: 3

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/praisonai/praisonai/sandbox/subprocess.py (1)

158-166: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

preexec_fn breaks execution on Windows.

The preexec_fn parameter is POSIX-only. On Windows, asyncio.create_subprocess_exec (which wraps subprocess.Popen) will raise a ValueError when preexec_fn is provided, causing all sandbox executions to fail with a generic error rather than running with weaker isolation.

Consider conditionally omitting preexec_fn on non-POSIX systems:

🐛 Proposed fix to handle Windows

+        # preexec_fn is POSIX-only; omit on Windows
+        popen_kwargs = {
+            "stdout": asyncio.subprocess.PIPE,
+            "stderr": asyncio.subprocess.PIPE,
+            "cwd": working_dir or self._temp_dir,
+            "env": process_env,
+        }
+        if os.name == "posix":
+            popen_kwargs["start_new_session"] = True
+            popen_kwargs["preexec_fn"] = lambda: self._apply_rlimits(limits)
+        else:
+            logger.warning("Resource limits and session isolation not available on Windows")
+
         try:
             proc = await asyncio.create_subprocess_exec(
                 *cmd,
-                stdout=asyncio.subprocess.PIPE,
-                stderr=asyncio.subprocess.PIPE,
-                cwd=working_dir or self._temp_dir,
-                env=process_env,
-                start_new_session=True,  # new pgid so we can SIGKILL the tree
-                preexec_fn=lambda: self._apply_rlimits(limits),
+                **popen_kwargs,
             )

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/praisonai/praisonai/sandbox/subprocess.py` around lines 158 - 166, The
subprocess creation passes preexec_fn (calling self._apply_rlimits) which is
POSIX-only and raises on Windows; update the code in the coroutine that calls
asyncio.create_subprocess_exec so that you only include preexec_fn when running
on POSIX (e.g., os.name == "posix" or sys.platform check), and on non-POSIX
platforms omit preexec_fn (and ensure _apply_rlimits is not invoked there or
provide a no-op alternative); keep other args like start_new_session, cwd, env,
stdout/stderr unchanged and reference the create_subprocess_exec call and the
_apply_rlimits method to locate the change.

🧹 Nitpick comments (2)

src/praisonai/praisonai/sandbox/subprocess.py (1)
55-73: Subprocess sandbox env construction looks correct

Proxy env passthrough is gated by policy.allow_network, and PATH is pinned to a minimal value.

HOME uses self._temp_dir or "/tmp"; start() sets _temp_dir before _build_child_env is reached via execute()/run_command(), so the "/tmp" fallback should only be a defensive edge case.

Dict[str, str] | None is fine for this repo (requires-python = ">=3.10,<3.15").

If Ruff S108 continues to flag this line, consider an assert self._temp_dir (or a targeted noqa) to document the invariant.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/praisonai/praisonai/sandbox/subprocess.py` around lines 55 - 73, Ruff
S108 is likely flagging the defensive HOME fallback (self._temp_dir or "/tmp") —
make the invariant explicit by asserting self._temp_dir before it is used: add
an assertion for self._temp_dir in _build_child_env (or earlier in
start/execute/run_command where _temp_dir is set) so the code can safely use
self._temp_dir for HOME; alternatively, if you prefer to keep the fallback, add
a targeted "# noqa: S108" on the line with the HOME default and document why the
fallback is only defensive.
src/praisonai/praisonai/llm/env.py (1)
81-85: 💤 Low value

Comment is misleading—fallback applies to all non-OpenAI providers.

The comment states fallback occurs "only when the model is the default OpenAI shape," but the code falls back to OPENAI_API_KEY for any provider where key_var != "OPENAI_API_KEY". For example, anthropic/claude-3 will try ANTHROPIC_API_KEY then fall back to OPENAI_API_KEY.

The docstring at line 64 correctly describes this behavior. Align the inline comment:
📝 Proposed fix
-    # api_key is read from the provider-specific var, falling back to OPENAI_API_KEY
-    # only when the model is the default OpenAI shape.
+    # Try provider-specific key first; fall back to OPENAI_API_KEY for
+    # OpenAI-compatible proxies that may use a single shared key.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/praisonai/praisonai/llm/env.py` around lines 81 - 85, Update the
misleading inline comment above the api_key assignment: it currently says the
OPENAI_API_KEY fallback applies "only when the model is the default OpenAI
shape," but the code uses os.environ.get(key_var) or
(os.environ.get("OPENAI_API_KEY") if key_var != "OPENAI_API_KEY" else None),
which falls back to OPENAI_API_KEY for any provider whose key_var is not
"OPENAI_API_KEY" (e.g., ANTHROPIC_API_KEY → OPENAI_API_KEY). Change the comment
to state that the code falls back to OPENAI_API_KEY for all non-OpenAI
provider-specific vars, referencing the api_key and key_var variables to make
the behavior clear.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/praisonai/praisonai/llm/env.py`:
- Around line 21-30: The _PROVIDER_MAP currently uses None for provider_base for
"anthropic/", "google/", "gemini/", and "cohere/", which lets
resolve_llm_endpoint() fall back to the OpenAI default URL and misroute
non-OpenAI providers; update _PROVIDER_MAP so those prefixes map to
provider-appropriate default base URLs (e.g., set "anthropic/" ->
("ANTHROPIC_API_KEY", "https://api.anthropic.com/v1") or another real provider
endpoint) or use an explicit sentinel (e.g., empty string) to indicate
non-OpenAI-compatible endpoints, and ensure resolve_llm_endpoint() and callers
that rely on PraisonAIModel.base_url respect that sentinel; adjust any tests or
code that assume None means OpenAI to use the new sentinel/URLs.

In `@src/praisonai/praisonai/sandbox/subprocess.py`:
- Around line 169-185: Remove the dead variables stdout_data and stderr_data
declared in the subprocess output handling and stop referencing them (they're
unused); keep using proc.communicate() to get stdout/stderr, and immediately
apply the existing truncation logic (max_output_size) to stdout/stderr. Also add
a short inline comment near the communicate() call (and/or the truncation block)
stating that truncation happens after fully reading output and thus does not
prevent OOM for extremely large subprocess output, and that true
streaming/bounded reads would require a different approach than
asyncio.subprocess.communicate().
- Around line 198-205: The asyncio.TimeoutError handler currently calls
os.killpg(proc.pid, signal.SIGKILL) which will raise on Windows because
os.killpg and signal.SIGKILL may not exist; update the timeout block in the
subprocess timeout handler to guard/feature-detect before using os.killpg and
signal.SIGKILL (e.g., check hasattr(os, "killpg") and hasattr(signal,
"SIGKILL")) and otherwise fall back to proc.kill(), and also catch
AttributeError/OSError around the killpg call so the code always proceeds to
await proc.wait() without raising on Windows; reference the existing proc,
os.killpg, signal.SIGKILL, proc.kill(), and await proc.wait() in your patch.

---

Outside diff comments:
In `@src/praisonai/praisonai/sandbox/subprocess.py`:
- Around line 158-166: The subprocess creation passes preexec_fn (calling
self._apply_rlimits) which is POSIX-only and raises on Windows; update the code
in the coroutine that calls asyncio.create_subprocess_exec so that you only
include preexec_fn when running on POSIX (e.g., os.name == "posix" or
sys.platform check), and on non-POSIX platforms omit preexec_fn (and ensure
_apply_rlimits is not invoked there or provide a no-op alternative); keep other
args like start_new_session, cwd, env, stdout/stderr unchanged and reference the
create_subprocess_exec call and the _apply_rlimits method to locate the change.

---

Nitpick comments:
In `@src/praisonai/praisonai/llm/env.py`:
- Around line 81-85: Update the misleading inline comment above the api_key
assignment: it currently says the OPENAI_API_KEY fallback applies "only when the
model is the default OpenAI shape," but the code uses os.environ.get(key_var) or
(os.environ.get("OPENAI_API_KEY") if key_var != "OPENAI_API_KEY" else None),
which falls back to OPENAI_API_KEY for any provider whose key_var is not
"OPENAI_API_KEY" (e.g., ANTHROPIC_API_KEY → OPENAI_API_KEY). Change the comment
to state that the code falls back to OPENAI_API_KEY for all non-OpenAI
provider-specific vars, referencing the api_key and key_var variables to make
the behavior clear.

In `@src/praisonai/praisonai/sandbox/subprocess.py`:
- Around line 55-73: Ruff S108 is likely flagging the defensive HOME fallback
(self._temp_dir or "/tmp") — make the invariant explicit by asserting
self._temp_dir before it is used: add an assertion for self._temp_dir in
_build_child_env (or earlier in start/execute/run_command where _temp_dir is
set) so the code can safely use self._temp_dir for HOME; alternatively, if you
prefer to keep the fallback, add a targeted "# noqa: S108" on the line with the
HOME default and document why the fallback is only defensive.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 3e62623d-3d18-4296-a658-8846d402da1c

📥 Commits

Reviewing files that changed from the base of the PR and between ce97667 and 43c2333.

📒 Files selected for processing (3)

src/praisonai/praisonai/inc/models.py
src/praisonai/praisonai/llm/env.py
src/praisonai/praisonai/sandbox/subprocess.py

💤 Files with no reviewable changes (1)

src/praisonai/praisonai/inc/models.py

coderabbitai · 2026-06-08T08:32:28Z

+# Map well-known model prefixes to their (env-var, default base_url).
+_PROVIDER_MAP = {
+    "anthropic/":  ("ANTHROPIC_API_KEY",  None),
+    "google/":     ("GOOGLE_API_KEY",     None),
+    "gemini/":     ("GEMINI_API_KEY",     None),
+    "groq/":       ("GROQ_API_KEY",       "https://api.groq.com/openai/v1"),
+    "cohere/":     ("COHERE_API_KEY",     None),
+    "openrouter/": ("OPENROUTER_API_KEY", "https://openrouter.ai/api/v1"),
+    "ollama/":     ("OLLAMA_API_KEY",     "http://localhost:11434/v1"),
+}


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Non-OpenAI providers fall back to OpenAI's base URL.

For anthropic/, google/, gemini/, and cohere/ prefixes, provider_base is None. This causes base_url to fall through to default_base (https://api.openai.com/v1) when no env vars override it.

Looking at PraisonAIModel in models.py, these providers explicitly set base_url = "" to indicate they don't use an OpenAI-compatible endpoint. If resolve_llm_endpoint() returns OpenAI's URL for an Anthropic model and a caller constructs an OpenAI client with it, requests will be misdirected.

Consider either:

Setting provider-appropriate default URLs (e.g., https://api.anthropic.com/v1 for Anthropic)

Using an empty string or sentinel value to signal non-OpenAI-compatible providers

Documenting that callers must check api_key variable naming or model prefix to determine client type

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/llm/env.py` around lines 21 - 30, The _PROVIDER_MAP currently uses None for provider_base for "anthropic/", "google/", "gemini/", and "cohere/", which lets resolve_llm_endpoint() fall back to the OpenAI default URL and misroute non-OpenAI providers; update _PROVIDER_MAP so those prefixes map to provider-appropriate default base URLs (e.g., set "anthropic/" -> ("ANTHROPIC_API_KEY", "https://api.anthropic.com/v1") or another real provider endpoint) or use an explicit sentinel (e.g., empty string) to indicate non-OpenAI-compatible endpoints, and ensure resolve_llm_endpoint() and callers that rely on PraisonAIModel.base_url respect that sentinel; adjust any tests or code that assume None means OpenAI to use the new sentinel/URLs.

coderabbitai · 2026-06-08T08:32:28Z

+                # Enforce max_output_size by reading incrementally
+                max_output_size = self.config.security_policy.max_output_size
+                stdout_data = b""
+                stderr_data = b""
+
                stdout, stderr = await asyncio.wait_for(
                    proc.communicate(),
                    timeout=limits.timeout_seconds,
                )

+                # Truncate output if it exceeds max_output_size
+                if max_output_size and max_output_size > 0:
+                    if len(stdout) > max_output_size:
+                        stdout = stdout[:max_output_size] + b"\n[OUTPUT TRUNCATED]"
+                    if len(stderr) > max_output_size:
+                        stderr = stderr[:max_output_size] + b"\n[OUTPUT TRUNCATED]"
+


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Remove unused variables; note truncation timing.

Lines 171-172 declare stdout_data and stderr_data but they are never used—output comes directly from communicate().

Also note: truncation occurs after the full output is read into memory. While this limits the returned size, it doesn't prevent OOM if the subprocess produces extremely large output before the timeout. True OOM prevention would require bounded incremental reads, but that's significantly more complex with asyncio.subprocess.communicate(). The current approach is a reasonable tradeoff.

🧹 Remove dead code

try: - # Enforce max_output_size by reading incrementally + # Truncate output to max_output_size after reading max_output_size = self.config.security_policy.max_output_size - stdout_data = b"" - stderr_data = b"" - stdout, stderr = await asyncio.wait_for(

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

# Enforce max_output_size by reading incrementally

max_output_size = self.config.security_policy.max_output_size

stdout_data = b""

stderr_data = b""

stdout, stderr = await asyncio.wait_for(

proc.communicate(),

timeout=limits.timeout_seconds,

)

# Truncate output if it exceeds max_output_size

if max_output_size and max_output_size > 0:

if len(stdout) > max_output_size:

stdout = stdout[:max_output_size] + b"\n[OUTPUT TRUNCATED]"

if len(stderr) > max_output_size:

stderr = stderr[:max_output_size] + b"\n[OUTPUT TRUNCATED]"

# Truncate output to max_output_size after reading

max_output_size = self.config.security_policy.max_output_size

stdout, stderr = await asyncio.wait_for(

proc.communicate(),

timeout=limits.timeout_seconds,

)

# Truncate output if it exceeds max_output_size

if max_output_size and max_output_size > 0:

if len(stdout) > max_output_size:

stdout = stdout[:max_output_size] + b"\n[OUTPUT TRUNCATED]"

if len(stderr) > max_output_size:

stderr = stderr[:max_output_size] + b"\n[OUTPUT TRUNCATED]"

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/sandbox/subprocess.py` around lines 169 - 185, Remove the dead variables stdout_data and stderr_data declared in the subprocess output handling and stop referencing them (they're unused); keep using proc.communicate() to get stdout/stderr, and immediately apply the existing truncation logic (max_output_size) to stdout/stderr. Also add a short inline comment near the communicate() call (and/or the truncation block) stating that truncation happens after fully reading output and thus does not prevent OOM for extremely large subprocess output, and that true streaming/bounded reads would require a different approach than asyncio.subprocess.communicate().

coderabbitai · 2026-06-08T08:32:28Z

            except asyncio.TimeoutError:
-                proc.kill()
+                # Kill the whole process group, not just the leader
+                try:
+                    import signal
+                    os.killpg(proc.pid, signal.SIGKILL)
+                except (ProcessLookupError, PermissionError):
+                    proc.kill()
                await proc.wait()


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

os.killpg and signal.SIGKILL are unavailable on Windows.

If the subprocess creation is made Windows-compatible (per earlier comment), this timeout handler would fail with AttributeError because:

os.killpg doesn't exist on Windows

signal.SIGKILL isn't defined on Windows

Consider adding an OSError or AttributeError fallback, or guard the entire block:

🐛 Proposed fix for cross-platform timeout handling

except asyncio.TimeoutError: # Kill the whole process group, not just the leader try: - import signal - os.killpg(proc.pid, signal.SIGKILL) - except (ProcessLookupError, PermissionError): + if os.name == "posix": + import signal + os.killpg(proc.pid, signal.SIGKILL) + else: + proc.kill() + except (ProcessLookupError, PermissionError, OSError): proc.kill()

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

except asyncio.TimeoutError:

proc.kill()

# Kill the whole process group, not just the leader

try:

import signal

os.killpg(proc.pid, signal.SIGKILL)

except (ProcessLookupError, PermissionError):

proc.kill()

await proc.wait()

except asyncio.TimeoutError:

# Kill the whole process group, not just the leader

try:

if os.name == "posix":

import signal

os.killpg(proc.pid, signal.SIGKILL)

else:

proc.kill()

except (ProcessLookupError, PermissionError, OSError):

proc.kill()

await proc.wait()

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/sandbox/subprocess.py` around lines 198 - 205, The asyncio.TimeoutError handler currently calls os.killpg(proc.pid, signal.SIGKILL) which will raise on Windows because os.killpg and signal.SIGKILL may not exist; update the timeout block in the subprocess timeout handler to guard/feature-detect before using os.killpg and signal.SIGKILL (e.g., check hasattr(os, "killpg") and hasattr(signal, "SIGKILL")) and otherwise fall back to proc.kill(), and also catch AttributeError/OSError around the killpg call so the code always proceeds to await proc.wait() without raising on Windows; reference the existing proc, os.killpg, signal.SIGKILL, proc.kill(), and await proc.wait() in your patch.

MervinPraison · 2026-06-08T08:34:57Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-06-08T08:35:13Z

- Fix inverted boolean condition in llm/env.py causing wrong API key fallback - Add proper provider base URLs to prevent misrouting to OpenAI endpoints - Add Windows compatibility for subprocess sandbox (preexec_fn, os.killpg) - Remove unused variables and improve comments - Add defensive documentation for hardcoded /tmp fallback Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

praisonai-triage-agent Bot mentioned this pull request Jun 8, 2026

Wrapper layer: OpenAI-only credential resolver leaks across providers, SubprocessSandbox ignores SecurityPolicy/ResourceLimits, and inc/models.py mutates the root logger at import #1866

Open

Copilot AI requested a review from MervinPraison June 8, 2026 08:25

greptile-apps Bot reviewed Jun 8, 2026

View reviewed changes

coderabbitai Bot reviewed Jun 8, 2026

View reviewed changes

Uh oh!

Conversation

praisonai-triage-agent Bot commented Jun 8, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Gap 1: OpenAI-only credential resolver

Gap 2: SubprocessSandbox ignores SecurityPolicy/ResourceLimits

Gap 3: inc/models.py mutates root logger at import

Testing

Summary by CodeRabbit

Uh oh!

MervinPraison commented Jun 8, 2026

Uh oh!

MervinPraison commented Jun 8, 2026

Uh oh!

qodo-code-review Bot commented Jun 8, 2026

Qodo reviews are paused for this user.

Uh oh!

coderabbitai Bot commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

MervinPraison commented Jun 8, 2026

Uh oh!

greptile-apps Bot commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

greptile-apps Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

MervinPraison commented Jun 8, 2026

Uh oh!

praisonai-triage-agent Bot commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Final Architecture Review Complete

🔍 Architecture Review Results

✅ AGENTS.md Compliance

✅ Critical Bugs Fixed

1. Fixed Inverted Condition in llm/env.py

2. Added Provider Base URLs

3. Fixed Windows Compatibility

4. Cleanup & Documentation

🎯 Final Verdict: ✅ APPROVED

Commit SHA: ca55e54d

Files Modified:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

praisonai-triage-agent Bot commented Jun 8, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 8, 2026 •

edited

Loading

coderabbitai Bot commented Jun 8, 2026 •

edited

Loading

greptile-apps Bot commented Jun 8, 2026 •

edited

Loading

praisonai-triage-agent Bot commented Jun 8, 2026 •

edited

Loading

1. Fixed Inverted Condition in `llm/env.py`

Commit SHA: `ca55e54d`