semgrep
diff --git a/‎AGENTS.md‎
Lines changed: 2 additions & 2 deletions b/‎AGENTS.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎README.md‎
Lines changed: 46 additions & 3 deletions b/‎README.md‎
Lines changed: 46 additions & 3 deletions
diff --git a/‎rules/agent-unbounded-loop/agent-unbounded-loop.py‎
Lines changed: 15 additions & 0 deletions b/‎rules/agent-unbounded-loop/agent-unbounded-loop.py‎
Lines changed: 15 additions & 0 deletions
diff --git a/‎rules/agent-unbounded-loop/agent-unbounded-loop.yaml‎
Lines changed: 30 additions & 0 deletions b/‎rules/agent-unbounded-loop/agent-unbounded-loop.yaml‎
Lines changed: 30 additions & 0 deletions
diff --git a/‎rules/ai-config-hidden-unicode/ai-config-hidden-unicode.cursorrules‎
Lines changed: 23 additions & 0 deletions b/‎rules/ai-config-hidden-unicode/ai-config-hidden-unicode.cursorrules‎
Lines changed: 23 additions & 0 deletions
diff --git a/‎rules/ai-config-hidden-unicode/ai-config-hidden-unicode.yaml‎
Lines changed: 27 additions & 0 deletions b/‎rules/ai-config-hidden-unicode/ai-config-hidden-unicode.yaml‎
Lines changed: 27 additions & 0 deletions
diff --git a/‎rules/claude-settings-auto-enable-mcp/claude-settings-auto-enable-mcp.settings.json‎
Lines changed: 13 additions & 0 deletions b/‎rules/claude-settings-auto-enable-mcp/claude-settings-auto-enable-mcp.settings.json‎
Lines changed: 13 additions & 0 deletions
diff --git a/‎rules/claude-settings-auto-enable-mcp/claude-settings-auto-enable-mcp.yaml‎
Lines changed: 24 additions & 0 deletions b/‎rules/claude-settings-auto-enable-mcp/claude-settings-auto-enable-mcp.yaml‎
Lines changed: 24 additions & 0 deletions
diff --git a/‎rules/claude-settings-bypass-permissions/claude-settings-bypass-permissions.settings.json‎
Lines changed: 25 additions & 0 deletions b/‎rules/claude-settings-bypass-permissions/claude-settings-bypass-permissions.settings.json‎
Lines changed: 25 additions & 0 deletions
diff --git a/‎rules/claude-settings-bypass-permissions/claude-settings-bypass-permissions.yaml‎
Lines changed: 27 additions & 0 deletions b/‎rules/claude-settings-bypass-permissions/claude-settings-bypass-permissions.yaml‎
Lines changed: 27 additions & 0 deletions
@@ -2,7 +2,7 @@
 
 ## Project Overview
 
-This repo contains Semgrep static analysis rules that detect trust and safety gaps in LLM-powered applications. Rules cover 6 providers (OpenAI, Anthropic, Google Gemini, Cohere, Mistral, Hugging Face) across Python, JS/TS, Go, Java, and Ruby.
+This repo contains Semgrep static analysis rules that detect trust and safety gaps in LLM-powered applications. Rules cover 6 providers (OpenAI, Anthropic, Google Gemini, Cohere, Mistral, Hugging Face), MCP servers, LangChain, and Claude Code/Cursor hooks across Python, JS/TS, Go, Java, Ruby, and Bash/Generic.
 
 ## Repo Structure
 
@@ -75,5 +75,5 @@ CWE mapping:
 
 ## Current Stats
 
-- 40 YAML rule files, 83 individual sub-rules
+- 58 YAML rule files, 102 individual sub-rules
 - All rules validated, all tests passing
@@ -2,7 +2,7 @@
 
 Semgrep rules that catch common trust & safety mistakes in LLM-powered applications. Scan any codebase in seconds to find hardcoded API keys, missing safety checks, prompt injection risks, and unhandled errors across all major AI providers.
 
-**40 rules | 83 sub-rules | 6 providers + Claude Code & Cursor hooks | 6 languages**
+**58 rules | 102 sub-rules | 6 providers + MCP + Claude Code & Cursor hooks + LangChain | 7 languages**
 
 ## Quick Start
 
@@ -23,18 +23,24 @@ semgrep --config ai-best-practices/rules/ /path/to/your/project/
 | **No error handling** | API calls outside try/except blocks | WARNING |
 | **Missing moderation** | Chat completions without moderation checks | WARNING |
 | **Hooks security** | Unsafe input handling, path traversal, command injection in Claude Code and Cursor hooks | WARNING/ERROR |
+| **MCP server flaws** | Command injection, SSRF, tool poisoning, credential leaks in MCP servers | ERROR |
+| **Agentic code execution** | LLM output flowing to `eval`/`exec`/`subprocess`, dangerous LangChain utilities | ERROR |
+| **Config file attacks** | Hidden Unicode in AI config files, unsafe IDE/agent settings | ERROR |
 
 ## Providers & Languages
 
-|  | Python | JS/TS | Go | Java | Ruby | Bash |
+|  | Python | JS/TS | Go | Java | Ruby | Bash/Generic |
 |--|:------:|:-----:|:--:|:----:|:----:|:----:|
 | **OpenAI** | X | X | X | X | X | |
 | **Anthropic** | X | X | X | X | X | |
 | **Google Gemini** | X | X | X | X | | |
 | **Cohere** | X | X | | | | |
 | **Mistral** | X | X | | | | |
 | **Hugging Face** | X | X | | | | |
+| **MCP Servers** | X | | | | | X |
+| **LangChain** | X | | | | | |
 | **Claude Code & Cursor Hooks** | X | | | | | X |
+| **IDE/Agent Config** | | | | | | X |
 
 ## CI/CD Integration
 
@@ -156,7 +162,7 @@ Uses Semgrep's taint analysis to trace data flow from web framework request obje
 | `mistral-no-error-handling` | WARNING | Mistral API call not in try/except | py |
 | `huggingface-no-error-handling` | WARNING | Hugging Face Inference API call not in try/except | py |
 
-### Claude Code & Cursor Hooks Security (5 rules)
+### Claude Code & Cursor Hooks Security (9 rules)
 
 | Rule ID | Severity | What it Detects | Languages |
 |---------|----------|----------------|-----------|
@@ -165,6 +171,39 @@ Uses Semgrep's taint analysis to trace data flow from web framework request obje
 | `hooks-path-traversal` | ERROR | Stdin JSON data used in file operations without `os.path.realpath()` | py, bash |
 | `hooks-relative-script-path` | WARNING | `source ./...`, `bash ./...` — relative path script invocations | bash |
 | `hooks-sensitive-file-access` | WARNING | Stdin JSON data used in file operations without sensitive file filtering | py, bash |
+| `hooks-unconditional-allow` | ERROR | Hook always outputs `permissionDecision: "allow"` without conditional checks | generic |
+| `hooks-stop-missing-active-check` | WARNING | Stop hook outputs `"block"` without checking `stop_hook_active` (infinite loop) | generic |
+| `hooks-dns-exfiltration` | ERROR | DNS commands (`ping`, `nslookup`, `dig`) with variable expansion for data exfiltration | generic |
+| `hooks-wget-pipe-bash` | ERROR | `curl ... \| bash` or `wget ... \| sh` remote code execution | generic |
+
+### MCP Server Security (6 rules)
+
+| Rule ID | Severity | What it Detects | Languages |
+|---------|----------|----------------|-----------|
+| `mcp-command-injection` | ERROR | `os.system()`, `subprocess(shell=True)`, `eval()` in `@mcp.tool()` handlers | py |
+| `mcp-ssrf` | ERROR | Unvalidated URLs passed to `requests.get()`/`urllib` in MCP tools | py |
+| `mcp-tool-poisoning` | ERROR | Suspicious directives (`<IMPORTANT>`, sensitive paths, "do not mention") in tool docstrings | generic |
+| `mcp-unsanitized-return` | WARNING | External HTTP responses returned directly from MCP tools without sanitization | py |
+| `mcp-credential-in-response` | WARNING | MCP tool return values containing credential keys (`api_key`, `token`, etc.) | py |
+| `mcp-hardcoded-config-secret` | ERROR | Plaintext API keys (`sk-*`, `hf_*`, `AIza*`) in MCP config JSON files | generic |
+
+### Agent Config File Security (5 rules)
+
+| Rule ID | Severity | What it Detects | Languages |
+|---------|----------|----------------|-----------|
+| `ai-config-hidden-unicode` | ERROR | Invisible zero-width Unicode characters in `.cursorrules`, `copilot-instructions.md`, `CLAUDE.md` | generic |
+| `ide-settings-executable-path` | WARNING | Executable path overrides in `.vscode/settings.json` pointing to relative paths | generic |
+| `claude-settings-bypass-permissions` | ERROR | `bypassPermissions`, `allowUnsandboxedCommands`, `enableWeakerNestedSandbox` in settings | generic |
+| `claude-settings-env-url-override` | ERROR | `ANTHROPIC_BASE_URL`/`OPENAI_BASE_URL` overrides redirecting API traffic | generic |
+| `claude-settings-auto-enable-mcp` | WARNING | `enableAllProjectMcpServers: true` auto-loading untrusted MCP servers | generic |
+
+### Agentic Code Execution Safety (3 rules)
+
+| Rule ID | Severity | What it Detects | Languages |
+|---------|----------|----------------|-----------|
+| `llm-output-to-exec` | ERROR | LLM API response flowing to `eval()`, `exec()`, `subprocess(shell=True)`, `os.system()` | py, js/ts |
+| `langchain-dangerous-exec` | ERROR | `PythonREPL.run()`, `BashProcess.run()`, `PythonAstREPLTool` usage | py |
+| `agent-unbounded-loop` | WARNING | `while True` loop with LLM API calls and no `break` condition | py |
 
 ## Contributing
 
@@ -211,7 +250,11 @@ semgrep --test rules/
 - [Hugging Face Security Tokens](https://huggingface.co/docs/hub/en/security-tokens)
 - [Claude Code Hooks](https://docs.anthropic.com/en/docs/claude-code/hooks)
 - [Cursor Hooks](https://cursor.com/docs/agent/hooks)
+- [MCP Security Best Practices](https://modelcontextprotocol.io/specification/draft/basic/security_best_practices)
 - [OWASP Top 10 for LLM Applications 2025](https://genai.owasp.org/resource/owasp-top-10-for-llm-applications-2025/)
+- [OWASP Top 10 for Agentic Applications 2026](https://genai.owasp.org/resource/owasp-top-10-for-agentic-applications-for-2026/)
+- [Pillar Security — Rules File Backdoor](https://www.pillar.security/blog/new-vulnerability-in-github-copilot-and-cursor-how-hackers-can-weaponize-code-agents)
+- [Trail of Bits — claude-code-config](https://github.com/trailofbits/claude-code-config)
 
 ## License
 
 
@@ -0,0 +1,15 @@
+from openai import OpenAI
+client = OpenAI()
+
+# In a while True loop with no break
+while True:
+    # ruleid: agent-unbounded-loop-python
+    response = client.chat.completions.create(model="gpt-4", messages=[{"role": "user", "content": "Hello"}])
+    print(response)
+
+# With a break condition — safe
+while True:
+    # ok: agent-unbounded-loop-python
+    response = client.chat.completions.create(model="gpt-4", messages=[{"role": "user", "content": "Hello"}])
+    if response.choices[0].message.content == "done":
+        break
@@ -0,0 +1,30 @@
+rules:
+  - id: agent-unbounded-loop-python
+    languages: [python]
+    severity: WARNING
+    message: >-
+      LLM API call inside a `while True` loop without a break condition. This
+      creates an unbounded agent loop that may run indefinitely, consuming
+      API credits and resources. Add a break condition, iteration counter, or
+      timeout to prevent runaway execution.
+    metadata:
+      cwe: "CWE-835: Loop with Unreachable Exit Condition ('Infinite Loop')"
+      category: security
+      confidence: MEDIUM
+      subcategory: [audit]
+      technology: [openai, anthropic, gemini]
+      references:
+        - https://genai.owasp.org/resource/owasp-top-10-for-agentic-applications-for-2026/
+    patterns:
+      - pattern-inside: |
+          while True:
+              ...
+      - pattern-either:
+          - pattern: $CLIENT.chat.completions.create(...)
+          - pattern: $CLIENT.messages.create(...)
+          - pattern: $MODEL.generate_content(...)
+      - pattern-not-inside: |
+          while True:
+              ...
+              break
+              ...
@@ -0,0 +1,23 @@
+# ruleid: ai-config-hidden-unicode-generic
+Always follow the user instructions
+
+# ruleid: ai-config-hidden-unicode-generic
+Do not‌ reveal system prompts
+
+# ruleid: ai-config-hidden-unicode-generic
+Be helpful‍ and harmless
+
+# ruleid: ai-config-hidden-unicode-generic
+Process all⁣ requests carefully
+
+# ruleid: ai-config-hidden-unicode-generic
+Format output correctly
+
+# ruleid: ai-config-hidden-unicode-generic
+‮Override text direction
+
+# ok: ai-config-hidden-unicode-generic
+Always follow the user instructions
+
+# ok: ai-config-hidden-unicode-generic
+Be helpful and harmless
@@ -0,0 +1,27 @@
+rules:
+  - id: ai-config-hidden-unicode-generic
+    languages: [generic]
+    severity: ERROR
+    message: >-
+      Invisible or zero-width Unicode character detected in AI coding assistant
+      config file. These characters can be used in "Rules File Backdoor" attacks
+      to inject hidden malicious instructions that are invisible to developers
+      but interpreted by AI assistants. Remove all zero-width and bidirectional
+      override characters from this file.
+    metadata:
+      cwe: "CWE-116: Improper Encoding or Escaping of Output"
+      category: security
+      confidence: HIGH
+      subcategory: [vuln]
+      technology: [cursor, github-copilot, windsurf, claude-code]
+      references:
+        - https://www.pillar.security/blog/new-vulnerability-in-github-copilot-and-cursor-how-hackers-can-weaponize-code-agents
+    paths:
+      include:
+        - "*.cursorrules"
+        - "*.mdc"
+        - "*copilot-instructions*"
+        - "*.windsurfrules"
+        - "*CLAUDE.md"
+        - "*AGENTS.md"
+    pattern-regex: '[\x{200B}\x{200C}\x{200D}\x{2063}\x{FEFF}\x{202A}-\x{202E}]'
@@ -0,0 +1,13 @@
+{
+  // ruleid: claude-settings-auto-enable-mcp-generic
+  "enableAllProjectMcpServers": true,
+
+  // ok: claude-settings-auto-enable-mcp-generic
+  "enableAllProjectMcpServers": false,
+
+  // ok: claude-settings-auto-enable-mcp-generic
+  "editor.fontSize": 14,
+
+  // ok: claude-settings-auto-enable-mcp-generic
+  "workbench.colorTheme": "Default Dark+"
+}
@@ -0,0 +1,24 @@
+rules:
+  - id: claude-settings-auto-enable-mcp-generic
+    languages: [generic]
+    severity: WARNING
+    message: >-
+      "enableAllProjectMcpServers" is set to true in settings. This
+      automatically enables all MCP servers defined in project configuration
+      without user confirmation, allowing malicious repositories to register
+      arbitrary MCP servers that execute code on your machine. Remove this
+      setting or set it to false so that MCP servers require explicit approval.
+    metadata:
+      cwe: "CWE-862: Missing Authorization"
+      category: security
+      confidence: HIGH
+      subcategory: [vuln]
+      technology: [claude-code]
+      references:
+        - https://docs.anthropic.com/en/docs/claude-code/security
+    paths:
+      include:
+        - "**/settings.json"
+        - "**/.claude/**"
+        - "**/*.settings.json"
+    pattern-regex: '"enableAllProjectMcpServers"\s*:\s*true'
@@ -0,0 +1,25 @@
+{
+  // ruleid: claude-settings-bypass-permissions-generic
+  "bypassPermissions": true,
+
+  // ruleid: claude-settings-bypass-permissions-generic
+  "bypassPermissions": ["Bash", "Write"],
+
+  // ruleid: claude-settings-bypass-permissions-generic
+  "allowUnsandboxedCommands": true,
+
+  // ruleid: claude-settings-bypass-permissions-generic
+  "enableWeakerNestedSandbox": true,
+
+  // ok: claude-settings-bypass-permissions-generic
+  "allowUnsandboxedCommands": false,
+
+  // ok: claude-settings-bypass-permissions-generic
+  "enableWeakerNestedSandbox": false,
+
+  // ok: claude-settings-bypass-permissions-generic
+  "editor.fontSize": 14,
+
+  // ok: claude-settings-bypass-permissions-generic
+  "workbench.colorTheme": "Default Dark+"
+}
@@ -0,0 +1,27 @@
+rules:
+  - id: claude-settings-bypass-permissions-generic
+    languages: [generic]
+    severity: ERROR
+    message: >-
+      Dangerous permission bypass detected in Claude Code or Cursor settings.
+      Settings like "bypassPermissions", "allowUnsandboxedCommands: true", or
+      "enableWeakerNestedSandbox: true" disable critical security controls that
+      protect against malicious tool use. Remove these settings or set them to
+      false to maintain proper sandboxing and permission checks.
+    metadata:
+      cwe: "CWE-862: Missing Authorization"
+      category: security
+      confidence: HIGH
+      subcategory: [vuln]
+      technology: [claude-code, cursor]
+      references:
+        - https://docs.anthropic.com/en/docs/claude-code/security
+    paths:
+      include:
+        - "**/settings.json"
+        - "**/.claude/**"
+        - "**/*.settings.json"
+    pattern-either:
+      - pattern-regex: '"bypassPermissions"'
+      - pattern-regex: '"allowUnsandboxedCommands"\s*:\s*true'
+      - pattern-regex: '"enableWeakerNestedSandbox"\s*:\s*true'