Migrate llm-judge detector to TrustyAI #17

saichandrapandraju · 2025-06-29T22:15:39Z

Migrate LLM judge detector to official TrustyAI registry.

Changes

~~- Added build-and-push-judge.yaml workflow (adapted from existing HF build & push)~~

Updated build-and-push.yaml to include LLM Judge detector
Updated build-and-push.yaml to use an intermediate environment variable with env: and then use those in run: and body: to prevent potential injections
Updated servingruntime.yaml to use quay.io/trustyai/guardrails-detector-llm-judge:latest

Closes: #15

Summary by Sourcery

Migrate the LLM Judge detector to the official TrustyAI container registry and integrate it into the existing CI pipeline with automated build, security scanning, and container registry publishing

Enhancements:

Extend the build-and-push GitHub Actions workflow to build, tag, push, and scan the LLM Judge detector image alongside existing detectors
Refactor workflow to use intermediate environment variables for safer injection in run and body steps
Update KServe servingruntime manifest to reference quay.io/trustyai/guardrails-detector-llm-judge:latest

sourcery-ai · 2025-06-29T22:15:43Z

Reviewer's Guide

This PR migrates the LLM Judge detector to the TrustyAI registry by refactoring and extending the existing build-and-push GitHub Actions workflow—introducing secure env handling, adding build/scan/publish steps for the LLM Judge image—and updating the KServe servingruntime manifest to reference the new TrustyAI image.

Sequence diagram for CI pipeline build and publish of LLM Judge detector image

sequenceDiagram
    participant Dev as Developer
    participant GitHub as GitHub Actions
    participant Trivy as Trivy Scanner
    participant Quay as Quay.io Registry
    participant GHSec as GitHub Security Tab
    participant PR as Pull Request

    Dev->>GitHub: Push code / open PR
    GitHub->>GitHub: Build LLM Judge Docker image
    GitHub->>Trivy: Run security scan on image
    Trivy-->>GitHub: Return scan results (SARIF)
    GitHub->>Quay: Push image (ci or latest tag)
    GitHub->>GHSec: Upload SARIF scan results
    GitHub->>PR: Post comment with image link

Class diagram for updated environment variable handling in build-and-push workflow

classDiagram
    class BuildAndPushWorkflow {
        +env PR_HEAD_SHA
        +env GITHUB_REF_NAME
        +env QUAY_RELEASE_REPO
        +env GITHUB_REF
        +env GITHUB_HEAD_REF
        +env LLM_JUDGE_IMAGE_NAME
        +env TAG
        +env EXPIRY_LABEL
        +step Build LLM Judge image
        +step Push LLM Judge image
        +step Trivy scan LLM Judge image
        +step Upload SARIF for LLM Judge
    }

File-Level Changes

Change	Details	Files
Extend and refactor GitHub Actions workflow to include the LLM Judge detector and enhance security	Introduce job-level env vars (PR_HEAD_SHA, GITHUB_REF_NAME, QUAY_RELEASE_REPO, etc.) and use them in debug and assignment steps Consolidate expiry label logic into EXPIRY_LABEL and quote env variables in docker build/push to prevent injections Add LLM_JUDGE_IMAGE_NAME in CI, main, and tag contexts and include build & push steps for the Dockerfile.judge image Update PR comment action to list the LLM Judge CI image and add dedicated Trivy scan and SARIF upload steps for it	`.github/workflows/build-and-push.yaml`
Update KServe servingruntime manifest to use the official TrustyAI LLM Judge image	Change image reference to quay.io/trustyai/guardrails-detector-llm-judge:latest	`detectors/llm_judge/deploy/servingruntime.yaml`

Possibly linked issues

Migrate llm-judge image to TrustyAI registry and add image build automation #15: The PR updates the image to TrustyAI registry and adds a CI pipeline for building and pushing the LLM judge image, addressing the issue.

Tips and commands

Interacting with Sourcery

Trigger a new review: Comment @sourcery-ai review on the pull request.
Continue discussions: Reply directly to Sourcery's review comments.
Generate a GitHub issue from a review comment: Ask Sourcery to create an
issue from a review comment by replying to it. You can also reply to a
review comment with @sourcery-ai issue to create an issue from it.
Generate a pull request title: Write @sourcery-ai anywhere in the pull
request title to generate a title at any time. You can also comment
@sourcery-ai title on the pull request to (re-)generate the title at any time.
Generate a pull request summary: Write @sourcery-ai summary anywhere in
the pull request body to generate a PR summary at any time exactly where you
want it. You can also comment @sourcery-ai summary on the pull request to
(re-)generate the summary at any time.
Generate reviewer's guide: Comment @sourcery-ai guide on the pull
request to (re-)generate the reviewer's guide at any time.
Resolve all Sourcery comments: Comment @sourcery-ai resolve on the
pull request to resolve all Sourcery comments. Useful if you've already
addressed all the comments and don't want to see them anymore.
Dismiss all Sourcery reviews: Comment @sourcery-ai dismiss on the pull
request to dismiss all existing Sourcery reviews. Especially useful if you
want to start fresh with a new review - don't forget to comment
@sourcery-ai review to trigger a new review!

Customizing Your Experience

Access your dashboard to:

Enable or disable review features such as the Sourcery-generated pull request
summary, the reviewer's guide, and others.
Change the review language.
Add, remove or edit custom review instructions.
Adjust other review settings.

Getting Help

Contact our support team for questions or feedback.
Visit our documentation for detailed guides and information.
Keep in touch with the Sourcery team by following us on X/Twitter, LinkedIn or GitHub.

sourcery-ai

Hey @saichandrapandraju - I've reviewed your changes and found some issues that need to be addressed.

Blocking issues:

Using variable interpolation ${{...}} with github context data in a run: step could allow an attacker to inject their own code into the runner. This would allow them to steal secrets and code. github context data can have arbitrary user input and should be treated as untrusted. Instead, use an intermediate environment variable with env: to store the data and use the environment variable in the run: script. Be sure to use double-quotes the environment variable, like this: "$ENVVAR". (link)
Trivy scan is configured to always succeed, potentially masking critical vulnerabilities. (link)

General comments:

Instead of echoing an expiry label directly into the Dockerfile, use Docker’s --label flag in the build command to avoid mutating the source file.
Consider adding caching for Docker layers (e.g., via actions/cache or buildkit cache) to speed up build times for consecutive runs.
Using pull_request_target exposes the workflow to untrusted PR code; consider switching to a pull_request trigger to avoid running malicious changes with elevated permissions.

Prompt for AI Agents

Please address the comments from this code review:
## Overall Comments
- Instead of echoing an expiry label directly into the Dockerfile, use Docker’s `--label` flag in the build command to avoid mutating the source file.
- Consider adding caching for Docker layers (e.g., via `actions/cache` or buildkit cache) to speed up build times for consecutive runs.
- Using `pull_request_target` exposes the workflow to untrusted PR code; consider switching to a `pull_request` trigger to avoid running malicious changes with elevated permissions.

## Individual Comments

### Comment 1
<location> `.github/workflows/build-and-push-judge.yaml:107` </location>
<code_context>
+            PR image build completed successfully!
+            
+            📦 [PR image](https://quay.io/repository/trustyai/guardrails-detector-llm-judge-ci?tab=tags): `quay.io/trustyai/guardrails-detector-llm-judge-ci:${{ github.event.pull_request.head.sha }}`
+      - name: Trivy scan
+        uses: aquasecurity/[email protected]
+        with:
+          scan-type: 'image'
+          image-ref: "${{ env.IMAGE_NAME }}:${{ env.TAG }}"
+          format: 'sarif'
+          output: 'trivy-results.sarif'
+          severity: 'MEDIUM,HIGH,CRITICAL'
+          exit-code: '0'
+          ignore-unfixed: false
+          vuln-type: 'os,library'
</code_context>

<issue_to_address>
Trivy scan is configured to always succeed, potentially masking critical vulnerabilities.

With 'exit-code: 0', the workflow will not fail for HIGH or CRITICAL vulnerabilities, which may result in publishing insecure images. To enforce blocking on these severities, set 'exit-code: 1' or handle scan results explicitly.
</issue_to_address>

<suggested_fix>
<<<<<<< SEARCH
          exit-code: '0'
=======
          exit-code: '1'
>>>>>>> REPLACE

</suggested_fix>

## Security Issues

### Issue 1
<location> `.github/workflows/build-and-push-judge.yaml:51` </location>

<issue_to_address>
**security (opengrep-rules.yaml.github-actions.security.run-shell-injection):** Using variable interpolation `${{...}}` with `github` context data in a `run:` step could allow an attacker to inject their own code into the runner. This would allow them to steal secrets and code. `github` context data can have arbitrary user input and should be treated as untrusted. Instead, use an intermediate environment variable with `env:` to store the data and use the environment variable in the `run:` script. Be sure to use double-quotes the environment variable, like this: "$ENVVAR".

*Source: opengrep*
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

.github/workflows/build-and-push-judge.yaml

…workflow

saichandrapandraju · 2025-08-08T14:30:26Z

@sourcery-ai review

sourcery-ai

Hey @saichandrapandraju - I've reviewed your changes - here's some feedback:

The build-and-push workflow has a lot of repeated steps for each detector; consider using a matrix job to DRY up the Docker build, push, and Trivy scan steps.
You’re echoing common vars like PR_HEAD_SHA and QUAY_RELEASE_REPO in multiple steps; moving them to the job-level env block would reduce repetition and make updates easier.
Instead of echoing ‘quay.expires-after’ into each Dockerfile in a separate step, you could pass --label to docker build to set the expiry label in one go.

Prompt for AI Agents

Please address the comments from this code review:
## Overall Comments
- The build-and-push workflow has a lot of repeated steps for each detector; consider using a matrix job to DRY up the Docker build, push, and Trivy scan steps.
- You’re echoing common vars like PR_HEAD_SHA and QUAY_RELEASE_REPO in multiple steps; moving them to the job-level env block would reduce repetition and make updates easier.
- Instead of echoing ‘quay.expires-after’ into each Dockerfile in a separate step, you could pass --label to docker build to set the expiry label in one go.

## Individual Comments

### Comment 1
<location> `.github/workflows/build-and-push.yaml:142` </location>
<code_context>
-            📦 [PR image](https://quay.io/trustyai/guardrails-detector-built-in-ci?tab=tags): `quay.io/trustyai/guardrails-detector-built-in-ci:${{ github.event.pull_request.head.sha }}`
+            📦 [PR image](https://quay.io/repository/trustyai/guardrails-detector-huggingface-runtime-ci?tab=tags): `quay.io/trustyai/guardrails-detector-huggingface-runtime-ci:$PR_HEAD_SHA`
+            📦 [PR image](https://quay.io/trustyai/guardrails-detector-built-in-ci?tab=tags): `quay.io/trustyai/guardrails-detector-built-in-ci:$PR_HEAD_SHA`
+            📦 [PR image](https://quay.io/trustyai/guardrails-detector-llm-judge-ci?tab=tags): `quay.io/trustyai/guardrails-detector-llm-judge-ci:$PR_HEAD_SHA`
       - name: Trivy scan
         uses: aquasecurity/[email protected]
</code_context>

<issue_to_address>
Clarify the label for the new PR image to distinguish it from others.

Specify a unique label such as 'LLM Judge PR image' for this entry to help users differentiate between the images.
</issue_to_address>

<suggested_fix>
<<<<<<< SEARCH
            📦 [PR image](https://quay.io/trustyai/guardrails-detector-llm-judge-ci?tab=tags): `quay.io/trustyai/guardrails-detector-llm-judge-ci:$PR_HEAD_SHA`
=======
            📦 [LLM Judge PR image](https://quay.io/trustyai/guardrails-detector-llm-judge-ci?tab=tags): `quay.io/trustyai/guardrails-detector-llm-judge-ci:$PR_HEAD_SHA`
>>>>>>> REPLACE

</suggested_fix>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

.github/workflows/build-and-push.yaml

…el to docker file + add unique name

saichandrapandraju · 2025-08-08T14:47:14Z

@sourcery-ai review

sourcery-ai

Hey @saichandrapandraju - I've reviewed your changes and they look great!

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

migrate llm-judge detector to TrustyAI

9c86fef

sourcery-ai bot requested changes Jun 29, 2025

View reviewed changes

.github/workflows/build-and-push-judge.yaml Outdated Show resolved Hide resolved

.github/workflows/build-and-push-judge.yaml Outdated Show resolved Hide resolved

ruivieira added the enhancement New feature or request label Jul 9, 2025

ruivieira added this to TrustyAI planning Jul 9, 2025

ruivieira moved this to In Review in TrustyAI planning Jul 9, 2025

saichandrapandraju self-assigned this Jul 9, 2025

saichandrapandraju and others added 3 commits August 8, 2025 08:34

Merge branch 'trustyai-explainability:main' into migrate-image-judge

b020e2f

Integrate LLM Judge CI workflow to main build-and-push & reemove old …

7292ede

…workflow

address sourcery comment to prevent injection vulnerabilities

63c2c0f

sourcery-ai bot reviewed Aug 8, 2025

View reviewed changes

.github/workflows/build-and-push.yaml Outdated Show resolved Hide resolved

address sourcery comments regarding repeated env vars + passing --lab…

b1e5ac6

…el to docker file + add unique name

sourcery-ai bot reviewed Aug 8, 2025

View reviewed changes

saichandrapandraju requested a review from RobGeada August 8, 2025 14:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Migrate llm-judge detector to TrustyAI #17

Migrate llm-judge detector to TrustyAI #17

Uh oh!

saichandrapandraju commented Jun 29, 2025 •

edited

Loading

Uh oh!

sourcery-ai bot commented Jun 29, 2025 •

edited

Loading

Interacting with Sourcery

Customizing Your Experience

Getting Help

Uh oh!

sourcery-ai bot left a comment

Uh oh!

Uh oh!

Uh oh!

saichandrapandraju commented Aug 8, 2025

Uh oh!

sourcery-ai bot left a comment

Uh oh!

Uh oh!

saichandrapandraju commented Aug 8, 2025

Uh oh!

sourcery-ai bot left a comment

Uh oh!

Uh oh!

Migrate llm-judge detector to TrustyAI #17

Are you sure you want to change the base?

Migrate llm-judge detector to TrustyAI #17

Uh oh!

Conversation

saichandrapandraju commented Jun 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Closes: #15

Summary by Sourcery

Uh oh!

sourcery-ai bot commented Jun 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviewer's Guide

Sequence diagram for CI pipeline build and publish of LLM Judge detector image

Class diagram for updated environment variable handling in build-and-push workflow

File-Level Changes

Possibly linked issues

Interacting with Sourcery

Customizing Your Experience

Getting Help

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

saichandrapandraju commented Aug 8, 2025

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

saichandrapandraju commented Aug 8, 2025

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

saichandrapandraju commented Jun 29, 2025 •

edited

Loading

sourcery-ai bot commented Jun 29, 2025 •

edited

Loading