Merged
3 changes: 3 additions & 0 deletions packages/evals/.docker/entrypoints/runner.sh
@@ -1,5 +1,8 @@
 #!/bin/bash
 
+# Set environment variable to suppress WSL install prompt for VS Code
+export DONT_PROMPT_WSL_INSTALL=1
+
 if [ $# -eq 0 ]; then
 exec bash
 else
12 changes: 6 additions & 6 deletions packages/evals/Dockerfile.runner
@@ -59,11 +59,11 @@ ARG PYTHON_EXT_VERSION=2025.6.1
 ARG RUST_EXT_VERSION=0.3.2482
 
 RUN mkdir -p /roo/.vscode-template \
-&& code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension golang.go@${GOLANG_EXT_VERSION} \
-&& code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension dbaeumer.vscode-eslint@${ESLINT_EXT_VERSION} \
-&& code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension redhat.java@${JAVA_EXT_VERSION} \
-&& code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension ms-python.python@${PYTHON_EXT_VERSION} \
-&& code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension rust-lang.rust-analyzer@${RUST_EXT_VERSION}
+&& yes | code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension golang.go@${GOLANG_EXT_VERSION} \
+&& yes | code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension dbaeumer.vscode-eslint@${ESLINT_EXT_VERSION} \
+&& yes | code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension redhat.java@${JAVA_EXT_VERSION} \
+&& yes | code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension ms-python.python@${PYTHON_EXT_VERSION} \
+&& yes | code --no-sandbox --user-data-dir /roo/.vscode-template --install-extension rust-lang.rust-analyzer@${RUST_EXT_VERSION}
 
 # Copy evals
 ARG EVALS_COMMIT=main
@@ -128,7 +128,7 @@ RUN cp -r /roo/.vscode-template /roo/.vscode
 
 # Build the Roo Code extension
 RUN pnpm vsix -- --out ../bin/roo-code.vsix \
-&& code --no-sandbox --user-data-dir /roo/.vscode --install-extension bin/roo-code.vsix
+&& yes | code --no-sandbox --user-data-dir /roo/.vscode --install-extension bin/roo-code.vsix
 
 # Copy entrypoint script
 COPY packages/evals/.docker/entrypoints/runner.sh /usr/local/bin/entrypoint.sh
2 changes: 1 addition & 1 deletion packages/evals/README.md
@@ -35,7 +35,7 @@ The initial build process can take a minute or two. Upon success you should see
Additionally, you'll find in Docker Desktop that database and redis services are running:
<img width="1283" alt="Screenshot 2025-06-05 at 12 07 09 PM" src="https://github.com/user-attachments/assets/ad75d791-9cc7-41e3-8168-df7b21b49da2" />

-Navigate to [localhost:3000](http://localhost:3000/) in your browser and click the 🚀 button.
+Navigate to [localhost:3446](http://localhost:3446/) in your browser and click the 🚀 button.

By default a evals run will run all programming exercises in [Roo Code Evals](https://github.com/RooCodeInc/Roo-Code-Evals) repository with the Claude Sonnet 4 model and default settings. For basic configuration you can specify the LLM to use and any subset of the exercises you'd like. For advanced configuration you can import a Roo Code settings file which will allow you to run the evals with Roo Code configured any way you'd like (this includes custom modes, a footgun prompt, etc).

2 changes: 1 addition & 1 deletion packages/evals/docker-compose.yml
@@ -56,7 +56,7 @@ services:
context: ../../
dockerfile: packages/evals/Dockerfile.web
ports:
-- "${EVALS_WEB_PORT:-3000}:3000"
+- "${EVALS_WEB_PORT:-3446}:3000"
environment:
- HOST_EXECUTION_METHOD=docker
volumes:
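
Note on the port mapping in this hunk: `${EVALS_WEB_PORT:-3446}` is default-value interpolation, which Docker Compose resolves the same way a POSIX shell does. The host port falls back to 3446 when `EVALS_WEB_PORT` is unset or empty, while the container port stays 3000. A minimal shell sketch of that resolution, where `resolve_port` is a hypothetical helper and not part of this PR:

```shell
#!/bin/bash
# Hypothetical helper (not in the PR): prints the host port that the mapping
# "${EVALS_WEB_PORT:-3446}:3000" would publish for the current environment.
resolve_port() {
  # ${VAR:-default} falls back when VAR is unset OR set to the empty string.
  echo "${EVALS_WEB_PORT:-3446}"
}
```

For example, calling `resolve_port` after `EVALS_WEB_PORT=4000` prints `4000`, which corresponds to overriding the host port when starting the stack.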
2 changes: 1 addition & 1 deletion packages/evals/scripts/setup.sh
@@ -386,5 +386,5 @@ if ! nc -z localhost 3000; then
echo "💡 You can start it anytime with 'pnpm --filter @roo-code/web-evals dev'."
fi
else
-echo "👟 The evals web app is running at http://localhost:3000"
+echo "👟 The evals web app is running at http://localhost:3000 (or http://localhost:3446 if using Docker)"
fi
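
Since the new message names two possible URLs, one way to report only the one that is actually reachable is to probe both ports. The sketch below is an illustration, not code from this PR: `find_evals_url`, `port_open`, and the `PROBE` override are hypothetical names, and the port list assumes `EVALS_WEB_PORT` has not been overridden.

```shell
#!/bin/bash
# Hypothetical default probe: succeeds if something is listening on the port.
# Uses the same `nc -z localhost <port>` check that setup.sh already relies on.
port_open() {
  nc -z localhost "$1" 2>/dev/null
}

# Hypothetical helper: prints the first evals URL whose port accepts
# connections. 3000 is the local dev port and 3446 the Docker default from
# this diff. PROBE (an assumption) lets callers swap in a different check.
find_evals_url() {
  local port
  for port in 3000 3446; do
    if "${PROBE:-port_open}" "$port"; then
      echo "http://localhost:${port}"
      return 0
    fi
  done
  return 1
}
```

Usage could look like `url=$(find_evals_url) && echo "👟 The evals web app is running at ${url}"`.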