Skip to content

Commit 618af52

Browse files
committed
Forgot these
1 parent 004ada2 commit 618af52

File tree

3 files changed

+5
-6
lines changed

3 files changed

+5
-6
lines changed

packages/evals/README.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -29,13 +29,13 @@ Start the evals service:
2929
docker compose -f packages/evals/docker-compose.yml --profile server --profile runner up --build --scale runner=0
3030
```
3131

32-
The initial build process can take a minute or two. Upon success you should see ouput indicating that a web service is running on [localhost:3000](http://localhost:3000/):
32+
The initial build process can take a minute or two. Upon success you should see ouput indicating that a web service is running on [localhost:8080](http://localhost:8080/):
3333
<img width="1182" alt="Screenshot 2025-06-05 at 12 05 38 PM" src="https://github.com/user-attachments/assets/34f25a59-1362-458c-aafa-25e13cdb2a7a" />
3434

3535
Additionally, you'll find in Docker Desktop that database and redis services are running:
3636
<img width="1283" alt="Screenshot 2025-06-05 at 12 07 09 PM" src="https://github.com/user-attachments/assets/ad75d791-9cc7-41e3-8168-df7b21b49da2" />
3737

38-
Navigate to [localhost:3000](http://localhost:3000/) in your browser and click the 🚀 button.
38+
Navigate to [localhost:8080](http://localhost:8080/) in your browser and click the 🚀 button.
3939

4040
By default a evals run will run all programming exercises in [Roo Code Evals](https://github.com/RooCodeInc/Roo-Code-Evals) repository with the Claude Sonnet 4 model and default settings. For basic configuration you can specify the LLM to use and any subset of the exercises you'd like. For advanced configuration you can import a Roo Code settings file which will allow you to run the evals with Roo Code configured any way you'd like (this includes custom modes, a footgun prompt, etc).
4141

@@ -68,7 +68,6 @@ To stop an evals run early you can simply stop the "controller" container using
6868

6969
<img width="1302" alt="Screenshot 2025-06-06 at 9 00 41 AM" src="https://github.com/user-attachments/assets/a9d4725b-730c-441a-ba24-ac99f9599ced" />
7070

71-
7271
## Advanced Usage / Debugging
7372

7473
The evals system runs VS Code headlessly in Docker containers for consistent, reproducible environments. While this design ensures reliability, it can make debugging more challenging. For debugging purposes, you can run the system locally on macOS, though this approach is less reliable due to hardware and environment variability.

packages/evals/docker-compose.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -52,7 +52,7 @@ services:
5252
context: ../../
5353
dockerfile: packages/evals/Dockerfile.web
5454
ports:
55-
- "3000:3000"
55+
- "8080:3000"
5656
environment:
5757
- HOST_EXECUTION_METHOD=docker
5858
volumes:

packages/evals/scripts/setup.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -377,7 +377,7 @@ fi
377377

378378
echo -e "\n🚀 You're ready to rock and roll! \n"
379379

380-
if ! nc -z localhost 3000; then
380+
if ! nc -z localhost 8080; then
381381
read -p "🌐 Would you like to start the evals web app? (Y/n): " start_evals
382382

383383
if [[ "$start_evals" =~ ^[Yy]|^$ ]]; then
@@ -386,5 +386,5 @@ if ! nc -z localhost 3000; then
386386
echo "💡 You can start it anytime with 'pnpm --filter @roo-code/web-evals dev'."
387387
fi
388388
else
389-
echo "👟 The evals web app is running at http://localhost:3000"
389+
echo "👟 The evals web app is running at http://localhost:8080"
390390
fi

0 commit comments

Comments
 (0)