feat: add a script for running an evaluated app locally #29

crisbeto · 2025-09-19T12:58:36Z

Adds the `web-codegen-scorer run` script that allows users to run an evaluated app in their browser. It spins up a server using the local LLM output and the existing environment config.

devversion · 2025-09-19T13:09:07Z

runner/run-cli.ts

+
+async function resolveConfig(options: Options) {
+  if (!options.environment) {
+    throw new UserFacingError(


We aren't using Yargs' `demandOption` here because we want this more helpful error, right?

Yep, the Yargs error isn't super readable.
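To illustrate the trade-off discussed here, a minimal sketch of a manual check that throws a readable error instead of relying on `demandOption`. The `UserFacingError` class, the `Options` shape, the `requireEnvironment` name, and the error wording below are assumptions for illustration, not the code from this PR.

```typescript
// Hypothetical stand-in for the project's UserFacingError; the real class may differ.
class UserFacingError extends Error {
  constructor(message: string) {
    super(message);
    this.name = 'UserFacingError';
    // Keeps `instanceof` working even when compiling to older targets.
    Object.setPrototypeOf(this, new.target.prototype);
  }
}

// Only the fields mentioned in this review; the real Options type likely has more.
interface Options {
  environment?: string;
  prompt?: string;
}

// Validates the option by hand so the message can be written for humans.
function requireEnvironment(options: Options): string {
  if (!options.environment) {
    // Illustrative wording only; the PR's actual message is not shown in the diff.
    throw new UserFacingError(
      'No environment was specified. Pass the path to an environment config.'
    );
  }
  return options.environment;
}
```

With this approach the option stays optional as far as Yargs is concerned, and the CLI decides how the missing value is reported.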

devversion · 2025-09-19T13:09:51Z

runner/run-cli.ts

+    .option('prompt', {
+      type: 'string',
+      default: '',
+      description: 'Prompt within the environment that should be run',


I think the description is a bit ambiguous/confusing. Is this a path to a prompt, or a basename? Maybe it should be a path?

It's actually the ID of the prompt within `llm-output/<environment.id>`. I'll update the message.
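As a rough sketch of what "an ID within `llm-output/<environment.id>`" could mean in practice, the helper below joins the two IDs into a directory path. It assumes each prompt's output lives in a subdirectory named after the prompt ID; that layout, the function name, and the `outputRoot` parameter are assumptions, not something this PR confirms.

```typescript
import { join } from 'node:path';

// Hypothetical helper: builds the directory for one prompt's output,
// assuming a layout of <outputRoot>/<environmentId>/<promptId>.
function resolvePromptOutputDir(
  environmentId: string,
  promptId: string,
  outputRoot = 'llm-output'
): string {
  return join(outputRoot, environmentId, promptId);
}
```

Under that assumption, the `prompt` option takes a bare ID such as `my-prompt`, not a file path or a basename with an extension.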

devversion · 2025-09-19T13:10:15Z

runner/run-cli.ts

+      console.error(
+        chalk.red('An error occurred during the assessment process:')
+      );
+      console.error(chalk.red(error));


Should we print the stack trace in those cases (if available)?

For the UserFacingError we don't print the stack trace, because it can be noisy. It's meant for more readable errors that we produce (e.g. the environment path is wrong).

Right, but could we print it when the error isn't a `UserFacingError`?

Ah yeah, in that case we should just throw. I copied this over from the eval-cli 😅
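The outcome of this thread can be sketched as follows: print only the message for a `UserFacingError`, and rethrow anything else so the runtime reports it with its stack trace. The class definition, the `reportCliError` name, and the injectable `log` parameter are assumptions for illustration; the merged code may be structured differently.

```typescript
// Hypothetical stand-in for the project's UserFacingError.
class UserFacingError extends Error {
  constructor(message: string) {
    super(message);
    this.name = 'UserFacingError';
    Object.setPrototypeOf(this, new.target.prototype);
  }
}

// Prints readable errors without a stack trace; rethrows everything else
// so unexpected failures surface with their full stack.
function reportCliError(
  error: unknown,
  log: (message: string) => void = console.error
): void {
  if (error instanceof UserFacingError) {
    log(error.message);
    return;
  }
  throw error;
}
```

A caller would wrap the command body in `try`/`catch` and pass the caught value to `reportCliError`, so only errors the tool produced on purpose are shortened.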

crisbeto requested a review from devversion September 19, 2025 12:58

crisbeto force-pushed the run-script branch 2 times, most recently from 9db6609 to d494e7a Compare September 19, 2025 13:02

devversion reviewed Sep 19, 2025

crisbeto force-pushed the run-script branch from d494e7a to 3d2ee12 Compare September 19, 2025 13:35

devversion approved these changes Sep 19, 2025

feat: add a script for running an evaluated app locally

4380765

crisbeto force-pushed the run-script branch from 3d2ee12 to 4380765 Compare September 19, 2025 13:45

crisbeto merged commit d4ae0a6 into angular:main Sep 19, 2025
3 checks passed
