Prompt tweak #5

pamelafox · 2025-02-10T22:53:43Z

Purpose

Does this introduce a breaking change?

When developers merge from main and run the server, azd up, or azd deploy, will this produce an error?
If you're not sure, try it out on an old environment.

[ ] Yes
[ ] No

Does this require changes to learn.microsoft.com docs?

This repository is referenced by this tutorial, which includes deployment, settings, and usage instructions. If text or screenshots need to change in the tutorial,
check the box below and notify the tutorial author. A Microsoft employee can do this for you if you're an external contributor.

[ ] Yes
[ ] No

Type of change

[ ] Bugfix
[ ] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[ ] Other... Please describe:

Code quality checklist

See CONTRIBUTING.md for more details.

The current tests all pass (python -m pytest).
I added tests that prove my fix is effective or that my feature works.
I ran python -m pytest --cov to verify 100% coverage of added lines.
I ran python -m mypy to check for type errors
I either used the pre-commit hooks or ran ruff and black manually on my code.

pamelafox · 2025-02-10T22:53:50Z

/evaluate

github-actions · 2025-02-10T22:54:05Z

Starting evaluation! Check the Actions tab for progress, or wait for a comment with the results.

github-actions · 2025-02-10T23:09:14Z

Evaluation results

metric	stat	baseline	gpt-4o-mini	pr5
gpt_groundedness	mean_rating	4.94	4.9	4.82
gpt_groundedness	pass_rate	0.98	0.98	0.98
gpt_relevance	mean_rating	4.42	4.54	4.26
gpt_relevance	pass_rate	0.98	0.96	0.96
answer_length	mean	667.7	934.36	618.3
latency	mean	2.96	3.8	3.0
citations_matched	rate	0.45	0.53	0.43
any_citation	rate	1.0	1.0	1.0

Check the workflow run for more details.

pamelafox · 2025-02-10T23:12:10Z

/evaluate

github-actions · 2025-02-10T23:12:22Z

Starting evaluation! Check the Actions tab for progress, or wait for a comment with the results.

github-actions · 2025-02-11T00:57:50Z

Starting evaluation! Check the Actions tab for progress, or wait for a comment with the results.

github-actions · 2025-02-11T01:12:42Z

Evaluation results

metric	stat	baseline	gpt-4o-mini	pr5
gpt_groundedness	mean_rating	4.94	4.9	4.9
gpt_groundedness	pass_rate	0.98	0.98	0.98
gpt_relevance	mean_rating	4.42	4.54	4.3
gpt_relevance	pass_rate	0.98	0.96	0.98
answer_length	mean	667.7	934.36	461.2
latency	mean	2.96	3.8	2.48
citations_matched	rate	0.45	0.53	0.45
any_citation	rate	1.0	1.0	1.0

Check the workflow run for more details.
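To make the two result comments easier to compare, here is a minimal sketch that computes how far the pr5 column sits from the baseline column in each run. The numbers are copied from the tables in this conversation; the names BASELINE, PR5_RUNS, and deltas are hypothetical and are not part of the evaluation workflow. The thread does not say why the two runs differ, so the script only reports the arithmetic.

```python
# Mean/rate values copied from the two "Evaluation results" comments.
# BASELINE, PR5_RUNS, and deltas() are illustrative names, not workflow code.
BASELINE = {
    "gpt_groundedness": 4.94,   # mean_rating
    "gpt_relevance": 4.42,      # mean_rating
    "answer_length": 667.7,     # mean
    "latency": 2.96,            # mean
    "citations_matched": 0.45,  # rate
}

# pr5 column, keyed by the timestamp of the comment that reported it.
PR5_RUNS = {
    "2025-02-10T23:09:14Z": {
        "gpt_groundedness": 4.82,
        "gpt_relevance": 4.26,
        "answer_length": 618.3,
        "latency": 3.0,
        "citations_matched": 0.43,
    },
    "2025-02-11T01:12:42Z": {
        "gpt_groundedness": 4.9,
        "gpt_relevance": 4.3,
        "answer_length": 461.2,
        "latency": 2.48,
        "citations_matched": 0.45,
    },
}


def deltas(candidate, baseline):
    """Return candidate minus baseline for every baseline metric, rounded to 2 places."""
    return {metric: round(candidate[metric] - baseline[metric], 2) for metric in baseline}


if __name__ == "__main__":
    for timestamp, run in PR5_RUNS.items():
        print(timestamp)
        for metric, delta in deltas(run, BASELINE).items():
            print(f"  {metric}: {delta:+.2f}")
```

A negative delta means pr5 scored lower (or answered shorter or faster) than the baseline for that metric.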

github-actions · 2025-09-12T02:01:40Z

This PR is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed.

pamelafox added 30 commits February 10, 2025 10:21

Add evaluation workflow

d593dc2

Trying to trigger workflow

a7bbd8b

Remove conditional

d7bdddd

Update workflow

a8b2beb

Add back old python file

9fbf046

New branch for eval

f28283c

Fix uv

f6dd98b

Remove python tests for now

0cac252

New PR for eval

e564b24

Add debug

a2c8469

Add workflow dispatch

9916bfc

Add workflow dispatch

934129c

Remove comment for now

68f9abe

Add workflow push

7c95d88

Add checkout

7b022b8

Try azd env new first

f932ef9

Try refresh

550ee3f

Add env config

feb7a00

Fix the action vars

36121f6

Fix local server start

1a3e00e

Fix app run

1050b50

logs pos

d07c263

Run app directly

f11813f

nohup

a076539

Log more

182c310

Logger calls

13b3f78

Fix log calls

340a411

Remove empty string values

86bd5eb

Ask less questions

c51afed

Evaluate all questions

a197f3c

pamelafox added 3 commits February 10, 2025 14:49

Base on comment

062e9b8

Base on comment

c4861fe

Prompt tweak

aac86c1

Prompt tweak

f984166

pamelafox mentioned this pull request Feb 11, 2025

Evaluation workflow for GitHub Actions Azure-Samples/azure-search-openai-demo#2350

Merged

5 tasks

pamelafox force-pushed the main branch from d7b105d to e873ba9 Compare February 11, 2025 18:46

github-actions bot added the Stale label Sep 12, 2025
