Skip to content

test LLM output for semantic similarity using vector embeddings#61

Merged
paulz merged 36 commits intothisisartium:mainfrom
paulz:fix_example_1
Mar 24, 2025
Merged

test LLM output for semantic similarity using vector embeddings#61
paulz merged 36 commits intothisisartium:mainfrom
paulz:fix_example_1

Conversation

@paulz
Copy link
Contributor

@paulz paulz commented Mar 24, 2025

Add example how to test LLM output for semantic similarity using vector embeddings.

Snapshot testing is allows capture embeddings vector and notice when it changes.

This pull request includes significant updates to the examples/team_recommender/tests/example_1_text_response module, focusing on enhancing the functionality and improving the accuracy of the embeddings and similarity computations. The most important changes include the addition of new functions for embedding stabilization, new test cases, and updates to existing test cases to ensure robustness.

Enhancements to embeddings and similarity computations:

Updates to test cases:

Removal of outdated test data:

carl and others added 30 commits March 19, 2025 16:57
- update cosine similarity tests
… assertion from less than to higher than for log message
…testing for embedding object is not reliable
…plement snapshot loading for embedding equivalence tests
…t and implement snapshot loading for embedding equivalence tests"

This reverts commit d72fc74.
…ood_fit_for_project.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…ood_fit_for_project.py


add tolerance_margin = 0.05

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@paulz paulz marked this pull request as ready for review March 24, 2025 18:42
@paulz paulz changed the title Fix example 1 test LLM output for semantic similarity using vector embeddings Mar 24, 2025
@carl
Copy link
Contributor

carl commented Mar 24, 2025

🐻 approve

@paulz paulz merged commit e4ca299 into thisisartium:main Mar 24, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants