SCHOL-273: Run /chat API tests in local builds by jdohan · Pull Request #1025 · NYPL/digital-research-books

jdohan · 2026-03-25T22:48:30Z

Changes summary

Enables running tests for the /chat API endpoint against a fresh local dockerized build by introducing two scripts to generate and seed the local Postgres database with FRBR graph data associated with two known editions.

Only two editions were chosen as of right now since they're always returned for the prompt currently used in the corresponding /chat smoke tests. Even though /chat responses are non-deterministic, follow-up work that wires hybrid search to a dedicated testing namespace in the vector DB for builds in CI (and local, if desired) will minimize the potential for test flakiness.

Tests supported

The /chat API tests—while they exist in this feature branch—were not directly added to it and they do not yet exist in main (see #947) due to setup blockers resolved by NOREF/aws-auth-docker-compose, which pulled in the tests along with the crucial setup. This branch was merged with that one, therefore this PR includes the tests themselves and the ability to run them in CI.

Fetching data for the local Postgres DB

The script that handles generating the seed data has been committed to the dev-scripts folder along with these changes so that it can be used continually to add more as the test suite scales. If a different location for this script is desired (e.g. within the test suite), please indicate so.

Question:
Is there anything sensitive in the current seed data file? If so, it can instead be generated fresh each time in CI then cleaned up after, however that introduces an external dependency during test setup (i.e. reading from the production Postgres DB) along with increased run times.

Tests workflow

A new step has been added to the respective workflow to handle local seeding before the VRA integration tests are run.

Question:
Is there a different ordering desired for this step?

Re: ordering

The functional test run in CI was relocated so that it's run after integration tests due to a failure that was occurring which ended the job before the impact of the changes within immediate scope could be witnessed in CI, however that failure is no longer occurring in the latest commit (unsure why, but it isn't).

Is there any objection to keeping the test run ordering this way?

Unit tests
Integration tests for DRB (some of which may no longer be relevant once VRA goes live)
Integration tests for VRA
- /chat API tests
Functional tests

Functional tests are usually run after integration tests in a typical layered testing approach, but there is value in running them first to fail faster in CI if there is something awry.

Documentation

The README has been updated to include instructions on using this new alternate method of seeding a local DB.

How to test

Start by running make down-clean from etl-pipeline/ to wipe any existing Docker containers, networks, volumes, etc., since the goal of these changes is to ensure VRA non-unit tests can run in CI against freshly-built local dockerized services.

Then run the commands given in the docstring comment at the top of seed_frbr_data.py followed by make intregration-vra (defaults to local without specifying ENVIRONMENT) while local dockerized services are up.

Ensure all tests pass locally and in CI.

Perhaps more importantly, inspect the /chat response yielded from the test query to verify it is well-formed. Reviewers added to this PR are more familiar with that response than I am currently. Note that while logs may indicate hits on editions other than the two with associated FRBR data, only those two are returned as results.

ONLY for backend/etl-pipeline!

- allow refresh of tokens in docker container - remove dummy aws env vars - add google api key fo docker env - update create vra user/password fixture

…-tests

…o NOREF/aws-auth-docker-compose

- document - improve error clarity frbrized_record_data - read post ETL pipeline values by record.source_id to prevent confusion is cases where the same record data inserted multiple times into the DB resulting in multiple Records with the same source_id

vercel · 2026-03-25T22:48:35Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
digital-research-books	Ready	Preview, Comment	Mar 31, 2026 9:11pm

jdohan and others added 30 commits February 20, 2026 15:35

Add folder for assistant tests

25eab77

Add conftest for assistant with an auth fixture

5ec6811

Add scalable chat endpoint smoke test

d67dc71

Add VRA tests to ETL pipeline tests workflow

b4454b4

Change test cases list to a constant

ee0dd7f

Split DRB and VRA integration tests

709fa8e

Update query to return results

4049ef8

Apply ruff formatting

c8ef519

Update workflow name for consistency

adf0707

Update user prompt

1d11ca9

Avoid passing secrets in argumen to test method

89d3da9

inherit aws auth in docker compose

9c1691c

rename API_KEY to VRA_API_KEY

4f7d943

ONLY for backend/etl-pipeline!

rename local-compose to docker-compose

f25368a

update create vra user/password fixture

be27644

- allow refresh of tokens in docker container - remove dummy aws env vars - add google api key fo docker env - update create vra user/password fixture

Merge branch 'main' into NOREF/aws-auth-docker-compose

653864e

add TP region + prevent password leak in traceback

852ed36

Merge w/ main

a924bb9

move vra user ssm param to qa env

7bde92b

stop returning password in fixture

022fdde

revert to local env ssm params

f00047c

Merge branch 'main' into SCHOL-273/chat-endpoint-catalog-search-smoke…

5ff3429

…-tests

Merge branch 'main' into NOREF/aws-auth-docker-compose

463ad94

Merge branch 'SCHOL-273/chat-endpoint-catalog-search-smoke-tests' int…

2928db8

…o NOREF/aws-auth-docker-compose

add aws auth to devsetup service

fff88f6

add superuser pswd back

e53206f

create_or_update_record

b780a61

- document - improve error clarity frbrized_record_data - read post ETL pipeline values by record.source_id to prevent confusion is cases where the same record data inserted multiple times into the DB resulting in multiple Records with the same source_id

Merge branch 'main' into NOREF/aws-auth-docker-compose

27274d6

code comments

49e2f65

fix Timer call

84e2f84

Add instructions for seeding DB using data file

c7d9766

Format with ruff

c025a22

vercel bot deployed to Preview March 25, 2026 22:55 View deployment

jdohan added 5 commits March 25, 2026 19:04

Shorten step name for brevity

9002e6c

Update description and usage instructions

b54659a

Minor updates to comments

b14c816

Add script for getting FRBR graph data for seeding

af4db22

Run functional tests after integration tests

cd8b96a

vercel bot deployed to Preview March 26, 2026 13:22 View deployment

jdohan requested review from alea12 and bantucaravan and removed request for bantucaravan March 26, 2026 14:01

Update usage command to improve clarity

666eacb

vercel bot deployed to Preview March 26, 2026 15:42 View deployment

Remove comment

1ed1252

vercel bot deployed to Preview March 26, 2026 15:49 View deployment

jdohan mentioned this pull request Mar 26, 2026

SCHOL-273: Chat endpoint catalog search smoke tests #947

Closed

Update commands in workflow and README for clarity

4c9a5e5

vercel bot deployed to Preview March 26, 2026 16:25 View deployment

jdohan changed the title ~~SCHOL-273: Seed local database for VRA API tests~~ SCHOL-273: Run /chat API tests in CI Mar 26, 2026

jdohan changed the title ~~SCHOL-273: Run /chat API tests in CI~~ SCHOL-273: Run /chat API tests in local builds Mar 26, 2026

vercel bot deployed to Preview March 27, 2026 19:10 View deployment

Overwrite dev script with contents from main

34b9053

jdohan force-pushed the SCHOL-273/assistant-api-tests-setup branch from 05398e3 to 34b9053 Compare March 27, 2026 20:11

vercel bot deployed to Preview March 27, 2026 20:12 View deployment

jdohan mentioned this pull request Mar 31, 2026

SCHOL 399: merge etl pipeline QA env tests #1033

Open

Update comment for clarity

9b31705

vercel bot deployed to Preview March 31, 2026 21:11 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SCHOL-273: Run /chat API tests in local builds#1025

SCHOL-273: Run /chat API tests in local builds#1025
jdohan wants to merge 46 commits intomainfrom
SCHOL-273/assistant-api-tests-setup

jdohan commented Mar 25, 2026 •

edited

Loading

Uh oh!

vercel bot commented Mar 25, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jdohan commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes summary

Tests supported

Fetching data for the local Postgres DB

Tests workflow

Re: ordering

Documentation

How to test

Uh oh!

vercel bot commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jdohan commented Mar 25, 2026 •

edited

Loading

vercel bot commented Mar 25, 2026 •

edited

Loading