Automated Integration Test Goldens Update from CI #5751
Google Cloud Build / website-pull-request-nl (datcom-ci)
failed
Nov 27, 2025 in 6m 51s
Summary
Build Information
| Trigger | website-pull-request-nl |
| Build | 4c4de92e-6d01-4cdb-9f70-76025672012c |
| Start | 2025-11-27T11:19:46-08:00 |
| Duration | 5m51.58s |
| Status | FAILURE |
Steps
| Step | Status | Duration |
|---|---|---|
| setup_python | SUCCESS | 3m22.402s |
| explore_tests | FAILURE | 2m6.033s |
| nl_tests | SUCCESS | 1m50.579s |
Details
starting build "4c4de92e-6d01-4cdb-9f70-76025672012c"
FETCHSOURCE
From https://github.com/datacommonsorg/website
* branch 5380acf33d2c61b45a73d2c3a81ca5893d16bca5 -> FETCH_HEAD
Updating files: 88% (2178/2448)
Updating files: 89% (2179/2448)
Updating files: 90% (2204/2448)
Updating files: 91% (2228/2448)
Updating files: 92% (2253/2448)
Updating files: 93% (2277/2448)
Updating files: 94% (2302/2448)
Updating files: 95% (2326/2448)
Updating files: 96% (2351/2448)
Updating files: 97% (2375/2448)
Updating files: 98% (2400/2448)
Updating files: 99% (2424/2448)
Updating files: 100% (2448/2448)
Updating files: 100% (2448/2448), done.
HEAD is now at 5380acf feat: Update goldens from Cloud Build workflow (build a690214d-bdb0-4633-b54d-3b4fa9167e69)
GitCommit:
5380acf33d2c61b45a73d2c3a81ca5893d16bca5
BUILD
Starting Step #0 - "setup_python"
Step #0 - "setup_python": Pulling image: python:3.11.3
Step #0 - "setup_python": 3.11.3: Pulling from library/python
Step #0 - "setup_python": bd73737482dd: Pulling fs layer
Step #0 - "setup_python": 6710592d62aa: Pulling fs layer
Step #0 - "setup_python": 75256935197e: Pulling fs layer
Step #0 - "setup_python": c1e5026c6457: Pulling fs layer
Step #0 - "setup_python": f0016544b8b9: Pulling fs layer
Step #0 - "setup_python": 1d58eee51ff2: Pulling fs layer
Step #0 - "setup_python": 93dc7b704cd1: Pulling fs layer
Step #0 - "setup_python": caefdefa531e: Pulling fs layer
Step #0 - "setup_python": 93dc7b704cd1: Waiting
Step #0 - "setup_python": caefdefa531e: Waiting
Step #0 - "setup_python": f0016544b8b9: Verifying Checksum
Step #0 - "setup_python": f0016544b8b9: Download complete
Step #0 - "setup_python": 6710592d62aa: Verifying Checksum
Step #0 - "setup_python": 6710592d62aa: Download complete
Step #0 - "setup_python": 1d58eee51ff2: Verifying Checksum
Step #0 - "setup_python": 1d58eee51ff2: Download complete
Step #0 - "setup_python": bd73737482dd: Verifying Checksum
Step #0 - "setup_python": bd73737482dd: Download complete
Step #0 - "setup_python": 93dc7b704cd1: Verifying Checksum
Step #0 - "setup_python": 93dc7b704cd1: Download complete
Step #0 - "setup_python": 75256935197e: Verifying Checksum
Step #0 - "setup_python": 75256935197e: Download complete
Step #0 - "setup_python": caefdefa531e: Verifying Checksum
Step #0 - "setup_python": caefdefa531e: Download complete
Step #0 - "setup_python": c1e5026c6457: Verifying Checksum
Step #0 - "setup_python": c1e5026c6457: Download complete
Step #0 - "setup_python": bd73737482dd: Pull complete
Step #0 - "setup_python": 6710592d62aa: Pull complete
Step #0 - "setup_python": 75256935197e: Pull complete
Step #0 - "setup_python": c1e5026c6457: Pull complete
Step #0 - "setup_python": f0016544b8b9: Pull complete
Step #0 - "setup_python": 1d58eee51ff2: Pull complete
Step #0 - "setup_python": 93dc7b704cd1: Pull complete
Step #0 - "setup_python": caefdefa531e: Pull complete
Step #0 - "setup_python": Digest: sha256:3a619e3c96fd4c5fc5e1998fd4dcb1f1403eb90c4c6409c70d7e80b9468df7df
Step #0 - "setup_python": Status: Downloaded newer image for python:3.11.3
Step #0 - "setup_python": docker.io/library/python:3.11.3
Step #0 - "setup_python": --setup_python ### Set up python environment
Step #0 - "setup_python": installing server/requirements.txt
Step #0 - "setup_python": DEPRECATION: langdetect is being installed using the legacy 'setup.py install' method, because it does not have a 'pyproject.toml' and the 'wheel' package is not installed. pip 23.1 will enforce this behaviour change. A possible replacement is to enable the '--use-pep517' option. Discussion can be found at https://github.com/pypa/pip/issues/8559
Step #0 - "setup_python": DEPRECATION: data-gemma is being installed using the legacy 'setup.py install' method, because it does not have a 'pyproject.toml' and the 'wheel' package is not installed. pip 23.1 will enforce this behaviour change. A possible replacement is to enable the '--use-pep517' option. Discussion can be found at https://github.com/pypa/pip/issues/8559
Step #0 - "setup_python": DEPRECATION: flask_testing is being installed using the legacy 'setup.py install' method, because it does not have a 'pyproject.toml' and the 'wheel' package is not installed. pip 23.1 will enforce this behaviour change. A possible replacement is to enable the '--use-pep517' option. Discussion can be found at https://github.com/pypa/pip/issues/8559
Step #0 - "setup_python":
Step #0 - "setup_python": [notice] A new release of pip available: 22.3.1 -> 25.3
Step #0 - "setup_python": [notice] To update, run: pip install --upgrade pip
Step #0 - "setup_python": Looking in indexes: https://pypi.org/simple, https://download.pytorch.org/whl/cpu
Step #0 - "setup_python": Collecting torch==2.2.2
Step #0 - "setup_python": Downloading https://download.pytorch.org/whl/cpu/torch-2.2.2%2Bcpu-cp311-cp311-linux_x86_64.whl (186.8 MB)
Step #0 - "setup_python": ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 186.8/186.8 MB 11.0 MB/s eta 0:00:00
Step #0 - "setup_python": Collecting filelock
Step #0 - "setup_python": Downloading filelock-3.20.0-py3-none-any.whl (16 kB)
Step #0 - "setup_python": Requirement already satisfied: typing-extensions>=4.8.0 in ./.env/lib/python3.11/site-packages (from torch==2.2.2) (4.12.2)
Step #0 - "setup_python": Collecting sympy
Step #0 - "setup_python": Obtaining dependency information for sympy from https://download.pytorch.org/whl/sympy-1.14.0-py3-none-any.whl.metadata
Step #0 - "setup_python": Downloading https://download.pytorch.org/whl/sympy-1.14.0-py3-none-any.whl.metadata (12 kB)
Step #0 - "setup_python": Collecting networkx
Step #0 - "setup_python": Downloading networkx-3.6-py3-none-any.whl (2.1 MB)
Step #0 - "setup_python": ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.1/2.1 MB 16.6 MB/s eta 0:00:00
Step #0 - "setup_python": Requirement already satisfied: jinja2 in ./.env/lib/python3.11/site-packages (from torch==2.2.2) (3.1.6)
Step #0 - "setup_python": Collecting fsspec
Step #0 - "setup_python": Downloading fsspec-2025.10.0-py3-none-any.whl (200 kB)
Step #0 - "setup_python": ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.0/201.0 kB 21.3 MB/s eta 0:00:00
Step #0 - "setup_python": Requirement already satisfied: MarkupSafe>=2.0 in ./.env/lib/python3.11/site-packages (from jinja2->torch==2.2.2) (2.1.2)
Step #0 - "setup_python": Collecting mpmath<1.4,>=1.1.0
Step #0 - "setup_python": Downloading https://download.pytorch.org/whl/mpmath-1.3.0-py3-none-any.whl (536 kB)
Step #0 - "setup_python": ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 64.1 MB/s eta 0:00:00
Step #0 - "setup_python": Downloading https://download.pytorch.org/whl/sympy-1.14.0-py3-none-any.whl (6.3 MB)
Step #0 - "setup_python": ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.3/6.3 MB 121.0 MB/s eta 0:00:00
Step #0 - "setup_python": Using cached https://download.pytorch.org/whl/sympy-1.14.0-py3-none-any.whl (6.3 MB)
Step #0 - "setup_python": Installing collected packages: mpmath, sympy, networkx, fsspec, filelock, torch
Step #0 - "setup_python": Successfully installed filelock-3.20.0 fsspec-2025.10.0 mpmath-1.3.0 networkx-3.6 sympy-1.14.0 torch-2.2.2+cpu
Step #0 - "setup_python":
Step #0 - "setup_python": [notice] A new release of pip available: 22.3.1 -> 25.3
Step #0 - "setup_python": [notice] To update, run: pip install --upgrade pip
Step #0 - "setup_python": installing nl_server/requirements.txt
Step #0 - "setup_python":
Step #0 - "setup_python": [notice] A new release of pip available: 22.3.1 -> 25.3
Step #0 - "setup_python": [notice] To update, run: pip install --upgrade pip
Finished Step #0 - "setup_python"
Starting Step #1 - "explore_tests"
Starting Step #2 - "nl_tests"
Step #1 - "explore_tests": Already have image (with digest): python:3.11.3
Step #2 - "nl_tests": Already have image (with digest): python:3.11.3
Step #1 - "explore_tests": --explore ### Running explore page integration tests
Step #1 - "explore_tests": Using ENV_PREFIX=Staging
Step #1 - "explore_tests": Starting servers using run_servers.sh...
Step #1 - "explore_tests": FLASK_ENV=integration_test
Step #1 - "explore_tests": GOOGLE_CLOUD_PROJECT=datcom-website-staging
Step #1 - "explore_tests": ENABLE_MODEL=true
Step #1 - "explore_tests": Starting NL Server...
Step #1 - "explore_tests": Starting Website server...
Step #2 - "nl_tests": --nl ### Running nl page integration tests
Step #2 - "nl_tests": Using ENV_PREFIX=Staging
Step #2 - "nl_tests": Starting servers using run_servers.sh...
Step #2 - "nl_tests": FLASK_ENV=integration_test
Step #2 - "nl_tests": GOOGLE_CLOUD_PROJECT=datcom-website-staging
Step #2 - "nl_tests": ENABLE_MODEL=true
Step #2 - "nl_tests": Starting NL Server...
Step #2 - "nl_tests": Starting Website server...
Step #1 - "explore_tests": [19:23:36][INFO ][config_reader.py:120] Loading index and model catalog from: /workspace/nl_server/../deploy/helm_charts/dc_website/nl/catalog.yaml
Step #1 - "explore_tests": [19:23:36][INFO ][config_reader.py:86] server config:
Step #1 - "explore_tests": {
Step #1 - "explore_tests": "version": "1",
Step #1 - "explore_tests": "default_indexes": [
Step #1 - "explore_tests": "base_uae_mem"
Step #1 - "explore_tests": ],
Step #1 - "explore_tests": "indexes": {
Step #1 - "explore_tests": "base_uae_mem": {
Step #1 - "explore_tests": "store_type": "MEMORY",
Step #1 - "explore_tests": "source_path": "/workspace/tools/nl/embeddings/input/base",
Step #1 - "explore_tests": "model": "uae-large-v1-model",
Step #1 - "explore_tests": "healthcheck_query": "Life expectancy",
Step #1 - "explore_tests": "embeddings_path": "gs://datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "bio_ft": {
Step #1 - "explore_tests": "store_type": "MEMORY",
Step #1 - "explore_tests": "source_path": "/workspace/tools/nl/embeddings/input/bio",
Step #1 - "explore_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests": "healthcheck_query": "Gene",
Step #1 - "explore_tests": "embeddings_path": "gs://datcom-nl-models/bio_ft_2024_11_08_19_00_38/embeddings.csv"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "medium_ft": {
Step #1 - "explore_tests": "store_type": "MEMORY",
Step #1 - "explore_tests": "source_path": null,
Step #1 - "explore_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests": "healthcheck_query": "Life expectancy",
Step #1 - "explore_tests": "embeddings_path": "gs://datcom-nl-models/embeddings_medium_2024_05_09_18_01_32.ft_final_v20230717230459.all-MiniLM-L6-v2.csv"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "base_mistral_mem": {
Step #1 - "explore_tests": "store_type": "MEMORY",
Step #1 - "explore_tests": "source_path": "/workspace/tools/nl/embeddings/input/base",
Step #1 - "explore_tests": "model": "sfr-embedding-mistral-model",
Step #1 - "explore_tests": "healthcheck_query": "Life expectancy",
Step #1 - "explore_tests": "embeddings_path": "gs://datcom-nl-models/base_mistral_mem_2024_07_01_10_23_43/embeddings.csv"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "sdg_ft": {
Step #1 - "explore_tests": "store_type": "MEMORY",
Step #1 - "explore_tests": "source_path": "/workspace/tools/nl/embeddings/input/sdg",
Step #1 - "explore_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests": "healthcheck_query": "Hunger",
Step #1 - "explore_tests": "embeddings_path": "gs://datcom-nl-models/sdg_ft_2024_06_24_23_45_46/embeddings.csv"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "undata_ft": {
Step #1 - "explore_tests": "store_type": "MEMORY",
Step #1 - "explore_tests": "source_path": "/workspace/tools/nl/embeddings/input/undata",
Step #1 - "explore_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests": "healthcheck_query": "Hunger",
Step #1 - "explore_tests": "embeddings_path": "gs://datcom-nl-models/undata_ft_2024_06_24_23_47_04/embeddings.csv"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "undata_ilo_ft": {
Step #1 - "explore_tests": "store_type": "MEMORY",
Step #1 - "explore_tests": "source_path": "/workspace/tools/nl/embeddings/input/undata_ilo",
Step #1 - "explore_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests": "healthcheck_query": "Employment",
Step #1 - "explore_tests": "embeddings_path": "gs://datcom-nl-models/undata_ilo_ft_2024_10_14_13_45_50/embeddings.csv"
Step #1 - "explore_tests": }
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "models": {
Step #1 - "explore_tests": "cross-encoder-ms-marco-miniilm-l6-v2": {
Step #1 - "explore_tests": "type": "VERTEXAI",
Step #1 - "explore_tests": "usage": "RERANKING",
Step #1 - "explore_tests": "score_threshold": null,
Step #1 - "explore_tests": "project_id": "datcom-website-dev",
Step #1 - "explore_tests": "location": "us-central1",
Step #1 - "explore_tests": "prediction_endpoint_id": "3977846152316846080"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "cross-encoder-mxbai-rerank-base-v1": {
Step #1 - "explore_tests": "type": "VERTEXAI",
Step #1 - "explore_tests": "usage": "RERANKING",
Step #1 - "explore_tests": "score_threshold": null,
Step #1 - "explore_tests": "project_id": "datcom-website-dev",
Step #1 - "explore_tests": "location": "us-central1",
Step #1 - "explore_tests": "prediction_endpoint_id": "284894457873039360"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "uae-large-v1-model": {
Step #1 - "explore_tests": "type": "VERTEXAI",
Step #1 - "explore_tests": "usage": "EMBEDDINGS",
Step #1 - "explore_tests": "score_threshold": 0.7,
Step #1 - "explore_tests": "project_id": "datcom-nl",
Step #1 - "explore_tests": "location": "us-central1",
Step #1 - "explore_tests": "prediction_endpoint_id": "8110162693219942400"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "ft-final-v20230717230459-all-MiniLM-L6-v2": {
Step #1 - "explore_tests": "type": "LOCAL",
Step #1 - "explore_tests": "usage": "EMBEDDINGS",
Step #1 - "explore_tests": "score_threshold": 0.5,
Step #1 - "explore_tests": "gcs_folder": "gs://datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2"
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "sfr-embedding-mistral-model": {
Step #1 - "explore_tests": "type": "VERTEXAI",
Step #1 - "explore_tests": "usage": "EMBEDDINGS",
Step #1 - "explore_tests": "score_threshold": 0.5,
Step #1 - "explore_tests": "project_id": "datcom-website-dev",
Step #1 - "explore_tests": "location": "us-central1",
Step #1 - "explore_tests": "prediction_endpoint_id": "224012300019826688"
Step #1 - "explore_tests": }
Step #1 - "explore_tests": },
Step #1 - "explore_tests": "enable_reranking": true
Step #1 - "explore_tests": }
Step #2 - "nl_tests": [19:23:37][INFO ][config_reader.py:120] Loading index and model catalog from: /workspace/nl_server/../deploy/helm_charts/dc_website/nl/catalog.yaml
Step #2 - "nl_tests": [19:23:37][INFO ][config_reader.py:86] server config:
Step #2 - "nl_tests": {
Step #2 - "nl_tests": "version": "1",
Step #2 - "nl_tests": "default_indexes": [
Step #2 - "nl_tests": "base_uae_mem"
Step #2 - "nl_tests": ],
Step #2 - "nl_tests": "indexes": {
Step #2 - "nl_tests": "base_uae_mem": {
Step #2 - "nl_tests": "store_type": "MEMORY",
Step #2 - "nl_tests": "source_path": "/workspace/tools/nl/embeddings/input/base",
Step #2 - "nl_tests": "model": "uae-large-v1-model",
Step #2 - "nl_tests": "healthcheck_query": "Life expectancy",
Step #2 - "nl_tests": "embeddings_path": "gs://datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "bio_ft": {
Step #2 - "nl_tests": "store_type": "MEMORY",
Step #2 - "nl_tests": "source_path": "/workspace/tools/nl/embeddings/input/bio",
Step #2 - "nl_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests": "healthcheck_query": "Gene",
Step #2 - "nl_tests": "embeddings_path": "gs://datcom-nl-models/bio_ft_2024_11_08_19_00_38/embeddings.csv"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "medium_ft": {
Step #2 - "nl_tests": "store_type": "MEMORY",
Step #2 - "nl_tests": "source_path": null,
Step #2 - "nl_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests": "healthcheck_query": "Life expectancy",
Step #2 - "nl_tests": "embeddings_path": "gs://datcom-nl-models/embeddings_medium_2024_05_09_18_01_32.ft_final_v20230717230459.all-MiniLM-L6-v2.csv"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "base_mistral_mem": {
Step #2 - "nl_tests": "store_type": "MEMORY",
Step #2 - "nl_tests": "source_path": "/workspace/tools/nl/embeddings/input/base",
Step #2 - "nl_tests": "model": "sfr-embedding-mistral-model",
Step #2 - "nl_tests": "healthcheck_query": "Life expectancy",
Step #2 - "nl_tests": "embeddings_path": "gs://datcom-nl-models/base_mistral_mem_2024_07_01_10_23_43/embeddings.csv"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "sdg_ft": {
Step #2 - "nl_tests": "store_type": "MEMORY",
Step #2 - "nl_tests": "source_path": "/workspace/tools/nl/embeddings/input/sdg",
Step #2 - "nl_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests": "healthcheck_query": "Hunger",
Step #2 - "nl_tests": "embeddings_path": "gs://datcom-nl-models/sdg_ft_2024_06_24_23_45_46/embeddings.csv"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "undata_ft": {
Step #2 - "nl_tests": "store_type": "MEMORY",
Step #2 - "nl_tests": "source_path": "/workspace/tools/nl/embeddings/input/undata",
Step #2 - "nl_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests": "healthcheck_query": "Hunger",
Step #2 - "nl_tests": "embeddings_path": "gs://datcom-nl-models/undata_ft_2024_06_24_23_47_04/embeddings.csv"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "undata_ilo_ft": {
Step #2 - "nl_tests": "store_type": "MEMORY",
Step #2 - "nl_tests": "source_path": "/workspace/tools/nl/embeddings/input/undata_ilo",
Step #2 - "nl_tests": "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests": "healthcheck_query": "Employment",
Step #2 - "nl_tests": "embeddings_path": "gs://datcom-nl-models/undata_ilo_ft_2024_10_14_13_45_50/embeddings.csv"
Step #2 - "nl_tests": }
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "models": {
Step #2 - "nl_tests": "cross-encoder-ms-marco-miniilm-l6-v2": {
Step #2 - "nl_tests": "type": "VERTEXAI",
Step #2 - "nl_tests": "usage": "RERANKING",
Step #2 - "nl_tests": "score_threshold": null,
Step #2 - "nl_tests": "project_id": "datcom-website-dev",
Step #2 - "nl_tests": "location": "us-central1",
Step #2 - "nl_tests": "prediction_endpoint_id": "3977846152316846080"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "cross-encoder-mxbai-rerank-base-v1": {
Step #2 - "nl_tests": "type": "VERTEXAI",
Step #2 - "nl_tests": "usage": "RERANKING",
Step #2 - "nl_tests": "score_threshold": null,
Step #2 - "nl_tests": "project_id": "datcom-website-dev",
Step #2 - "nl_tests": "location": "us-central1",
Step #2 - "nl_tests": "prediction_endpoint_id": "284894457873039360"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "uae-large-v1-model": {
Step #2 - "nl_tests": "type": "VERTEXAI",
Step #2 - "nl_tests": "usage": "EMBEDDINGS",
Step #2 - "nl_tests": "score_threshold": 0.7,
Step #2 - "nl_tests": "project_id": "datcom-nl",
Step #2 - "nl_tests": "location": "us-central1",
Step #2 - "nl_tests": "prediction_endpoint_id": "8110162693219942400"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "ft-final-v20230717230459-all-MiniLM-L6-v2": {
Step #2 - "nl_tests": "type": "LOCAL",
Step #2 - "nl_tests": "usage": "EMBEDDINGS",
Step #2 - "nl_tests": "score_threshold": 0.5,
Step #2 - "nl_tests": "gcs_folder": "gs://datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2"
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "sfr-embedding-mistral-model": {
Step #2 - "nl_tests": "type": "VERTEXAI",
Step #2 - "nl_tests": "usage": "EMBEDDINGS",
Step #2 - "nl_tests": "score_threshold": 0.5,
Step #2 - "nl_tests": "project_id": "datcom-website-dev",
Step #2 - "nl_tests": "location": "us-central1",
Step #2 - "nl_tests": "prediction_endpoint_id": "224012300019826688"
Step #2 - "nl_tests": }
Step #2 - "nl_tests": },
Step #2 - "nl_tests": "enable_reranking": true
Step #2 - "nl_tests": }
Step #1 - "explore_tests": [19:23:39][INFO ][gcs.py:50] Download datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2 to /tmp/datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2
Step #2 - "nl_tests": [19:23:40][INFO ][gcs.py:50] Download datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2 to /tmp/datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2
Step #1 - "explore_tests": [19:23:43][INFO ][SentenceTransformer.py:113] Load pretrained SentenceTransformer: /tmp/datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2
Step #2 - "nl_tests": [19:23:43][INFO ][SentenceTransformer.py:113] Load pretrained SentenceTransformer: /tmp/datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2
Step #1 - "explore_tests": [19:23:43][INFO ][SentenceTransformer.py:219] Use pytorch device_name: cpu
Step #1 - "explore_tests": [19:23:43][INFO ][gcs.py:50] Download datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv to /tmp/datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv
Step #2 - "nl_tests": [19:23:44][INFO ][SentenceTransformer.py:219] Use pytorch device_name: cpu
Step #2 - "nl_tests": [19:23:44][INFO ][gcs.py:50] Download datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv to /tmp/datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv
Step #1 - "explore_tests": [19:23:46][INFO ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv
Step #2 - "nl_tests": [19:23:47][INFO ][util.py:607] ['http://localhost:6070/healthz'] not ready, waiting for 5 seconds
Step #1 - "explore_tests":
Generating train split: 0 examples [00:00, ? examples/s][19:23:47][INFO ][util.py:607] ['http://localhost:6070/healthz'] not ready, waiting for 5 seconds
Step #2 - "nl_tests": [19:23:47][INFO ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv
Step #2 - "nl_tests": ============================= test session starts ==============================
Step #2 - "nl_tests": platform linux -- Python 3.11.3, pytest-9.0.1, pluggy-1.6.0 -- /workspace/.env/bin/python3
Step #2 - "nl_tests": cachedir: .pytest_cache
Step #2 - "nl_tests": rootdir: /workspace
Step #2 - "nl_tests": configfile: pytest.ini
Step #2 - "nl_tests": plugins: rerunfailures-10.2, flakefinder-1.1.0, anyio-4.11.0, xdist-3.2.1
Step #2 - "nl_tests": gw0 I / gw1 I / gw2 I / gw3 I / gw4 I / gw5 I / gw6 I / gw7 I / gw8 I / gw9 I / gw10 I / gw11 I / gw12 I / gw13 I / gw14 I / gw15 I / gw16 I / gw17 I / gw18 I / gw19 I / gw20 I / gw21 I / gw22 I / gw23 I / gw24 I / gw25 I / gw26 I / gw27 I / gw28 I / gw29 I / gw30 I / gw31 I
Step #2 - "nl_tests":
[gw0] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw1] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw2] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw3] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw4] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw5] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw6] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw7] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw8] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw9] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw10] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw11] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw12] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw13] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw14] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw15] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw16] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw17] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw18] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw19] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw20] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw21] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw22] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw23] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw24] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw25] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw26] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw27] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw28] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw29] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw30] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw31] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests":
[gw0] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw1] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw2] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw4] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw3] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw7] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw5] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw6] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw8] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw9] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw10] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw11] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw12] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw13] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw15] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw14] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw18] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw16] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw17] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw19] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw20] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw21] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw23] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw24] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw22] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw25] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw26] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw27] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw29] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw28] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw31] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests":
[gw30] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": gw0 [14] / gw1 [14] / gw2 [14] / gw3 [14] / gw4 [14] / gw5 [14] / gw6 [14] / gw7 [14] / gw8 [14] / gw9 [14] / gw10 [14] / gw11 [14] / gw12 [14] / gw13 [14] / gw14 [14] / gw15 [14] / gw16 [14] / gw17 [14] / gw18 [14] / gw19 [14] / gw20 [14] / gw21 [14] / gw22 [14] / gw23 [14] / gw24 [14] / gw25 [14] / gw26 [14] / gw27 [14] / gw28 [14] / gw29 [14] / gw30 [14] / gw31 [14]
Step #2 - "nl_tests":
Step #2 - "nl_tests": scheduling tests via LoadScheduling
Step #2 - "nl_tests":
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_strict_default_place
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_place_detection_e2e_dc
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_demo_fallback
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_international
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_textbox_sample
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_demo_climatetrace
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_sdg
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_demo_usa_map_types
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_strict_low_confidence
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_translate
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_strict_multi_verb
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_demo_cities_feb2023
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_demo_multisv
Step #1 - "explore_tests":
Generating train split: 7305 examples [00:03, 1949.62 examples/s]
Generating train split: 7305 examples [00:03, 1947.65 examples/s]
Step
...
[Logs truncated due to log size limitations. For full logs, see https://console.cloud.google.com/cloud-build/builds/4c4de92e-6d01-4cdb-9f70-76025672012c?project=879489846695.]
...
/undata_ft_2024_06_24_23_47_04/embeddings.csv
Step #1 - "explore_tests": [19:24:25][INFO ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/undata_ft_2024_06_24_23_47_04/embeddings.csv
Step #2 - "nl_tests": [19:24:25][INFO ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/undata_ilo_ft_2024_10_14_13_45_50/embeddings.csv
Step #1 - "explore_tests": [19:24:25][INFO ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/undata_ilo_ft_2024_10_14_13_45_50/embeddings.csv
Step #2 - "nl_tests": [19:24:25][INFO ][flask.py:79] NL Server Flask app initialized
Step #2 - "nl_tests": [19:24:25][INFO ][nl_app.py:27] Run nl server in local mode (host=localhost), port=6070
Step #2 - "nl_tests": [19:24:25][WARNING ][_internal.py:97] * Debugger is active!
Step #1 - "explore_tests": [19:24:25][INFO ][flask.py:79] NL Server Flask app initialized
Step #1 - "explore_tests": [19:24:25][INFO ][nl_app.py:27] Run nl server in local mode (host=localhost), port=6070
Step #1 - "explore_tests": [19:24:25][WARNING ][_internal.py:97] * Debugger is active!
Step #1 - "explore_tests": [19:24:26][INFO ][util.py:593] http://localhost:6070/healthz is up running
Step #2 - "nl_tests": [19:24:26][INFO ][util.py:593] http://localhost:6070/healthz is up running
Step #1 - "explore_tests": [19:24:26][INFO ][gcs.py:50] Download datcom-website-config/nl_bad_words.txt to /tmp/datcom-website-config/nl_bad_words.txt
Step #2 - "nl_tests": [19:24:26][INFO ][gcs.py:50] Download datcom-website-config/nl_bad_words.txt to /tmp/datcom-website-config/nl_bad_words.txt
Step #1 - "explore_tests": [19:24:27][INFO ][util.py:593] https://staging.api.datacommons.org/version is up running
Step #1 - "explore_tests": [19:24:27][INFO ][web_app.py:27] Run web server in local mode
Step #1 - "explore_tests": * Serving Flask app 'server.__init__'
Step #1 - "explore_tests": * Debug mode: on
Step #2 - "nl_tests": [19:24:27][INFO ][util.py:593] https://staging.api.datacommons.org/version is up running
Step #2 - "nl_tests": [19:24:27][INFO ][web_app.py:27] Run web server in local mode
Step #2 - "nl_tests": * Serving Flask app 'server.__init__'
Step #2 - "nl_tests": * Debug mode: on
Step #1 - "explore_tests": [19:24:37][INFO ][util.py:593] http://localhost:6070/healthz is up running
Step #2 - "nl_tests": [19:24:37][INFO ][util.py:593] http://localhost:6070/healthz is up running
Step #2 - "nl_tests": [19:24:38][INFO ][util.py:593] https://staging.api.datacommons.org/version is up running
Step #2 - "nl_tests": [19:24:38][INFO ][web_app.py:27] Run web server in local mode
Step #2 - "nl_tests": [19:24:38][WARNING ][_internal.py:97] * Debugger is active!
Step #1 - "explore_tests": [19:24:38][INFO ][util.py:593] https://staging.api.datacommons.org/version is up running
Step #1 - "explore_tests": [19:24:38][INFO ][web_app.py:27] Run web server in local mode
Step #1 - "explore_tests": [19:24:38][WARNING ][_internal.py:97] * Debugger is active!
Step #2 - "nl_tests":
Step #1 - "explore_tests":
Step #1 - "explore_tests": [gw26] [ 2%] SKIPPED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_answer_places
Step #1 - "explore_tests": [gw0] [ 4%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_sdg
Step #1 - "explore_tests": [gw4] [ 6%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_uae
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_us_demo
Step #1 - "explore_tests": [gw9] [ 8%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_bugs
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_fallbacks
Step #1 - "explore_tests": [gw7] [ 10%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_undata
Step #1 - "explore_tests": [gw6] [ 12%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_undata_ilo
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_filter_query_disabled
Step #2 - "nl_tests": [gw11] [ 7%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_strict_low_confidence
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_date_range
Step #1 - "explore_tests": [gw1] [ 14%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_bio
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_superlatives
Step #1 - "explore_tests": [gw2] [ 16%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_sdg
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_translate
Step #1 - "explore_tests": [gw12] [ 18%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_sdg
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_default_place
Step #1 - "explore_tests": [gw5] [ 20%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_undata_dev
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_correlation_simple_place
Step #1 - "explore_tests": [gw3] [ 22%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_sfr
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_undata
Step #1 - "explore_tests": [gw24] [ 24%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_sdg_specialvars
Step #1 - "explore_tests": [gw20] [ 26%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_sdg
Step #1 - "explore_tests": [gw25] [ 28%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_undata
Step #1 - "explore_tests": [gw21] [ 30%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_sdg_global
Step #1 - "explore_tests": [gw22] [ 32%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_sdg_global_specialvars
Step #1 - "explore_tests": [gw15] [ 34%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_basic
Step #2 - "nl_tests": [gw10] [ 14%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_strict_default_place
Step #2 - "nl_tests": [gw12] [ 21%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_strict_multi_verb
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_toolformer_rag_mode
Step #1 - "explore_tests": [gw14] [ 36%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_statvars
Step #2 - "nl_tests": [19:24:45][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #2 - "nl_tests": [19:24:45][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #2 - "nl_tests": [gw13] [ 28%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_translate
Step #2 - "nl_tests": [gw7] [ 35%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_textbox_sample
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_multi_sv
Step #1 - "explore_tests": [gw8] [ 38%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_bio
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_default_place
Step #1 - "explore_tests": [gw14] [ 40%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_multi_sv
Step #1 - "explore_tests": [gw18] [ 42%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_explore_more
Step #1 - "explore_tests": [gw19] [ 44%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_nl_size
Step #2 - "nl_tests": [19:24:49][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": [gw12] [ 46%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_default_place
Step #1 - "explore_tests": [gw13] [ 48%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_translate
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_multi_verb
Step #1 - "explore_tests": [gw27] [ 50%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_correlation_bugs
Step #2 - "nl_tests": [gw1] [ 42%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_climatetrace
Step #1 - "explore_tests": [gw23] [ 52%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_statvars
Step #1 - "explore_tests": [gw17] [ 54%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_correlation
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_triple
Step #1 - "explore_tests": [gw0] [ 56%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_sdg
Step #1 - "explore_tests": [gw16] [ 58%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_comparison
Step #2 - "nl_tests": [gw9] [ 50%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_sdg
Step #1 - "explore_tests": [19:24:52][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:24:52][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": [19:24:52][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:24:52][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": [19:24:52][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:24:52][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": [19:24:52][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:24:52][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": [19:24:53][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_toolformer_rig_mode
Step #1 - "explore_tests": [gw13] [ 60%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_multi_verb
Step #1 - "explore_tests": [gw7] [ 62%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_filter_query_disabled
Step #2 - "nl_tests": [gw5] [ 57%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_demo_usa_map_types
Step #1 - "explore_tests": [19:24:55][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": [19:24:55][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": [gw3] [ 64%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_undata
Step #1 - "explore_tests": [gw2] [ 66%] RERUN server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_translate
Step #1 - "explore_tests": [19:24:56][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_translate
Step #1 - "explore_tests": [gw10] [ 68%] RERUN server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context
Step #2 - "nl_tests": [gw3] [ 64%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_multisv
Step #1 - "explore_tests": [19:24:56][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:24:56][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context
Step #1 - "explore_tests": [gw11] [ 70%] RERUN server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar
Step #1 - "explore_tests": [19:24:57][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:24:57][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #2 - "nl_tests": [gw8] [ 71%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_place_detection_e2e_dc
Step #1 - "explore_tests": [19:25:00][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar
Step #1 - "explore_tests": [gw5] [ 72%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_correlation_simple_place
Step #1 - "explore_tests": [gw10] [ 72%] RERUN server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context
Step #1 - "explore_tests": [19:25:01][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:25:01][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context
Step #2 - "nl_tests": [gw6] [ 78%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_international
Step #1 - "explore_tests": [gw2] [ 72%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_translate
Step #2 - "nl_tests": [gw2] [ 85%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_fallback
Step #1 - "explore_tests": [gw1] [ 74%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_superlatives
Step #1 - "explore_tests": [19:25:04][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": [19:25:04][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:25:04][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": [gw9] [ 76%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_fallbacks
Step #1 - "explore_tests": [gw11] [ 76%] RERUN server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar
Step #1 - "explore_tests": [19:25:04][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:25:04][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": [19:25:05][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": [19:25:06][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar
Step #1 - "explore_tests": [gw10] [ 76%] FAILED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context
Step #2 - "nl_tests": [gw0] [ 92%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_cities_feb2023
Step #1 - "explore_tests": [19:25:08][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_high_sv_threshold
Step #1 - "explore_tests": [gw30] [ 78%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_india_demo
Step #1 - "explore_tests": [gw11] [ 78%] FAILED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_single_date
Step #1 - "explore_tests": [gw16] [ 80%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_toolformer_rig_mode
Step #2 - "nl_tests": [gw4] [100%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_feb2023
Step #2 - "nl_tests":
Step #2 - "nl_tests": =============================== warnings summary ===============================
Step #2 - "nl_tests": .env/lib/python3.11/site-packages/flask_babel/__init__.py:183: 32 warnings
Step #2 - "nl_tests": /workspace/.env/lib/python3.11/site-packages/flask_babel/__init__.py:183: DeprecationWarning: 'locked_cached_property' is deprecated and will be removed in Flask 2.4. Use a lock inside the decorated function if locking is needed.
Step #2 - "nl_tests": @locked_cached_property
Step #2 - "nl_tests":
Step #2 - "nl_tests": -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
Step #2 - "nl_tests": ================= 14 passed, 32 warnings in 105.59s (0:01:45) ==================
Finished Step #2 - "nl_tests"
Step #1 - "explore_tests": [19:25:11][INFO ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash
Step #1 - "explore_tests": [19:25:11][INFO ][models.py:4993] AFC is enabled with max remote calls: 10.
Step #1 - "explore_tests": [gw28] [ 82%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_edge_cases
Step #1 - "explore_tests": [gw10] [ 84%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_high_sv_threshold
Step #1 - "explore_tests": [gw31] [ 86%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_electrification_demo
Step #1 - "explore_tests": [gw15] [ 88%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_toolformer_rag_mode
Step #1 - "explore_tests": [19:25:14][INFO ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK"
Step #1 - "explore_tests": [gw4] [ 90%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_us_demo
Step #1 - "explore_tests": [gw29] [ 92%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_edge_cases2
Step #1 - "explore_tests": [gw8] [ 94%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_default_place
Step #1 - "explore_tests": [gw6] [ 96%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_date_range
Step #1 - "explore_tests": [gw17] [ 98%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_triple
Step #1 - "explore_tests": [gw11] [100%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_single_date
Step #1 - "explore_tests":
Step #1 - "explore_tests": =================================== FAILURES ===================================
Step #1 - "explore_tests": _________________ ExploreTestDetection.test_detection_context __________________
Step #1 - "explore_tests": [gw10] linux -- Python 3.11.3 /workspace/.env/bin/python3
Step #1 - "explore_tests":
Step #1 - "explore_tests": self = <workspace.server.integration_tests.explore_test.ExploreTestDetection testMethod=test_detection_context>
Step #1 - "explore_tests":
Step #1 - "explore_tests": def test_detection_context(self):
Step #1 - "explore_tests": > self.run_detection('detection_api_context', [
Step #1 - "explore_tests": 'States with highest PHDs', 'Commute in tracts of California',
Step #1 - "explore_tests": 'Compare with Nevada', 'Correlate with asthma',
Step #1 - "explore_tests": 'countries with greenhouse gas emissions',
Step #1 - "explore_tests": 'median income in Santa Clara county and Alameda county'
Step #1 - "explore_tests": ])
Step #1 - "explore_tests":
Step #1 - "explore_tests": server/integration_tests/explore_test.py:343:
Step #1 - "explore_tests": _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Step #1 - "explore_tests": server/integration_tests/explore_test.py:70: in run_detection
Step #1 - "explore_tests": self.handle_response(q, resp, test_dir, d, failure, check_detection)
Step #1 - "explore_tests": server/integration_tests/explore_test.py:197: in handle_response
Step #1 - "explore_tests": self.assertEqual(a, b)
Step #1 - "explore_tests": E AssertionError: '{\n [63 chars] "quantity": {\n "idx": 0,\n [1124 chars]]\n}' != '{\n [63 chars] "ranking_type": [\n 1\n ],\n [946 chars]]\n}'
Step #1 - "explore_tests": E {
Step #1 - "explore_tests": E "childEntityType": "State",
Step #1 - "explore_tests": E "classifications": [
Step #1 - "explore_tests": E - {
Step #1 - "explore_tests": E - "quantity": {
Step #1 - "explore_tests": E - "idx": 0,
Step #1 - "explore_tests": E - "qval": {
Step #1 - "explore_tests": E - "cmp": "GE",
Step #1 - "explore_tests": E - "val": 2.2250738585072014e-308
Step #1 - "explore_tests": E - }
Step #1 - "explore_tests": E - },
Step #1 - "explore_tests": E - "type": 3
Step #1 - "explore_tests": E - },
Step #1 - "explore_tests": E {
Step #1 - "explore_tests": E "ranking_type": [
Step #1 - "explore_tests": E 1
Step #1 - "explore_tests": E ],
Step #1 - "explore_tests": E "type": 2
Step #1 - "explore_tests": E },
Step #1 - "explore_tests": E {
Step #1 - "explore_tests": E "contained_in_place_type": "State",
Step #1 - "explore_tests": E "had_default_type": false,
Step #1 - "explore_tests": E "type": 4
Step #1 - "explore_tests": E }
Step #1 - "explore_tests": E ],
Step #1 - "explore_tests": E "client": "test_detect",
Step #1 - "explore_tests": E "comparisonEntities": [],
Step #1 - "explore_tests": E "comparisonVariables": [],
Step #1 - "explore_tests": E "context": {},
Step #1 - "explore_tests": E "debug": {},
Step #1 - "explore_tests": E "entities": [
Step #1 - "explore_tests": E "country/USA"
Step #1 - "explore_tests": E ],
Step #1 - "explore_tests": E "nonPlaceEntities": [],
Step #1 - "explore_tests": E "properties": [],
Step #1 - "explore_tests": E "sessionId": "007_999999999",
Step #1 - "explore_tests": E "variables": [
Step #1 - "explore_tests": E "Count_Person_EducationalAttainmentDoctorateDegree",
Step #1 - "explore_tests": E "Count_Person_25OrMoreYears_EducationalAttainmentDoctorateDegree_Female",
Step #1 - "explore_tests": E "Count_Person_25OrMoreYears_EducationalAttainmentDoctorateDegree_Male",
Step #1 - "explore_tests": E "Count_Person_25OrMoreYears_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears",
Step #1 - "explore_tests": E "Count_Person_25OrMoreYears_Female_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears_Female",
Step #1 - "explore_tests": E "Count_Person_25OrMoreYears_Male_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears_Male",
Step #1 - "explore_tests": E "dc/topic/StudentsInCollege"
Step #1 - "explore_tests": E ]
Step #1 - "explore_tests": E }
Step #1 - "explore_tests": _________________ ExploreTestDetection.test_detection_multivar _________________
Step #1 - "explore_tests": [gw11] linux -- Python 3.11.3 /workspace/.env/bin/python3
Step #1 - "explore_tests":
Step #1 - "explore_tests": self = <workspace.server.integration_tests.explore_test.ExploreTestDetection testMethod=test_detection_multivar>
Step #1 - "explore_tests":
Step #1 - "explore_tests": def test_detection_multivar(self):
Step #1 - "explore_tests": > self.run_detection('detection_api_multivar', [
Step #1 - "explore_tests": 'number of poor hispanic women with phd',
Step #1 - "explore_tests": 'compare obesity vs. poverty',
Step #1 - "explore_tests": 'show me the impact of climate change on drought',
Step #1 - "explore_tests": 'how are factors like obesity, blood pressure and asthma impacted by climate change',
Step #1 - "explore_tests": 'Compare "Male population" with "Female Population"',
Step #1 - "explore_tests": ],
Step #1 - "explore_tests": check_detection=True)
Step #1 - "explore_tests":
Step #1 - "explore_tests": server/integration_tests/explore_test.py:333:
Step #1 - "explore_tests": _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Step #1 - "explore_tests": server/integration_tests/explore_test.py:70: in run_detection
Step #1 - "explore_tests": self.handle_response(q, resp, test_dir, d, failure, check_detection)
Step #1 - "explore_tests": server/integration_tests/explore_test.py:226: in handle_response
Step #1 - "explore_tests": self._check_multivars(dbg["sv_matching"], expected["sv_matching"])
Step #1 - "explore_tests": _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Step #1 - "explore_tests":
Step #1 - "explore_tests": self = <workspace.server.integration_tests.explore_test.ExploreTestDetection testMethod=test_detection_multivar>
Step #1 - "explore_tests": got = {'CosineScore': [], 'MultiSV': {}, 'Query': 'number of poor hispanic women with phd', 'SV': [], ...}
Step #1 - "explore_tests": want = {'CosineScore': [], 'MultiSV': {}, 'Query': 'number of poor hispanic women with phd', 'SV': []}
Step #1 - "explore_tests":
Step #1 - "explore_tests": def _check_multivars(self, got, want):
Step #1 - "explore_tests": > self.assertEqual(got['SV'][0], want['SV'][0])
Step #1 - "explore_tests": ^^^^^^^^^^^^
Step #1 - "explore_tests": E IndexError: list index out of range
Step #1 - "explore_tests":
Step #1 - "explore_tests": server/integration_tests/explore_test.py:229: IndexError
Step #1 - "explore_tests": =============================== warnings summary ===============================
Step #1 - "explore_tests": .env/lib/python3.11/site-packages/flask_babel/__init__.py:183: 32 warnings
Step #1 - "explore_tests": /workspace/.env/lib/python3.11/site-packages/flask_babel/__init__.py:183: DeprecationWarning: 'locked_cached_property' is deprecated and will be removed in Flask 2.4. Use a lock inside the decorated function if locking is needed.
Step #1 - "explore_tests": @locked_cached_property
Step #1 - "explore_tests":
Step #1 - "explore_tests": -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
Step #1 - "explore_tests": =========================== short test summary info ============================
Step #1 - "explore_tests": FAILED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context - AssertionError: '{\n [63 chars] "quantity": {\n "idx": 0,\n [1124 chars]]\n}' != '{\n [63 chars] "ranking_type": [\n 1\n ],\n [946 chars]]\n}'
Step #1 - "explore_tests": {
Step #1 - "explore_tests": "childEntityType": "State",
Step #1 - "explore_tests": "classifications": [
Step #1 - "explore_tests": - {
Step #1 - "explore_tests": - "quantity": {
Step #1 - "explore_tests": - "idx": 0,
Step #1 - "explore_tests": - "qval": {
Step #1 - "explore_tests": - "cmp": "GE",
Step #1 - "explore_tests": - "val": 2.2250738585072014e-308
Step #1 - "explore_tests": - }
Step #1 - "explore_tests": - },
Step #1 - "explore_tests": - "type": 3
Step #1 - "explore_tests": - },
Step #1 - "explore_tests": {
Step #1 - "explore_tests": "ranking_type": [
Step #1 - "explore_tests": 1
Step #1 - "explore_tests": ],
Step #1 - "explore_tests": "type": 2
Step #1 - "explore_tests": },
Step #1 - "explore_tests": {
Step #1 - "explore_tests": "contained_in_place_type": "State",
Step #1 - "explore_tests": "had_default_type": false,
Step #1 - "explore_tests": "type": 4
Step #1 - "explore_tests": }
Step #1 - "explore_tests": ],
Step #1 - "explore_tests": "client": "test_detect",
Step #1 - "explore_tests": "comparisonEntities": [],
Step #1 - "explore_tests": "comparisonVariables": [],
Step #1 - "explore_tests": "context": {},
Step #1 - "explore_tests": "debug": {},
Step #1 - "explore_tests": "entities": [
Step #1 - "explore_tests": "country/USA"
Step #1 - "explore_tests": ],
Step #1 - "explore_tests": "nonPlaceEntities": [],
Step #1 - "explore_tests": "properties": [],
Step #1 - "explore_tests": "sessionId": "007_999999999",
Step #1 - "explore_tests": "variables": [
Step #1 - "explore_tests": "Count_Person_EducationalAttainmentDoctorateDegree",
Step #1 - "explore_tests": "Count_Person_25OrMoreYears_EducationalAttainmentDoctorateDegree_Female",
Step #1 - "explore_tests": "Count_Person_25OrMoreYears_EducationalAttainmentDoctorateDegree_Male",
Step #1 - "explore_tests": "Count_Person_25OrMoreYears_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears",
Step #1 - "explore_tests": "Count_Person_25OrMoreYears_Female_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears_Female",
Step #1 - "explore_tests": "Count_Person_25OrMoreYears_Male_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears_Male",
Step #1 - "explore_tests": "dc/topic/StudentsInCollege"
Step #1 - "explore_tests": ]
Step #1 - "explore_tests": }
Step #1 - "explore_tests": FAILED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar - IndexError: list index out of range
Step #1 - "explore_tests": == 2 failed, 47 passed, 1 skipped, 32 warnings, 5 rerun in 121.12s (0:02:01) ===
Finished Step #1 - "explore_tests"
ERROR
ERROR: build step 1 "python:3.11.3" failed: step exited with non-zero status: 1
Loading