Automated Integration Test Goldens Update from CI #5751

Summary

Build Information

Trigger	website-pull-request-nl
Build	4c4de92e-6d01-4cdb-9f70-76025672012c
Start	2025-11-27T11:19:46-08:00
Duration	5m51.58s
Status	FAILURE

Steps

Step	Status	Duration
setup_python	SUCCESS	3m22.402s
explore_tests	FAILURE	2m6.033s
nl_tests	SUCCESS	1m50.579s
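
For triage, the step table above can be read programmatically. The following is a minimal sketch, assuming the rows are tab-separated exactly as shown; `STEP_ROWS`, `parse_duration`, and `failed_steps` are hypothetical names used for illustration and are not part of the Cloud Build tooling.

```python
import re

# Step rows copied from the table above (tab-separated: name, status, duration).
STEP_ROWS = """setup_python\tSUCCESS\t3m22.402s
explore_tests\tFAILURE\t2m6.033s
nl_tests\tSUCCESS\t1m50.579s"""

# Matches durations such as "3m22.402s" or "51.58s" (the minutes part is optional).
_DURATION = re.compile(r"^(?:(\d+)m)?(\d+(?:\.\d+)?)s$")


def parse_duration(text: str) -> float:
    """Convert a duration such as '3m22.402s' into seconds."""
    match = _DURATION.match(text)
    if match is None:
        raise ValueError(f"unrecognized duration: {text!r}")
    minutes = int(match.group(1) or 0)
    return minutes * 60 + float(match.group(2))


def failed_steps(rows: str) -> list[str]:
    """Return the names of steps whose status is not SUCCESS."""
    failed = []
    for line in rows.splitlines():
        name, status, _duration = line.split("\t")
        if status != "SUCCESS":
            failed.append(name)
    return failed


if __name__ == "__main__":
    print("failed steps:", failed_steps(STEP_ROWS))
    for row in STEP_ROWS.splitlines():
        name, _status, duration = row.split("\t")
        print(f"{name}: {parse_duration(duration):.3f}s")
```

Applied to this build, the sketch singles out `explore_tests` as the only step to inspect in the log below.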

starting build "4c4de92e-6d01-4cdb-9f70-76025672012c"

FETCHSOURCE
From https://github.com/datacommonsorg/website
 * branch            5380acf33d2c61b45a73d2c3a81ca5893d16bca5 -> FETCH_HEAD
Updating files: 100% (2448/2448), done.
HEAD is now at 5380acf feat: Update goldens from Cloud Build workflow (build a690214d-bdb0-4633-b54d-3b4fa9167e69)
GitCommit:
5380acf33d2c61b45a73d2c3a81ca5893d16bca5
BUILD
Starting Step #0 - "setup_python"
Step #0 - "setup_python": Pulling image: python:3.11.3
Step #0 - "setup_python": 3.11.3: Pulling from library/python
Step #0 - "setup_python": bd73737482dd: Pulling fs layer
Step #0 - "setup_python": 6710592d62aa: Pulling fs layer
Step #0 - "setup_python": 75256935197e: Pulling fs layer
Step #0 - "setup_python": c1e5026c6457: Pulling fs layer
Step #0 - "setup_python": f0016544b8b9: Pulling fs layer
Step #0 - "setup_python": 1d58eee51ff2: Pulling fs layer
Step #0 - "setup_python": 93dc7b704cd1: Pulling fs layer
Step #0 - "setup_python": caefdefa531e: Pulling fs layer
Step #0 - "setup_python": 93dc7b704cd1: Waiting
Step #0 - "setup_python": caefdefa531e: Waiting
Step #0 - "setup_python": f0016544b8b9: Verifying Checksum
Step #0 - "setup_python": f0016544b8b9: Download complete
Step #0 - "setup_python": 6710592d62aa: Verifying Checksum
Step #0 - "setup_python": 6710592d62aa: Download complete
Step #0 - "setup_python": 1d58eee51ff2: Verifying Checksum
Step #0 - "setup_python": 1d58eee51ff2: Download complete
Step #0 - "setup_python": bd73737482dd: Verifying Checksum
Step #0 - "setup_python": bd73737482dd: Download complete
Step #0 - "setup_python": 93dc7b704cd1: Verifying Checksum
Step #0 - "setup_python": 93dc7b704cd1: Download complete
Step #0 - "setup_python": 75256935197e: Verifying Checksum
Step #0 - "setup_python": 75256935197e: Download complete
Step #0 - "setup_python": caefdefa531e: Verifying Checksum
Step #0 - "setup_python": caefdefa531e: Download complete
Step #0 - "setup_python": c1e5026c6457: Verifying Checksum
Step #0 - "setup_python": c1e5026c6457: Download complete
Step #0 - "setup_python": bd73737482dd: Pull complete
Step #0 - "setup_python": 6710592d62aa: Pull complete
Step #0 - "setup_python": 75256935197e: Pull complete
Step #0 - "setup_python": c1e5026c6457: Pull complete
Step #0 - "setup_python": f0016544b8b9: Pull complete
Step #0 - "setup_python": 1d58eee51ff2: Pull complete
Step #0 - "setup_python": 93dc7b704cd1: Pull complete
Step #0 - "setup_python": caefdefa531e: Pull complete
Step #0 - "setup_python": Digest: sha256:3a619e3c96fd4c5fc5e1998fd4dcb1f1403eb90c4c6409c70d7e80b9468df7df
Step #0 - "setup_python": Status: Downloaded newer image for python:3.11.3
Step #0 - "setup_python": docker.io/library/python:3.11.3
Step #0 - "setup_python": --setup_python ### Set up python environment
Step #0 - "setup_python": installing server/requirements.txt
Step #0 - "setup_python":   DEPRECATION: langdetect is being installed using the legacy 'setup.py install' method, because it does not have a 'pyproject.toml' and the 'wheel' package is not installed. pip 23.1 will enforce this behaviour change. A possible replacement is to enable the '--use-pep517' option. Discussion can be found at https://github.com/pypa/pip/issues/8559
Step #0 - "setup_python":   DEPRECATION: data-gemma is being installed using the legacy 'setup.py install' method, because it does not have a 'pyproject.toml' and the 'wheel' package is not installed. pip 23.1 will enforce this behaviour change. A possible replacement is to enable the '--use-pep517' option. Discussion can be found at https://github.com/pypa/pip/issues/8559
Step #0 - "setup_python":   DEPRECATION: flask_testing is being installed using the legacy 'setup.py install' method, because it does not have a 'pyproject.toml' and the 'wheel' package is not installed. pip 23.1 will enforce this behaviour change. A possible replacement is to enable the '--use-pep517' option. Discussion can be found at https://github.com/pypa/pip/issues/8559
Step #0 - "setup_python": 
Step #0 - "setup_python": [notice] A new release of pip available: 22.3.1 -> 25.3
Step #0 - "setup_python": [notice] To update, run: pip install --upgrade pip
Step #0 - "setup_python": Looking in indexes: https://pypi.org/simple, https://download.pytorch.org/whl/cpu
Step #0 - "setup_python": Collecting torch==2.2.2
Step #0 - "setup_python":   Downloading https://download.pytorch.org/whl/cpu/torch-2.2.2%2Bcpu-cp311-cp311-linux_x86_64.whl (186.8 MB)
Step #0 - "setup_python":      ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 186.8/186.8 MB 11.0 MB/s eta 0:00:00
Step #0 - "setup_python": Collecting filelock
Step #0 - "setup_python":   Downloading filelock-3.20.0-py3-none-any.whl (16 kB)
Step #0 - "setup_python": Requirement already satisfied: typing-extensions>=4.8.0 in ./.env/lib/python3.11/site-packages (from torch==2.2.2) (4.12.2)
Step #0 - "setup_python": Collecting sympy
Step #0 - "setup_python":   Obtaining dependency information for sympy from https://download.pytorch.org/whl/sympy-1.14.0-py3-none-any.whl.metadata
Step #0 - "setup_python":   Downloading https://download.pytorch.org/whl/sympy-1.14.0-py3-none-any.whl.metadata (12 kB)
Step #0 - "setup_python": Collecting networkx
Step #0 - "setup_python":   Downloading networkx-3.6-py3-none-any.whl (2.1 MB)
Step #0 - "setup_python":      ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.1/2.1 MB 16.6 MB/s eta 0:00:00
Step #0 - "setup_python": Requirement already satisfied: jinja2 in ./.env/lib/python3.11/site-packages (from torch==2.2.2) (3.1.6)
Step #0 - "setup_python": Collecting fsspec
Step #0 - "setup_python":   Downloading fsspec-2025.10.0-py3-none-any.whl (200 kB)
Step #0 - "setup_python":      ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.0/201.0 kB 21.3 MB/s eta 0:00:00
Step #0 - "setup_python": Requirement already satisfied: MarkupSafe>=2.0 in ./.env/lib/python3.11/site-packages (from jinja2->torch==2.2.2) (2.1.2)
Step #0 - "setup_python": Collecting mpmath<1.4,>=1.1.0
Step #0 - "setup_python":   Downloading https://download.pytorch.org/whl/mpmath-1.3.0-py3-none-any.whl (536 kB)
Step #0 - "setup_python":      ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 64.1 MB/s eta 0:00:00
Step #0 - "setup_python": Downloading https://download.pytorch.org/whl/sympy-1.14.0-py3-none-any.whl (6.3 MB)
Step #0 - "setup_python":    ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.3/6.3 MB 121.0 MB/s eta 0:00:00
Step #0 - "setup_python": Using cached https://download.pytorch.org/whl/sympy-1.14.0-py3-none-any.whl (6.3 MB)
Step #0 - "setup_python": Installing collected packages: mpmath, sympy, networkx, fsspec, filelock, torch
Step #0 - "setup_python": Successfully installed filelock-3.20.0 fsspec-2025.10.0 mpmath-1.3.0 networkx-3.6 sympy-1.14.0 torch-2.2.2+cpu
Step #0 - "setup_python": 
Step #0 - "setup_python": [notice] A new release of pip available: 22.3.1 -> 25.3
Step #0 - "setup_python": [notice] To update, run: pip install --upgrade pip
Step #0 - "setup_python": installing nl_server/requirements.txt
Step #0 - "setup_python": 
Step #0 - "setup_python": [notice] A new release of pip available: 22.3.1 -> 25.3
Step #0 - "setup_python": [notice] To update, run: pip install --upgrade pip
Finished Step #0 - "setup_python"
Starting Step #1 - "explore_tests"
Starting Step #2 - "nl_tests"
Step #1 - "explore_tests": Already have image (with digest): python:3.11.3
Step #2 - "nl_tests": Already have image (with digest): python:3.11.3
Step #1 - "explore_tests": --explore ### Running explore page integration tests
Step #1 - "explore_tests": Using ENV_PREFIX=Staging
Step #1 - "explore_tests": Starting servers using run_servers.sh...
Step #1 - "explore_tests": FLASK_ENV=integration_test
Step #1 - "explore_tests": GOOGLE_CLOUD_PROJECT=datcom-website-staging
Step #1 - "explore_tests": ENABLE_MODEL=true
Step #1 - "explore_tests": Starting NL Server...
Step #1 - "explore_tests": Starting Website server...
Step #2 - "nl_tests": --nl ### Running nl page integration tests
Step #2 - "nl_tests": Using ENV_PREFIX=Staging
Step #2 - "nl_tests": Starting servers using run_servers.sh...
Step #2 - "nl_tests": FLASK_ENV=integration_test
Step #2 - "nl_tests": GOOGLE_CLOUD_PROJECT=datcom-website-staging
Step #2 - "nl_tests": ENABLE_MODEL=true
Step #2 - "nl_tests": Starting NL Server...
Step #2 - "nl_tests": Starting Website server...
Step #1 - "explore_tests": [19:23:36][INFO    ][config_reader.py:120] Loading index and model catalog from: /workspace/nl_server/../deploy/helm_charts/dc_website/nl/catalog.yaml 
Step #1 - "explore_tests": [19:23:36][INFO    ][config_reader.py:86] server config:
Step #1 - "explore_tests": {
Step #1 - "explore_tests":   "version": "1",
Step #1 - "explore_tests":   "default_indexes": [
Step #1 - "explore_tests":     "base_uae_mem"
Step #1 - "explore_tests":   ],
Step #1 - "explore_tests":   "indexes": {
Step #1 - "explore_tests":     "base_uae_mem": {
Step #1 - "explore_tests":       "store_type": "MEMORY",
Step #1 - "explore_tests":       "source_path": "/workspace/tools/nl/embeddings/input/base",
Step #1 - "explore_tests":       "model": "uae-large-v1-model",
Step #1 - "explore_tests":       "healthcheck_query": "Life expectancy",
Step #1 - "explore_tests":       "embeddings_path": "gs://datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "bio_ft": {
Step #1 - "explore_tests":       "store_type": "MEMORY",
Step #1 - "explore_tests":       "source_path": "/workspace/tools/nl/embeddings/input/bio",
Step #1 - "explore_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests":       "healthcheck_query": "Gene",
Step #1 - "explore_tests":       "embeddings_path": "gs://datcom-nl-models/bio_ft_2024_11_08_19_00_38/embeddings.csv"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "medium_ft": {
Step #1 - "explore_tests":       "store_type": "MEMORY",
Step #1 - "explore_tests":       "source_path": null,
Step #1 - "explore_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests":       "healthcheck_query": "Life expectancy",
Step #1 - "explore_tests":       "embeddings_path": "gs://datcom-nl-models/embeddings_medium_2024_05_09_18_01_32.ft_final_v20230717230459.all-MiniLM-L6-v2.csv"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "base_mistral_mem": {
Step #1 - "explore_tests":       "store_type": "MEMORY",
Step #1 - "explore_tests":       "source_path": "/workspace/tools/nl/embeddings/input/base",
Step #1 - "explore_tests":       "model": "sfr-embedding-mistral-model",
Step #1 - "explore_tests":       "healthcheck_query": "Life expectancy",
Step #1 - "explore_tests":       "embeddings_path": "gs://datcom-nl-models/base_mistral_mem_2024_07_01_10_23_43/embeddings.csv"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "sdg_ft": {
Step #1 - "explore_tests":       "store_type": "MEMORY",
Step #1 - "explore_tests":       "source_path": "/workspace/tools/nl/embeddings/input/sdg",
Step #1 - "explore_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests":       "healthcheck_query": "Hunger",
Step #1 - "explore_tests":       "embeddings_path": "gs://datcom-nl-models/sdg_ft_2024_06_24_23_45_46/embeddings.csv"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "undata_ft": {
Step #1 - "explore_tests":       "store_type": "MEMORY",
Step #1 - "explore_tests":       "source_path": "/workspace/tools/nl/embeddings/input/undata",
Step #1 - "explore_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests":       "healthcheck_query": "Hunger",
Step #1 - "explore_tests":       "embeddings_path": "gs://datcom-nl-models/undata_ft_2024_06_24_23_47_04/embeddings.csv"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "undata_ilo_ft": {
Step #1 - "explore_tests":       "store_type": "MEMORY",
Step #1 - "explore_tests":       "source_path": "/workspace/tools/nl/embeddings/input/undata_ilo",
Step #1 - "explore_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #1 - "explore_tests":       "healthcheck_query": "Employment",
Step #1 - "explore_tests":       "embeddings_path": "gs://datcom-nl-models/undata_ilo_ft_2024_10_14_13_45_50/embeddings.csv"
Step #1 - "explore_tests":     }
Step #1 - "explore_tests":   },
Step #1 - "explore_tests":   "models": {
Step #1 - "explore_tests":     "cross-encoder-ms-marco-miniilm-l6-v2": {
Step #1 - "explore_tests":       "type": "VERTEXAI",
Step #1 - "explore_tests":       "usage": "RERANKING",
Step #1 - "explore_tests":       "score_threshold": null,
Step #1 - "explore_tests":       "project_id": "datcom-website-dev",
Step #1 - "explore_tests":       "location": "us-central1",
Step #1 - "explore_tests":       "prediction_endpoint_id": "3977846152316846080"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "cross-encoder-mxbai-rerank-base-v1": {
Step #1 - "explore_tests":       "type": "VERTEXAI",
Step #1 - "explore_tests":       "usage": "RERANKING",
Step #1 - "explore_tests":       "score_threshold": null,
Step #1 - "explore_tests":       "project_id": "datcom-website-dev",
Step #1 - "explore_tests":       "location": "us-central1",
Step #1 - "explore_tests":       "prediction_endpoint_id": "284894457873039360"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "uae-large-v1-model": {
Step #1 - "explore_tests":       "type": "VERTEXAI",
Step #1 - "explore_tests":       "usage": "EMBEDDINGS",
Step #1 - "explore_tests":       "score_threshold": 0.7,
Step #1 - "explore_tests":       "project_id": "datcom-nl",
Step #1 - "explore_tests":       "location": "us-central1",
Step #1 - "explore_tests":       "prediction_endpoint_id": "8110162693219942400"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "ft-final-v20230717230459-all-MiniLM-L6-v2": {
Step #1 - "explore_tests":       "type": "LOCAL",
Step #1 - "explore_tests":       "usage": "EMBEDDINGS",
Step #1 - "explore_tests":       "score_threshold": 0.5,
Step #1 - "explore_tests":       "gcs_folder": "gs://datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2"
Step #1 - "explore_tests":     },
Step #1 - "explore_tests":     "sfr-embedding-mistral-model": {
Step #1 - "explore_tests":       "type": "VERTEXAI",
Step #1 - "explore_tests":       "usage": "EMBEDDINGS",
Step #1 - "explore_tests":       "score_threshold": 0.5,
Step #1 - "explore_tests":       "project_id": "datcom-website-dev",
Step #1 - "explore_tests":       "location": "us-central1",
Step #1 - "explore_tests":       "prediction_endpoint_id": "224012300019826688"
Step #1 - "explore_tests":     }
Step #1 - "explore_tests":   },
Step #1 - "explore_tests":   "enable_reranking": true
Step #1 - "explore_tests": } 
Step #2 - "nl_tests": [19:23:37][INFO    ][config_reader.py:120] Loading index and model catalog from: /workspace/nl_server/../deploy/helm_charts/dc_website/nl/catalog.yaml 
Step #2 - "nl_tests": [19:23:37][INFO    ][config_reader.py:86] server config:
Step #2 - "nl_tests": {
Step #2 - "nl_tests":   "version": "1",
Step #2 - "nl_tests":   "default_indexes": [
Step #2 - "nl_tests":     "base_uae_mem"
Step #2 - "nl_tests":   ],
Step #2 - "nl_tests":   "indexes": {
Step #2 - "nl_tests":     "base_uae_mem": {
Step #2 - "nl_tests":       "store_type": "MEMORY",
Step #2 - "nl_tests":       "source_path": "/workspace/tools/nl/embeddings/input/base",
Step #2 - "nl_tests":       "model": "uae-large-v1-model",
Step #2 - "nl_tests":       "healthcheck_query": "Life expectancy",
Step #2 - "nl_tests":       "embeddings_path": "gs://datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "bio_ft": {
Step #2 - "nl_tests":       "store_type": "MEMORY",
Step #2 - "nl_tests":       "source_path": "/workspace/tools/nl/embeddings/input/bio",
Step #2 - "nl_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests":       "healthcheck_query": "Gene",
Step #2 - "nl_tests":       "embeddings_path": "gs://datcom-nl-models/bio_ft_2024_11_08_19_00_38/embeddings.csv"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "medium_ft": {
Step #2 - "nl_tests":       "store_type": "MEMORY",
Step #2 - "nl_tests":       "source_path": null,
Step #2 - "nl_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests":       "healthcheck_query": "Life expectancy",
Step #2 - "nl_tests":       "embeddings_path": "gs://datcom-nl-models/embeddings_medium_2024_05_09_18_01_32.ft_final_v20230717230459.all-MiniLM-L6-v2.csv"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "base_mistral_mem": {
Step #2 - "nl_tests":       "store_type": "MEMORY",
Step #2 - "nl_tests":       "source_path": "/workspace/tools/nl/embeddings/input/base",
Step #2 - "nl_tests":       "model": "sfr-embedding-mistral-model",
Step #2 - "nl_tests":       "healthcheck_query": "Life expectancy",
Step #2 - "nl_tests":       "embeddings_path": "gs://datcom-nl-models/base_mistral_mem_2024_07_01_10_23_43/embeddings.csv"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "sdg_ft": {
Step #2 - "nl_tests":       "store_type": "MEMORY",
Step #2 - "nl_tests":       "source_path": "/workspace/tools/nl/embeddings/input/sdg",
Step #2 - "nl_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests":       "healthcheck_query": "Hunger",
Step #2 - "nl_tests":       "embeddings_path": "gs://datcom-nl-models/sdg_ft_2024_06_24_23_45_46/embeddings.csv"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "undata_ft": {
Step #2 - "nl_tests":       "store_type": "MEMORY",
Step #2 - "nl_tests":       "source_path": "/workspace/tools/nl/embeddings/input/undata",
Step #2 - "nl_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests":       "healthcheck_query": "Hunger",
Step #2 - "nl_tests":       "embeddings_path": "gs://datcom-nl-models/undata_ft_2024_06_24_23_47_04/embeddings.csv"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "undata_ilo_ft": {
Step #2 - "nl_tests":       "store_type": "MEMORY",
Step #2 - "nl_tests":       "source_path": "/workspace/tools/nl/embeddings/input/undata_ilo",
Step #2 - "nl_tests":       "model": "ft-final-v20230717230459-all-MiniLM-L6-v2",
Step #2 - "nl_tests":       "healthcheck_query": "Employment",
Step #2 - "nl_tests":       "embeddings_path": "gs://datcom-nl-models/undata_ilo_ft_2024_10_14_13_45_50/embeddings.csv"
Step #2 - "nl_tests":     }
Step #2 - "nl_tests":   },
Step #2 - "nl_tests":   "models": {
Step #2 - "nl_tests":     "cross-encoder-ms-marco-miniilm-l6-v2": {
Step #2 - "nl_tests":       "type": "VERTEXAI",
Step #2 - "nl_tests":       "usage": "RERANKING",
Step #2 - "nl_tests":       "score_threshold": null,
Step #2 - "nl_tests":       "project_id": "datcom-website-dev",
Step #2 - "nl_tests":       "location": "us-central1",
Step #2 - "nl_tests":       "prediction_endpoint_id": "3977846152316846080"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "cross-encoder-mxbai-rerank-base-v1": {
Step #2 - "nl_tests":       "type": "VERTEXAI",
Step #2 - "nl_tests":       "usage": "RERANKING",
Step #2 - "nl_tests":       "score_threshold": null,
Step #2 - "nl_tests":       "project_id": "datcom-website-dev",
Step #2 - "nl_tests":       "location": "us-central1",
Step #2 - "nl_tests":       "prediction_endpoint_id": "284894457873039360"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "uae-large-v1-model": {
Step #2 - "nl_tests":       "type": "VERTEXAI",
Step #2 - "nl_tests":       "usage": "EMBEDDINGS",
Step #2 - "nl_tests":       "score_threshold": 0.7,
Step #2 - "nl_tests":       "project_id": "datcom-nl",
Step #2 - "nl_tests":       "location": "us-central1",
Step #2 - "nl_tests":       "prediction_endpoint_id": "8110162693219942400"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "ft-final-v20230717230459-all-MiniLM-L6-v2": {
Step #2 - "nl_tests":       "type": "LOCAL",
Step #2 - "nl_tests":       "usage": "EMBEDDINGS",
Step #2 - "nl_tests":       "score_threshold": 0.5,
Step #2 - "nl_tests":       "gcs_folder": "gs://datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2"
Step #2 - "nl_tests":     },
Step #2 - "nl_tests":     "sfr-embedding-mistral-model": {
Step #2 - "nl_tests":       "type": "VERTEXAI",
Step #2 - "nl_tests":       "usage": "EMBEDDINGS",
Step #2 - "nl_tests":       "score_threshold": 0.5,
Step #2 - "nl_tests":       "project_id": "datcom-website-dev",
Step #2 - "nl_tests":       "location": "us-central1",
Step #2 - "nl_tests":       "prediction_endpoint_id": "224012300019826688"
Step #2 - "nl_tests":     }
Step #2 - "nl_tests":   },
Step #2 - "nl_tests":   "enable_reranking": true
Step #2 - "nl_tests": } 
Step #1 - "explore_tests": [19:23:39][INFO    ][gcs.py:50] Download datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2 to /tmp/datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2 
Step #2 - "nl_tests": [19:23:40][INFO    ][gcs.py:50] Download datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2 to /tmp/datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2 
Step #1 - "explore_tests": [19:23:43][INFO    ][SentenceTransformer.py:113] Load pretrained SentenceTransformer: /tmp/datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2 
Step #2 - "nl_tests": [19:23:43][INFO    ][SentenceTransformer.py:113] Load pretrained SentenceTransformer: /tmp/datcom-nl-models/ft_final_v20230717230459.all-MiniLM-L6-v2 
Step #1 - "explore_tests": [19:23:43][INFO    ][SentenceTransformer.py:219] Use pytorch device_name: cpu 
Step #1 - "explore_tests": [19:23:43][INFO    ][gcs.py:50] Download datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv to /tmp/datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv 
Step #2 - "nl_tests": [19:23:44][INFO    ][SentenceTransformer.py:219] Use pytorch device_name: cpu 
Step #2 - "nl_tests": [19:23:44][INFO    ][gcs.py:50] Download datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv to /tmp/datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv 
Step #1 - "explore_tests": [19:23:46][INFO    ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv 
Step #2 - "nl_tests": [19:23:47][INFO    ][util.py:607] ['http://localhost:6070/healthz'] not ready, waiting for 5 seconds 
Step #1 - "explore_tests": Generating train split: 0 examples [00:00, ? examples/s]
Step #1 - "explore_tests": [19:23:47][INFO    ][util.py:607] ['http://localhost:6070/healthz'] not ready, waiting for 5 seconds 
Step #2 - "nl_tests": [19:23:47][INFO    ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/base_uae_mem_2025_11_03_07_10_42/embeddings.csv 
Step #2 - "nl_tests": ============================= test session starts ==============================
Step #2 - "nl_tests": platform linux -- Python 3.11.3, pytest-9.0.1, pluggy-1.6.0 -- /workspace/.env/bin/python3
Step #2 - "nl_tests": cachedir: .pytest_cache
Step #2 - "nl_tests": rootdir: /workspace
Step #2 - "nl_tests": configfile: pytest.ini
Step #2 - "nl_tests": plugins: rerunfailures-10.2, flakefinder-1.1.0, anyio-4.11.0, xdist-3.2.1
Step #2 - "nl_tests": gw0 I / gw1 I / gw2 I / gw3 I / gw4 I / gw5 I / gw6 I / gw7 I / gw8 I / gw9 I / gw10 I / gw11 I / gw12 I / gw13 I / gw14 I / gw15 I / gw16 I / gw17 I / gw18 I / gw19 I / gw20 I / gw21 I / gw22 I / gw23 I / gw24 I / gw25 I / gw26 I / gw27 I / gw28 I / gw29 I / gw30 I / gw31 I
Step #2 - "nl_tests": [gw0] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw1] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw2] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw3] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw4] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw5] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw6] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw7] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw8] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw9] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw10] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw11] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw12] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw13] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw14] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw15] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw16] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw17] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw18] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw19] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw20] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw21] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw22] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw23] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw24] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw25] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw26] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw27] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw28] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw29] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw30] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw31] linux Python 3.11.3 cwd: /workspace
Step #2 - "nl_tests": [gw0] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw1] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw2] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw4] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw3] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw7] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw5] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw6] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw8] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw9] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw10] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw11] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw12] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw13] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw15] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw14] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw18] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw16] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw17] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw19] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw20] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw21] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw23] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw24] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw22] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw25] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw26] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw27] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw29] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw28] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw31] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": [gw30] Python 3.11.3 (main, May 23 2023, 13:25:46) [GCC 10.2.1 20210110]
Step #2 - "nl_tests": gw0 [14] / gw1 [14] / gw2 [14] / gw3 [14] / gw4 [14] / gw5 [14] / gw6 [14] / gw7 [14] / gw8 [14] / gw9 [14] / gw10 [14] / gw11 [14] / gw12 [14] / gw13 [14] / gw14 [14] / gw15 [14] / gw16 [14] / gw17 [14] / gw18 [14] / gw19 [14] / gw20 [14] / gw21 [14] / gw22 [14] / gw23 [14] / gw24 [14] / gw25 [14] / gw26 [14] / gw27 [14] / gw28 [14] / gw29 [14] / gw30 [14] / gw31 [14]
Step #2 - "nl_tests": 
Step #2 - "nl_tests": scheduling tests via LoadScheduling
Step #2 - "nl_tests": 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_strict_default_place 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_place_detection_e2e_dc 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_demo_fallback 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_international 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_textbox_sample 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_demo_climatetrace 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_sdg 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_demo_usa_map_types 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_strict_low_confidence 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_translate 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestMisc::test_strict_multi_verb 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_demo_cities_feb2023 
Step #2 - "nl_tests": server/integration_tests/nl_test.py::NLTestDemo::test_demo_multisv 
Step #1 - "explore_tests": Generating train split: 7305 examples [00:03, 1947.65 examples/s]
Step 
...
[Logs truncated due to log size limitations. For full logs, see https://console.cloud.google.com/cloud-build/builds/4c4de92e-6d01-4cdb-9f70-76025672012c?project=879489846695.]
...
/undata_ft_2024_06_24_23_47_04/embeddings.csv 
Step #1 - "explore_tests": [19:24:25][INFO    ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/undata_ft_2024_06_24_23_47_04/embeddings.csv 
Step #2 - "nl_tests": [19:24:25][INFO    ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/undata_ilo_ft_2024_10_14_13_45_50/embeddings.csv 
Step #1 - "explore_tests": [19:24:25][INFO    ][memory.py:67] Loading embeddings file: /tmp/datcom-nl-models/undata_ilo_ft_2024_10_14_13_45_50/embeddings.csv 
Step #2 - "nl_tests": [19:24:25][INFO    ][flask.py:79] NL Server Flask app initialized 
Step #2 - "nl_tests": [19:24:25][INFO    ][nl_app.py:27] Run nl server in local mode (host=localhost), port=6070 
Step #2 - "nl_tests": [19:24:25][WARNING ][_internal.py:97]  * Debugger is active! 
Step #1 - "explore_tests": [19:24:25][INFO    ][flask.py:79] NL Server Flask app initialized 
Step #1 - "explore_tests": [19:24:25][INFO    ][nl_app.py:27] Run nl server in local mode (host=localhost), port=6070 
Step #1 - "explore_tests": [19:24:25][WARNING ][_internal.py:97]  * Debugger is active! 
Step #1 - "explore_tests": [19:24:26][INFO    ][util.py:593] http://localhost:6070/healthz is up running 
Step #2 - "nl_tests": [19:24:26][INFO    ][util.py:593] http://localhost:6070/healthz is up running 
Step #1 - "explore_tests": [19:24:26][INFO    ][gcs.py:50] Download datcom-website-config/nl_bad_words.txt to /tmp/datcom-website-config/nl_bad_words.txt 
Step #2 - "nl_tests": [19:24:26][INFO    ][gcs.py:50] Download datcom-website-config/nl_bad_words.txt to /tmp/datcom-website-config/nl_bad_words.txt 
Step #1 - "explore_tests": [19:24:27][INFO    ][util.py:593] https://staging.api.datacommons.org/version is up running 
Step #1 - "explore_tests": [19:24:27][INFO    ][web_app.py:27] Run web server in local mode 
Step #1 - "explore_tests":  * Serving Flask app 'server.__init__'
Step #1 - "explore_tests":  * Debug mode: on
Step #2 - "nl_tests": [19:24:27][INFO    ][util.py:593] https://staging.api.datacommons.org/version is up running 
Step #2 - "nl_tests": [19:24:27][INFO    ][web_app.py:27] Run web server in local mode 
Step #2 - "nl_tests":  * Serving Flask app 'server.__init__'
Step #2 - "nl_tests":  * Debug mode: on
Step #1 - "explore_tests": [19:24:37][INFO    ][util.py:593] http://localhost:6070/healthz is up running 
Step #2 - "nl_tests": [19:24:37][INFO    ][util.py:593] http://localhost:6070/healthz is up running 
Step #2 - "nl_tests": [19:24:38][INFO    ][util.py:593] https://staging.api.datacommons.org/version is up running 
Step #2 - "nl_tests": [19:24:38][INFO    ][web_app.py:27] Run web server in local mode 
Step #2 - "nl_tests": [19:24:38][WARNING ][_internal.py:97]  * Debugger is active! 
Step #1 - "explore_tests": [19:24:38][INFO    ][util.py:593] https://staging.api.datacommons.org/version is up running 
Step #1 - "explore_tests": [19:24:38][INFO    ][web_app.py:27] Run web server in local mode 
Step #1 - "explore_tests": [19:24:38][WARNING ][_internal.py:97]  * Debugger is active! 
Step #2 - "nl_tests": 
Step #1 - "explore_tests": 
Step #1 - "explore_tests": [gw26] [  2%] SKIPPED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_answer_places 
Step #1 - "explore_tests": [gw0] [  4%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_sdg 
Step #1 - "explore_tests": [gw4] [  6%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_uae 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_us_demo 
Step #1 - "explore_tests": [gw9] [  8%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_bugs 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_fallbacks 
Step #1 - "explore_tests": [gw7] [ 10%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_undata 
Step #1 - "explore_tests": [gw6] [ 12%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_undata_ilo 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_filter_query_disabled 
Step #2 - "nl_tests": [gw11] [  7%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_strict_low_confidence 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_date_range 
Step #1 - "explore_tests": [gw1] [ 14%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_bio 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_superlatives 
Step #1 - "explore_tests": [gw2] [ 16%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_sdg 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_translate 
Step #1 - "explore_tests": [gw12] [ 18%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_sdg 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_default_place 
Step #1 - "explore_tests": [gw5] [ 20%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_undata_dev 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_correlation_simple_place 
Step #1 - "explore_tests": [gw3] [ 22%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_basic_sfr 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_undata 
Step #1 - "explore_tests": [gw24] [ 24%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_sdg_specialvars 
Step #1 - "explore_tests": [gw20] [ 26%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_sdg 
Step #1 - "explore_tests": [gw25] [ 28%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_undata 
Step #1 - "explore_tests": [gw21] [ 30%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_sdg_global 
Step #1 - "explore_tests": [gw22] [ 32%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_sdg_global_specialvars 
Step #1 - "explore_tests": [gw15] [ 34%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_basic 
Step #2 - "nl_tests": [gw10] [ 14%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_strict_default_place 
Step #2 - "nl_tests": [gw12] [ 21%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_strict_multi_verb 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_toolformer_rag_mode 
Step #1 - "explore_tests": [gw14] [ 36%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_statvars 
Step #2 - "nl_tests": [19:24:45][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #2 - "nl_tests": [19:24:45][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #2 - "nl_tests": [gw13] [ 28%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_translate 
Step #2 - "nl_tests": [gw7] [ 35%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_textbox_sample 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_multi_sv 
Step #1 - "explore_tests": [gw8] [ 38%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_bio 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_default_place 
Step #1 - "explore_tests": [gw14] [ 40%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_multi_sv 
Step #1 - "explore_tests": [gw18] [ 42%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_explore_more 
Step #1 - "explore_tests": [gw19] [ 44%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_nl_size 
Step #2 - "nl_tests": [19:24:49][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": [gw12] [ 46%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_default_place 
Step #1 - "explore_tests": [gw13] [ 48%] PASSED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_translate 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_multi_verb 
Step #1 - "explore_tests": [gw27] [ 50%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_correlation_bugs 
Step #2 - "nl_tests": [gw1] [ 42%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_climatetrace 
Step #1 - "explore_tests": [gw23] [ 52%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_statvars 
Step #1 - "explore_tests": [gw17] [ 54%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_correlation 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_triple 
Step #1 - "explore_tests": [gw0] [ 56%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_sdg 
Step #1 - "explore_tests": [gw16] [ 58%] PASSED server/integration_tests/explore_test.py::ExploreTestFulfillment::test_fulfillment_comparison 
Step #2 - "nl_tests": [gw9] [ 50%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_sdg 
Step #1 - "explore_tests": [19:24:52][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:24:52][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": [19:24:52][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:24:52][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": [19:24:52][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:24:52][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": [19:24:52][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:24:52][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": [19:24:53][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_toolformer_rig_mode 
Step #1 - "explore_tests": [gw13] [ 60%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_strict_multi_verb 
Step #1 - "explore_tests": [gw7] [ 62%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_filter_query_disabled 
Step #2 - "nl_tests": [gw5] [ 57%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_demo_usa_map_types 
Step #1 - "explore_tests": [19:24:55][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": [19:24:55][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": [gw3] [ 64%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_undata 
Step #1 - "explore_tests": [gw2] [ 66%] RERUN server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_translate 
Step #1 - "explore_tests": [19:24:56][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_translate 
Step #1 - "explore_tests": [gw10] [ 68%] RERUN server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context 
Step #2 - "nl_tests": [gw3] [ 64%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_multisv 
Step #1 - "explore_tests": [19:24:56][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:24:56][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context 
Step #1 - "explore_tests": [gw11] [ 70%] RERUN server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar 
Step #1 - "explore_tests": [19:24:57][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:24:57][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #2 - "nl_tests": [gw8] [ 71%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_place_detection_e2e_dc 
Step #1 - "explore_tests": [19:25:00][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar 
Step #1 - "explore_tests": [gw5] [ 72%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_correlation_simple_place 
Step #1 - "explore_tests": [gw10] [ 72%] RERUN server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context 
Step #1 - "explore_tests": [19:25:01][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:25:01][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context 
Step #2 - "nl_tests": [gw6] [ 78%] PASSED server/integration_tests/nl_test.py::NLTestMisc::test_international 
Step #1 - "explore_tests": [gw2] [ 72%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_translate 
Step #2 - "nl_tests": [gw2] [ 85%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_fallback 
Step #1 - "explore_tests": [gw1] [ 74%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_superlatives 
Step #1 - "explore_tests": [19:25:04][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": [19:25:04][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:25:04][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": [gw9] [ 76%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_fallbacks 
Step #1 - "explore_tests": [gw11] [ 76%] RERUN server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar 
Step #1 - "explore_tests": [19:25:04][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:25:04][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": [19:25:05][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": [19:25:06][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar 
Step #1 - "explore_tests": [gw10] [ 76%] FAILED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context 
Step #2 - "nl_tests": [gw0] [ 92%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_cities_feb2023 
Step #1 - "explore_tests": [19:25:08][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_high_sv_threshold 
Step #1 - "explore_tests": [gw30] [ 78%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_india_demo 
Step #1 - "explore_tests": [gw11] [ 78%] FAILED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar 
Step #1 - "explore_tests": server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_single_date 
Step #1 - "explore_tests": [gw16] [ 80%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_toolformer_rig_mode 
Step #2 - "nl_tests": [gw4] [100%] PASSED server/integration_tests/nl_test.py::NLTestDemo::test_demo_feb2023 
Step #2 - "nl_tests": 
Step #2 - "nl_tests": =============================== warnings summary ===============================
Step #2 - "nl_tests": .env/lib/python3.11/site-packages/flask_babel/__init__.py:183: 32 warnings
Step #2 - "nl_tests":   /workspace/.env/lib/python3.11/site-packages/flask_babel/__init__.py:183: DeprecationWarning: 'locked_cached_property' is deprecated and will be removed in Flask 2.4. Use a lock inside the decorated function if locking is needed.
Step #2 - "nl_tests":     @locked_cached_property
Step #2 - "nl_tests": 
Step #2 - "nl_tests": -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
Step #2 - "nl_tests": ================= 14 passed, 32 warnings in 105.59s (0:01:45) ==================
Finished Step #2 - "nl_tests"
Step #1 - "explore_tests": [19:25:11][INFO    ][llm_api.py:88] Gemini model used for LLM API: gemini-2.5-flash 
Step #1 - "explore_tests": [19:25:11][INFO    ][models.py:4993] AFC is enabled with max remote calls: 10. 
Step #1 - "explore_tests": [gw28] [ 82%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_edge_cases 
Step #1 - "explore_tests": [gw10] [ 84%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_high_sv_threshold 
Step #1 - "explore_tests": [gw31] [ 86%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_electrification_demo 
Step #1 - "explore_tests": [gw15] [ 88%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_toolformer_rag_mode 
Step #1 - "explore_tests": [19:25:14][INFO    ][_client.py:1025] HTTP Request: POST https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent "HTTP/1.1 200 OK" 
Step #1 - "explore_tests": [gw4] [ 90%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_us_demo 
Step #1 - "explore_tests": [gw29] [ 92%] PASSED server/integration_tests/explore_test.py::ExploreTestEE1::test_e2e_edge_cases2 
Step #1 - "explore_tests": [gw8] [ 94%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_default_place 
Step #1 - "explore_tests": [gw6] [ 96%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_date_range 
Step #1 - "explore_tests": [gw17] [ 98%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_triple 
Step #1 - "explore_tests": [gw11] [100%] PASSED server/integration_tests/explore_test.py::ExploreTestEE2::test_e2e_single_date 
Step #1 - "explore_tests": 
Step #1 - "explore_tests": =================================== FAILURES ===================================
Step #1 - "explore_tests": _________________ ExploreTestDetection.test_detection_context __________________
Step #1 - "explore_tests": [gw10] linux -- Python 3.11.3 /workspace/.env/bin/python3
Step #1 - "explore_tests": 
Step #1 - "explore_tests": self = <workspace.server.integration_tests.explore_test.ExploreTestDetection testMethod=test_detection_context>
Step #1 - "explore_tests": 
Step #1 - "explore_tests":     def test_detection_context(self):
Step #1 - "explore_tests": >     self.run_detection('detection_api_context', [
Step #1 - "explore_tests":           'States with highest PHDs', 'Commute in tracts of California',
Step #1 - "explore_tests":           'Compare with Nevada', 'Correlate with asthma',
Step #1 - "explore_tests":           'countries with greenhouse gas emissions',
Step #1 - "explore_tests":           'median income in Santa Clara county and Alameda county'
Step #1 - "explore_tests":       ])
Step #1 - "explore_tests": 
Step #1 - "explore_tests": server/integration_tests/explore_test.py:343: 
Step #1 - "explore_tests": _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
Step #1 - "explore_tests": server/integration_tests/explore_test.py:70: in run_detection
Step #1 - "explore_tests":     self.handle_response(q, resp, test_dir, d, failure, check_detection)
Step #1 - "explore_tests": server/integration_tests/explore_test.py:197: in handle_response
Step #1 - "explore_tests":     self.assertEqual(a, b)
Step #1 - "explore_tests": E   AssertionError: '{\n [63 chars]    "quantity": {\n        "idx": 0,\n        [1124 chars]]\n}' != '{\n [63 chars]    "ranking_type": [\n        1\n      ],\n  [946 chars]]\n}'
Step #1 - "explore_tests": E     {
Step #1 - "explore_tests": E       "childEntityType": "State",
Step #1 - "explore_tests": E       "classifications": [
Step #1 - "explore_tests": E   -     {
Step #1 - "explore_tests": E   -       "quantity": {
Step #1 - "explore_tests": E   -         "idx": 0,
Step #1 - "explore_tests": E   -         "qval": {
Step #1 - "explore_tests": E   -           "cmp": "GE",
Step #1 - "explore_tests": E   -           "val": 2.2250738585072014e-308
Step #1 - "explore_tests": E   -         }
Step #1 - "explore_tests": E   -       },
Step #1 - "explore_tests": E   -       "type": 3
Step #1 - "explore_tests": E   -     },
Step #1 - "explore_tests": E         {
Step #1 - "explore_tests": E           "ranking_type": [
Step #1 - "explore_tests": E             1
Step #1 - "explore_tests": E           ],
Step #1 - "explore_tests": E           "type": 2
Step #1 - "explore_tests": E         },
Step #1 - "explore_tests": E         {
Step #1 - "explore_tests": E           "contained_in_place_type": "State",
Step #1 - "explore_tests": E           "had_default_type": false,
Step #1 - "explore_tests": E           "type": 4
Step #1 - "explore_tests": E         }
Step #1 - "explore_tests": E       ],
Step #1 - "explore_tests": E       "client": "test_detect",
Step #1 - "explore_tests": E       "comparisonEntities": [],
Step #1 - "explore_tests": E       "comparisonVariables": [],
Step #1 - "explore_tests": E       "context": {},
Step #1 - "explore_tests": E       "debug": {},
Step #1 - "explore_tests": E       "entities": [
Step #1 - "explore_tests": E         "country/USA"
Step #1 - "explore_tests": E       ],
Step #1 - "explore_tests": E       "nonPlaceEntities": [],
Step #1 - "explore_tests": E       "properties": [],
Step #1 - "explore_tests": E       "sessionId": "007_999999999",
Step #1 - "explore_tests": E       "variables": [
Step #1 - "explore_tests": E         "Count_Person_EducationalAttainmentDoctorateDegree",
Step #1 - "explore_tests": E         "Count_Person_25OrMoreYears_EducationalAttainmentDoctorateDegree_Female",
Step #1 - "explore_tests": E         "Count_Person_25OrMoreYears_EducationalAttainmentDoctorateDegree_Male",
Step #1 - "explore_tests": E         "Count_Person_25OrMoreYears_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears",
Step #1 - "explore_tests": E         "Count_Person_25OrMoreYears_Female_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears_Female",
Step #1 - "explore_tests": E         "Count_Person_25OrMoreYears_Male_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears_Male",
Step #1 - "explore_tests": E         "dc/topic/StudentsInCollege"
Step #1 - "explore_tests": E       ]
Step #1 - "explore_tests": E     }
Step #1 - "explore_tests": _________________ ExploreTestDetection.test_detection_multivar _________________
Step #1 - "explore_tests": [gw11] linux -- Python 3.11.3 /workspace/.env/bin/python3
Step #1 - "explore_tests": 
Step #1 - "explore_tests": self = <workspace.server.integration_tests.explore_test.ExploreTestDetection testMethod=test_detection_multivar>
Step #1 - "explore_tests": 
Step #1 - "explore_tests":     def test_detection_multivar(self):
Step #1 - "explore_tests": >     self.run_detection('detection_api_multivar', [
Step #1 - "explore_tests":           'number of poor hispanic women with phd',
Step #1 - "explore_tests":           'compare obesity vs. poverty',
Step #1 - "explore_tests":           'show me the impact of climate change on drought',
Step #1 - "explore_tests":           'how are factors like obesity, blood pressure and asthma impacted by climate change',
Step #1 - "explore_tests":           'Compare "Male population" with "Female Population"',
Step #1 - "explore_tests":       ],
Step #1 - "explore_tests":                          check_detection=True)
Step #1 - "explore_tests": 
Step #1 - "explore_tests": server/integration_tests/explore_test.py:333: 
Step #1 - "explore_tests": _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
Step #1 - "explore_tests": server/integration_tests/explore_test.py:70: in run_detection
Step #1 - "explore_tests":     self.handle_response(q, resp, test_dir, d, failure, check_detection)
Step #1 - "explore_tests": server/integration_tests/explore_test.py:226: in handle_response
Step #1 - "explore_tests":     self._check_multivars(dbg["sv_matching"], expected["sv_matching"])
Step #1 - "explore_tests": _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
Step #1 - "explore_tests": 
Step #1 - "explore_tests": self = <workspace.server.integration_tests.explore_test.ExploreTestDetection testMethod=test_detection_multivar>
Step #1 - "explore_tests": got = {'CosineScore': [], 'MultiSV': {}, 'Query': 'number of poor hispanic women with phd', 'SV': [], ...}
Step #1 - "explore_tests": want = {'CosineScore': [], 'MultiSV': {}, 'Query': 'number of poor hispanic women with phd', 'SV': []}
Step #1 - "explore_tests": 
Step #1 - "explore_tests":     def _check_multivars(self, got, want):
Step #1 - "explore_tests": >     self.assertEqual(got['SV'][0], want['SV'][0])
Step #1 - "explore_tests":                        ^^^^^^^^^^^^
Step #1 - "explore_tests": E     IndexError: list index out of range
Step #1 - "explore_tests": 
Step #1 - "explore_tests": server/integration_tests/explore_test.py:229: IndexError
Step #1 - "explore_tests": =============================== warnings summary ===============================
Step #1 - "explore_tests": .env/lib/python3.11/site-packages/flask_babel/__init__.py:183: 32 warnings
Step #1 - "explore_tests":   /workspace/.env/lib/python3.11/site-packages/flask_babel/__init__.py:183: DeprecationWarning: 'locked_cached_property' is deprecated and will be removed in Flask 2.4. Use a lock inside the decorated function if locking is needed.
Step #1 - "explore_tests":     @locked_cached_property
Step #1 - "explore_tests": 
Step #1 - "explore_tests": -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
Step #1 - "explore_tests": =========================== short test summary info ============================
Step #1 - "explore_tests": FAILED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_context - AssertionError: '{\n [63 chars]    "quantity": {\n        "idx": 0,\n        [1124 chars]]\n}' != '{\n [63 chars]    "ranking_type": [\n        1\n      ],\n  [946 chars]]\n}'
Step #1 - "explore_tests":   {
Step #1 - "explore_tests":     "childEntityType": "State",
Step #1 - "explore_tests":     "classifications": [
Step #1 - "explore_tests": -     {
Step #1 - "explore_tests": -       "quantity": {
Step #1 - "explore_tests": -         "idx": 0,
Step #1 - "explore_tests": -         "qval": {
Step #1 - "explore_tests": -           "cmp": "GE",
Step #1 - "explore_tests": -           "val": 2.2250738585072014e-308
Step #1 - "explore_tests": -         }
Step #1 - "explore_tests": -       },
Step #1 - "explore_tests": -       "type": 3
Step #1 - "explore_tests": -     },
Step #1 - "explore_tests":       {
Step #1 - "explore_tests":         "ranking_type": [
Step #1 - "explore_tests":           1
Step #1 - "explore_tests":         ],
Step #1 - "explore_tests":         "type": 2
Step #1 - "explore_tests":       },
Step #1 - "explore_tests":       {
Step #1 - "explore_tests":         "contained_in_place_type": "State",
Step #1 - "explore_tests":         "had_default_type": false,
Step #1 - "explore_tests":         "type": 4
Step #1 - "explore_tests":       }
Step #1 - "explore_tests":     ],
Step #1 - "explore_tests":     "client": "test_detect",
Step #1 - "explore_tests":     "comparisonEntities": [],
Step #1 - "explore_tests":     "comparisonVariables": [],
Step #1 - "explore_tests":     "context": {},
Step #1 - "explore_tests":     "debug": {},
Step #1 - "explore_tests":     "entities": [
Step #1 - "explore_tests":       "country/USA"
Step #1 - "explore_tests":     ],
Step #1 - "explore_tests":     "nonPlaceEntities": [],
Step #1 - "explore_tests":     "properties": [],
Step #1 - "explore_tests":     "sessionId": "007_999999999",
Step #1 - "explore_tests":     "variables": [
Step #1 - "explore_tests":       "Count_Person_EducationalAttainmentDoctorateDegree",
Step #1 - "explore_tests":       "Count_Person_25OrMoreYears_EducationalAttainmentDoctorateDegree_Female",
Step #1 - "explore_tests":       "Count_Person_25OrMoreYears_EducationalAttainmentDoctorateDegree_Male",
Step #1 - "explore_tests":       "Count_Person_25OrMoreYears_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears",
Step #1 - "explore_tests":       "Count_Person_25OrMoreYears_Female_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears_Female",
Step #1 - "explore_tests":       "Count_Person_25OrMoreYears_Male_DoctorateDegree_AsFractionOf_Count_Person_25OrMoreYears_Male",
Step #1 - "explore_tests":       "dc/topic/StudentsInCollege"
Step #1 - "explore_tests":     ]
Step #1 - "explore_tests":   }
Step #1 - "explore_tests": FAILED server/integration_tests/explore_test.py::ExploreTestDetection::test_detection_multivar - IndexError: list index out of range
Step #1 - "explore_tests": == 2 failed, 47 passed, 1 skipped, 32 warnings, 5 rerun in 121.12s (0:02:01) ===
Finished Step #1 - "explore_tests"
ERROR
ERROR: build step 1 "python:3.11.3" failed: step exited with non-zero status: 1

Build Log: https://console.cloud.google.com/cloud-build/builds/4c4de92e-6d01-4cdb-9f70-76025672012c?project=879489846695
