Refactor PIINVOICE so env vars are fetched at start of billing pipeline. by KelvinLinBU · Pull Request #179 · CCI-MOC/invoicing

KelvinLinBU · 2025-04-22T13:34:57Z

Closes #168. Adhere to testing best practices as outlined in #159 (comment) and #168 (comment). Modify utils.py to accommodate changes.

KelvinLinBU · 2025-05-29T16:06:16Z

@QuanMPhm can be merged with main now

QuanMPhm · 2025-04-22T14:59:12Z

process_report/invoices/pi_specific_invoice.py

    - NewPICreditProcessor
    """

+    chrome_binary_location: str


Could you move this arg below the constants? In this codebase, the conventional structure is class constants, then variables, then functions.

QuanMPhm · 2025-06-03T12:59:22Z

process_report/invoices/pi_specific_invoice.py

-            chrome_binary_location = os.environ.get(
-                "CHROME_BIN_PATH", "/usr/bin/chromium"
-            )
-            if not os.path.exists(chrome_binary_location):


You do not need to move this check to process_report.py. We only want to move the fetching of the env var. Error checking can stay in the invoice class for now.
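A minimal sketch of the split described above, assuming a trimmed-down, hypothetical `PIInvoice` class and a hypothetical `build_invoice` entry point (neither matches the real signatures in this repository). The env var is fetched once at the start of the pipeline, while the existence check stays inside the invoice class:

```python
import os
from dataclasses import dataclass


@dataclass
class PIInvoice:
    """Hypothetical stand-in for the real invoice class."""

    # Value is passed in by the caller; the class never reads os.environ.
    chrome_binary_location: str

    def check_chrome(self) -> None:
        # Error checking stays in the invoice class.
        if not os.path.exists(self.chrome_binary_location):
            raise FileNotFoundError(
                f"Chrome binary not found at {self.chrome_binary_location}"
            )


def build_invoice(environ=os.environ) -> PIInvoice:
    """Hypothetical pipeline entry point: fetch the env var once, up front."""
    chrome_binary_location = environ.get("CHROME_BIN_PATH", "/usr/bin/chromium")
    return PIInvoice(chrome_binary_location=chrome_binary_location)
```

Passing the mapping in as `environ` is only a convenience for the sketch; it lets a caller supply a plain dict instead of the real process environment.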

QuanMPhm · 2025-06-03T14:35:18Z

process_report/process_report.py

You should add CHROME_BIN_PATH in REQUIRED_ENV_VARS

There's no reason to perform validation in two different places. If we're using pydantic_settings to manage environment variables, then rather than using required_env_files here, just mark chrome_bin_path as required in process_report.settings.Settings. Currently it is optional:

chrome_bin_path: str | None = None

To make it required:

chrome_bin_path: str

This will cause pydantic to throw a validation error when instantiating Settings:

    pydantic_core._pydantic_core.ValidationError: 1 validation error for Settings
    chrome_bin_path
      Field required [type=missing, input_value={}, input_type=dict]
        For further information visit https://errors.pydantic.dev/2.10/v/missing

Similarly, you can remove the check for KEYCLOAK_CLIENT_ID and KEYCLOAK_CLIENT_SECRET here by adding a validator to Settings:

    from pydantic import model_validator
    from pydantic_settings import BaseSettings

    class Settings(BaseSettings):
        ...

        @model_validator(mode="after")
        def check_keycloak_auth(self):
            if not self.coldfront_api_filepath and not (
                self.keycloak_client_id and self.keycloak_client_secret
            ):
                raise ValueError(
                    "You must either set coldfront_api_filepath or provide keycloak credentials in "
                    "KEYCLOAK_CLIENT_ID and KEYCLOAK_CLIENT_SECRET"
                )
            # An "after" validator must return the validated instance.
            return self

You probably want to catch the exception from pydantic to produce a more friendly error message that does not include a full Python traceback.

Thank you for the very detailed feedback. I don't know why I didn't think of getting rid of that hacky function in the first place. Your feedback makes it look obvious.

QuanMPhm · 2025-06-03T14:36:27Z

process_report/process_report.py

 def main():
    """Remove non-billable PIs and projects"""

+    chrome_binary_location = os.environ.get("CHROME_BIN_PATH", "/usr/bin/chromium")


Related to adding CHROME_BIN_PATH to REQUIRED_ENV_VARS: you should fetch the env var after the env vars have been validated.

I am now a contributor to this PR as well

larsks · 2025-08-25T16:13:55Z

process_report/invoices/pi_specific_invoice.py

            subprocess.run(
                [
-                    CHROME_BIN_PATH,
+                    os.environ.get("CHROME_BIN_PATH", "/usr/bin/chromium"),


What was the motivation for this change? I think it's generally better practice to read your environment variables early, rather than doing it inline like this.

I've written it so that the env var is loaded by the settings module.

QuanMPhm · 2026-03-15T21:25:16Z

@larsks @knikolla After half a year, I want to wrap up this PR

larsks · 2026-04-01T15:31:26Z

.github/workflows/unit-tests.yaml

    name: Run unit tests
    runs-on: ubuntu-latest
+    env:
+      CHROME_BIN_PATH: /usr/foo/chromium


Why are we setting CHROME_BIN_PATH to an invalid path here?

The unit tests were designed to not require integration with 3rd-party tools. We do have an actual integration test for Chrome that installs Chromium and sets a real path.

As for why the env var is set rather than left unset: I wanted to check that the code actually reads the env var CHROME_BIN_PATH, whatever its value may be.

I don't really like this. Is there any way you can provide these values through mocking during the tests?

To echo what Kristi said: unit tests should never depend on the value of external environment variables. Regardless of whether CHROME_BIN_PATH is set to an invalid value, a valid path, or is unset, the unit tests should behave the same.

You don't even need to use mocking; you can just set environment variables directly in your unit tests.
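One way this could look, as a sketch only: `get_chrome_bin_path` is a hypothetical helper standing in for whatever code reads the variable, and the test class name is made up. Each test sets or clears CHROME_BIN_PATH itself and restores the previous value afterwards, so the result does not depend on what the CI workflow exports:

```python
import os
import unittest


def get_chrome_bin_path() -> str:
    """Hypothetical helper standing in for the code under test."""
    return os.environ.get("CHROME_BIN_PATH", "/usr/bin/chromium")


class ChromeBinPathTest(unittest.TestCase):
    def setUp(self):
        # Remember whatever the outer environment had and restore it afterwards.
        original = os.environ.get("CHROME_BIN_PATH")

        def restore():
            if original is None:
                os.environ.pop("CHROME_BIN_PATH", None)
            else:
                os.environ["CHROME_BIN_PATH"] = original

        self.addCleanup(restore)

    def test_reads_value_from_environment(self):
        os.environ["CHROME_BIN_PATH"] = "/fake/chromium"
        self.assertEqual(get_chrome_bin_path(), "/fake/chromium")

    def test_falls_back_to_default_when_unset(self):
        os.environ.pop("CHROME_BIN_PATH", None)
        self.assertEqual(get_chrome_bin_path(), "/usr/bin/chromium")
```

With this shape the tests behave the same whether the surrounding environment sets CHROME_BIN_PATH to a valid path, an invalid path, or nothing at all.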

larsks · 2026-04-01T15:43:12Z

process_report/process_report.py

There's no reason to perform validation in two different places. If we're using pydantic_settings to manage environment variables, then rather than using required_env_files here, just mark chrome_bin_path as required in process_report.settings.Settings. Currently it is optional:

chrome_bin_path: str | None = None

To make it required:

chrome_bin_path: str

This will cause pydantic to throw a validation error when instantiating Settings:

    pydantic_core._pydantic_core.ValidationError: 1 validation error for Settings
    chrome_bin_path
      Field required [type=missing, input_value={}, input_type=dict]
        For further information visit https://errors.pydantic.dev/2.10/v/missing

Similarly, you can remove the check for KEYCLOAK_CLIENT_ID and KEYCLOAK_CLIENT_SECRET here by adding a validator to Settings:

    from pydantic import model_validator
    from pydantic_settings import BaseSettings

    class Settings(BaseSettings):
        ...

        @model_validator(mode="after")
        def check_keycloak_auth(self):
            if not self.coldfront_api_filepath and not (
                self.keycloak_client_id and self.keycloak_client_secret
            ):
                raise ValueError(
                    "You must either set coldfront_api_filepath or provide keycloak credentials in "
                    "KEYCLOAK_CLIENT_ID and KEYCLOAK_CLIENT_SECRET"
                )
            # An "after" validator must return the validated instance.
            return self

You probably want to catch the exception from pydantic to produce a more friendly error message that does not include a full Python traceback.

Also added validation for missing environment variables in settings.py and nice formatting for validation errors

knikolla · 2026-04-02T20:27:29Z

process_report/settings.py

    fetch_from_s3: bool = True
    upload_to_s3: bool = False

+    chrome_bin_path: str


I think it makes sense to keep providing a default as before as you've now made the environment variable required for running all unit tests for no obvious gain.

knikolla · 2026-04-02T20:29:38Z

.github/workflows/unit-tests.yaml

    name: Run unit tests
    runs-on: ubuntu-latest
+    env:
+      CHROME_BIN_PATH: /usr/foo/chromium


I don't really like this. Is there any way you can provide these values through mocking during the tests?

QuanMPhm requested review from QuanMPhm and larsks and removed request for QuanMPhm April 22, 2025 14:56

KelvinLinBU self-assigned this May 14, 2025

KelvinLinBU force-pushed the refactor-env branch 3 times, most recently from 63d9226 to 6d1bdef Compare May 29, 2025 16:02

QuanMPhm previously requested changes Jun 3, 2025


QuanMPhm unassigned KelvinLinBU Jul 14, 2025

QuanMPhm force-pushed the refactor-env branch 2 times, most recently from 4f89e7d to 0c340e2 Compare July 22, 2025 19:50

QuanMPhm requested review from knikolla and naved001 July 24, 2025 15:01

larsks reviewed Aug 25, 2025


QuanMPhm force-pushed the refactor-env branch from 0c340e2 to f0c6b60 Compare March 15, 2026 21:24

QuanMPhm requested a review from larsks March 15, 2026 21:24

QuanMPhm force-pushed the refactor-env branch from f0c6b60 to de5968f Compare March 15, 2026 21:35

larsks reviewed Apr 1, 2026


QuanMPhm force-pushed the refactor-env branch from de5968f to eb12106 Compare April 2, 2026 20:21

QuanMPhm requested a review from larsks April 2, 2026 20:23

Mark CHROME_BIN_PATH as required in settings

bc8e178

Also added validation for missing environment variables in settings.py and nice formatting for validation errors

QuanMPhm force-pushed the refactor-env branch from eb12106 to bc8e178 Compare April 2, 2026 20:28

knikolla requested changes Apr 2, 2026



4 participants
