add job database implementation that uses stac #619

jdries · 2024-09-15T12:46:26Z

Still to solve:

- how to detect authentication settings
- use aggregate API: won't do, that API is still on dev and it's not clear whether it works
- filtering by status doesn't work yet
- avoid dependency on external STAC builder library
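
As long as server-side filtering by status is not working, one possible stopgap is to filter the returned items on the client. The snippet below is only a sketch under assumptions: it treats items as plain STAC-like dicts, assumes the job status is stored under `properties.status`, and `filter_items_by_status` is a hypothetical helper, not something from this PR.

```python
from typing import Any, Dict, Iterable, Iterator, Sequence

Item = Dict[str, Any]


def filter_items_by_status(items: Iterable[Item], statuses: Sequence[str]) -> Iterator[Item]:
    """Yield only the items whose `properties.status` is one of `statuses`.

    Assumption: each item is a STAC-like dict and the job status lives in
    item["properties"]["status"]. Items without that field are skipped.
    """
    wanted = set(statuses)
    for item in items:
        if item.get("properties", {}).get("status") in wanted:
            yield item


# Small illustration with hand-written items (not real STAC API output).
items = [
    {"id": "job-1", "properties": {"status": "not_started"}},
    {"id": "job-2", "properties": {"status": "running"}},
    {"id": "job-3", "properties": {}},
]
selected_ids = [item["id"] for item in filter_items_by_status(items, ["running", "queued"])]
```

This trades extra data transfer for simplicity, so it would only make sense for collections that are small enough to page through completely.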

soxofaan

This is quite a large PR and I haven't gone through all of it yet, so just some initial comments.

openeo/extra/job_management.py

openeo/extra/stac_job_db.py

tests/extra/test_stac_jobdb.py

to prepare for future extensions, e.g. #619

soxofaan

some more notes

openeo/extra/job_management/stac_job_db.py

soxofaan · 2024-12-06T18:16:29Z

openeo/extra/job_management/stac_job_db.py

+        df = pd.DataFrame(series)
+        if len(series) == 0:
+            # TODO: What if default columns are overwritten by the user?
+            df = MultiBackendJobManager._normalize_df(


Using the private MultiBackendJobManager._normalize_df from get_by_status looks a bit problematic.

It's also used from initialize_from_df, which is OK at the moment, but subject to change (see #667).

Using it in a context other than initialize_from_df, like here, seems to indicate that we have to rethink all this initialization business. Not sure yet what to do instead.

Indeed, I was hesitant to use it here at first, but I see no other way to return an empty dataframe here that has all the columns required by the MultiBackendJobManager.
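
To make the "empty dataframe with all required columns" point concrete, here is a minimal sketch of that kind of normalization. The column names and defaults in `REQUIRED_WITH_DEFAULT` are illustrative assumptions, and `normalize_df` / `empty_jobs_frame` are hypothetical stand-ins, not the actual `MultiBackendJobManager._normalize_df` implementation.

```python
import pandas as pd

# Illustrative assumption: the real set of required columns and defaults
# is defined by MultiBackendJobManager and may differ.
REQUIRED_WITH_DEFAULT = {
    "status": "not_started",
    "id": None,
    "start_time": None,
    "backend_name": None,
}


def normalize_df(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of `df` with every missing required column added.

    Columns that already exist are left untouched.
    """
    missing = {col: default for col, default in REQUIRED_WITH_DEFAULT.items() if col not in df.columns}
    return df.assign(**missing)


def empty_jobs_frame() -> pd.DataFrame:
    """Empty result (zero rows) that still has all required columns."""
    return normalize_df(pd.DataFrame())


empty = empty_jobs_frame()
partial = normalize_df(pd.DataFrame({"id": ["j-1"]}))
```

With something like this, a status query that matches nothing can still return a frame whose columns the caller can rely on.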

One way to get MultiBackendJobManager._normalize_df out of initialize_from_df might be to do a read, normalize, persist at the start of MultiBackendJobManager.run_jobs?
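
A rough sketch of what that read, normalize, persist step could look like. Everything here is hypothetical: `InMemoryJobDB` is a toy stand-in for a job database with `read`/`persist`, the defaults in `normalize` are made up for illustration, and `run_jobs` only shows the proposed first step, not the real job loop.

```python
import pandas as pd


class InMemoryJobDB:
    """Toy stand-in for a job database exposing read() and persist()."""

    def __init__(self, df: pd.DataFrame):
        self._df = df.copy()

    def read(self) -> pd.DataFrame:
        return self._df.copy()

    def persist(self, df: pd.DataFrame) -> None:
        self._df = df.copy()


def normalize(df: pd.DataFrame) -> pd.DataFrame:
    """Add missing bookkeeping columns (illustrative defaults only)."""
    defaults = {"status": "not_started", "id": None}
    return df.assign(**{col: val for col, val in defaults.items() if col not in df.columns})


def run_jobs(job_db: InMemoryJobDB) -> None:
    # Proposed first step: read, normalize, persist, so the database itself
    # does not need to know about normalization when it is initialized.
    job_db.persist(normalize(job_db.read()))
    # ... the actual job tracking loop would follow here ...


db = InMemoryJobDB(pd.DataFrame({"year": [2021, 2022]}))
run_jobs(db)
result = db.read()
```

One design consequence of this approach is an extra full read and write on every run, which may matter for a remote database.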

I would leave it as MultiBackendJobManager._normalize_df for now, for lack of a better alternative.
We just have to be sure it's included in the changes related to #667.

So in that regard, it's important that this MultiBackendJobManager._normalize_df stuff is properly covered by unit tests, so that we immediately see if we break something.

openeo/extra/job_management/stac_job_db.py

soxofaan · 2024-12-18T15:04:09Z

I'm a bit short on time to review this deeply, let alone try running some use cases. It's good that half of the PR is unit tests.
The PR also just adds a new class and doesn't touch existing code paths, so maybe we can just merge this as is.

jdries added 8 commits September 15, 2024 14:44

add job database implementation that uses stac

4d50a15

add test which demonstrates that it's not yet working

82c1d2f

remove old abstract method

f565b9c

Merge branch 'master' into stac_jobdb

8291625

fix status filter

57c5dfd

remove dependency on external library

572e4cb

stac db: externalize auth + now actually works

8b6a79f

small cleanup: no random renaming of geometry column

5670075

jdries self-assigned this Oct 2, 2024

VincentVerelst added 9 commits December 3, 2024 11:18

Merge branch 'master' into stac_jobdb

3f8e3c3

include initialize_from_df in JobDatabaseInterface

3ffcf3e

some bugs removed from STACAPIJobDatabase

7ae446e

support append for STACAPIJobDatabase

46ab758

first unit tests STACAPIJobDatabase

29fa5d7

more tests for STACAPIJobDatabase

cde482f

added pystac-client as test dependency

efb94f4

changed typing in join_url

269f91c

change more typing not compatible with python3.8

949e7a9

VincentVerelst marked this pull request as ready for review December 5, 2024 17:35

VincentVerelst requested a review from soxofaan December 5, 2024 17:35

soxofaan reviewed Dec 6, 2024

soxofaan mentioned this pull request Dec 6, 2024

Remove read from JobDatabaseInterface #680

Closed

soxofaan added a commit that referenced this pull request Dec 6, 2024

Make openeo.extra.job_management a package (instead of module)

a811bff

to prepare for future extensions, e.g. #619

Merge branch 'master' into stac_jobdb

4487ab7

soxofaan reviewed Dec 6, 2024

VincentVerelst self-assigned this Dec 12, 2024

changes from PR review

8bb7f37

VincentVerelst requested a review from soxofaan December 12, 2024 17:46

soxofaan approved these changes Dec 18, 2024

VincentVerelst merged commit 7064cf3 into master Dec 18, 2024
15 checks passed

VincentVerelst deleted the stac_jobdb branch December 18, 2024 15:57

soxofaan Dec 6, 2024

VincentVerelst Dec 12, 2024

VincentVerelst Dec 12, 2024

soxofaan Dec 18, 2024

4 participants
