improved retrievals and adaptions to next pyslk release by neumannd · Pull Request #48 · observingClouds/slkspec

neumannd · 2024-12-05T21:05:38Z

includes changes given in https://github.com/florianziemen/slkspec/tree/reactivate-pyslk
adapted to pyslk version 2.x.y
improved recall and retrieval workflow

neumannd · 2024-12-05T23:29:56Z

@observingClouds Can I prevent somehow that tests are generated automatically for all pyslk functions? Setting up all SlkMock functions properly will take a lot of time.

observingClouds · 2024-12-06T11:59:29Z

@neumannd can you point me to some tests? I think I cannot completely follow you. You just need to mock those functions that are actually used within slkspec.

neumannd · 2024-12-10T00:30:15Z

@observingClouds

You just need to mock those functions that are actually used within slkspec.

That's the main issue I have. It is quite time consuming to set up proper mock functions for this purpose. I am aware that this would be the proper proceedure. But, I am lacking time to finished this task in December. Would you merge anyway?

observingClouds · 2024-12-10T18:54:19Z

I have to say no here as I believe it's crucial for all code additions to pass the tests to maintain the integrity of the project.

neumannd · 2025-02-06T07:50:20Z

@observingClouds : The tests succeed now. @doguskbilir Updated the mock functions. Do you have any comments on the current code version?

Additionally, @doguskbilir is working on setting up a pyproject.toml file for the installation of the package. Thus, the old installation method via setup.py could be replaced by it. However, we do this in an extra PR?

observingClouds · 2025-02-08T10:50:43Z

Amazing! Great work! Yes, please do the pyproject.toml update in a separate PR.

observingClouds

Thanks @neumannd and @doguskbilir for this hard work! I really appreciate it. My main comment is about the complexity of _retrieve_item(). Please restructure it so that flake8 does no longer complain about C901. I think with a few extra functions and classes it will become much more readable and help us in the long run to maintain this package better.

slkspec/core.py

observingClouds · 2025-02-08T12:11:13Z

slkspec/core.py

+                """
+                {'SKIPPED': {'SKIPPED_TARGET_EXISTS': ['/arch/bm0146/k204221/iow/INDEX.txt']},
+                    'FILES': {'/arch/bm0146/k204221/iow/INDEX.txt': '/home/k204221/tmp/INDEX.txt'}}
+
+                # dry run
+                {'ENVISAGED': {'ENVISAGED': ['/arch/bm0146/k204221/iow/INDEX.txt']},
+                    'FILES': {'/arch/bm0146/k204221/iow/INDEX.txt': '/home/k204221/tmp/abcdef2/INDEX.txt'}}
+
+                # after successful retrieval
+                {'ENVISAGED': {'ENVISAGED': []}, 'FILES': {'/arch/bm0146/k204221/iow/INDEX.txt':
+                    '/home/k204221/tmp/INDEX.txt'}, 'SUCCESS': {'SUCCESS': ['/arch/bm0146/k204221/iow/INDEX.txt']}}
+
+                {'FAILED': {'FAILED_NOT_CACHED': ['/arch/bm0146/k204221/iow/iow_data5_001.tar']},
+                    'FILES': {'/arch/bm0146/k204221/iow/iow_data5_001.tar': '/home/k204221/tmp/iow_data5_001.tar'}}


This could go into the docstring of a new function?!

I'll check.

This docstring contains possible output of the function call four lines above. The following lines contain multiple if-clauses to react on the output. Therefore, I think it is reasonable to please this docstring here and not into the docstring of the function's header.

slkspec/core.py

neumannd · 2025-02-25T07:14:25Z

@observingClouds The tests do not start running. Any idea what causes this?

neumannd · 2025-02-25T08:54:50Z

@observingClouds There is are for loops in the code in which recalls are started and their status is checked. E.g. one of the for loops runs as long as there are unfinished recalls. Takes some time to implement this. Would it be possible to skip the tests for now and add them later? I know that not the ideal way but at some point there should be a working slkspec again.

florianziemen · 2025-02-25T14:06:06Z

Wouldn't that whole recall/retrieve loop logic be something that would have a good place in pyslk, so any client can use it? As a client, I could simply call retrieve_with_recall, and pyslk could handle the rest.

neumannd · 2025-02-25T19:07:06Z

@florianziemen In principle, I agree with you. But ... ;-) .

This approach is very inefficient when it is done for one or two files. Then, it is better to run the classical retrieval command (or something else).

This approach is very useful when the users want to retrieve many files at once. For the latter purpose we have already a nice script setup with automatic submission of SLURM jobs (https://docs.dkrz.de/doc/datastorage/hsm/retrievals.html#retrieve-more-than-a-handful-of-files).

Additionally, the approach taken here is pseudo-parallelized and hard to debug if something failes. Providing it to the broad community for individual usage might be a bit risky because nobody understands what is going on. We would need to invest much more time to make it mass-compatible.

Therefore, I do not think that it was very useful in pyslk -- even counter-productive if users start using it for getting one file after each other.

However, if this was set up from the scratch with a central thread running (server-like) then this approach might improve the situation of the users. The thread could "accept" newly requested files, put single requests together and organized recalls/retrievals. This approach would match the usage pattern in Python when users work with catalogs. I think that this is too much work in the moment -- particularly with a HSM proxy system at the horizon.

observingClouds · 2025-02-26T07:47:23Z

Hi @neumannd,
I am a bit confused about the tests failing again and needing more work. Were they not passing the other day and close to being finished? I thought only the flake8 issue was remaining.

neumannd · 2025-02-26T08:43:43Z

@observingClouds The tests of Dogus' PR finished. For this PR, the tests already failed on Friday. I think there is an infinite loop when I run the core functionality with the pyslk Mock functions. At least it looks like this when I run the tests locally.

florianziemen · 2025-02-26T10:06:38Z

Additionally, the approach taken here is pseudo-parallelized and hard to debug if something failes. Providing it to the broad community for individual usage might be a bit risky because nobody understands what is going on. We would need to invest much more time to make it mass-compatible.

This does not sound like we'd want to have it in a package like slkspec that @observingClouds provides to everybody without resources for further support.

codecov-commenter · 2025-02-27T07:22:52Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 39.10761% with 232 lines in your changes missing coverage. Please review.

Please upload report for BASE (main@6836bab). Learn more about missing BASE report.

Files with missing lines	Patch %	Lines
slkspec/core.py	39.10%	232 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@           Coverage Diff           @@
##             main      #48   +/-   ##
=======================================
  Coverage        ?   57.94%           
=======================================
  Files           ?        3           
  Lines           ?      585           
  Branches        ?        0           
=======================================
  Hits            ?      339           
  Misses          ?      246           
  Partials        ?        0

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

neumannd · 2025-02-27T08:49:35Z

@observingClouds Mocks updated. All current tests succeed. On paper, I have prepared two additional tests which would automatically test the new code part in more detail. These are tests which I ran manually. I need to add these -- but not this and not the next week. I'd prefer if this PR could be merged already before.

neumannd · 2025-02-27T08:50:39Z

@observingClouds What is this codecov thing? Do I need to care about this?

neumannd · 2025-02-27T15:50:55Z

Additionally, the approach taken here is pseudo-parallelized and hard to debug if something failes. Providing it to the broad community for individual usage might be a bit risky because nobody understands what is going on. We would need to invest much more time to make it mass-compatible.

This does not sound like we'd want to have it in a package like slkspec that @observingClouds provides to everybody without resources for further support.

@florianziemen @observingClouds I formulated it unclear. The process of getting data from tape itself is hard to debug if a tools does the recall and retrieval for the user in a black box. I don't mean that my code in particular has this issue.

After a meeting with Flo, we found two bugs which I will fix today or tomorrow. There is one general thing to discuss:

If a file cannot be recalled or retrieved, should an exception be thrown directly? Or is it OK to try to get all files first, collect "bad files" (recall or retrieval failed) and announce this in the end. Currently, I do the letter. Additionally, I write a list of all failed recalls and retrievals into one file and provide the path to the file in the error message. I think that it reasonable because then the user directly knows about all tapes/files which are problematic.

@observingClouds Would you agree with this procedure?

neumannd · 2025-03-03T13:06:47Z

Open issues add files on tapes with errorstate to list of failed recalls and proper exception in the end of _retrieve_item() fixed. Tests succeeded locally ... .

neumannd · 2025-03-03T13:15:32Z

@observingClouds Tests succeed. Code is running also in real use cases. Do you have any more comments?

neumannd · 2025-03-03T21:51:42Z

@observingClouds @florianziemen pyslk is available at PyPI now -- starting with version 2.2.9. Therefore, I updated the requirement line for pyslk in the pyproject.toml. pyslk 2.2.10 is the current release and contains a few changes in the documentation. It will be installed at Levante tomorrow.

observingClouds · 2025-03-07T07:41:28Z

slkspec/core.py

            return self._file
        return self._url

+    # flake8: noqa: C901


@neumannd can we remove this now that the function got a bit simplified?

neumannd · 2025-03-10T20:13:26Z

@observingClouds Is there anything which needs to be adapted from your point of view?

observingClouds · 2025-03-12T06:28:31Z

@neumannd, my only remaining comment is to get rid of # flake8: noqa: C901

neumannd · 2025-03-12T10:29:08Z

@observingClouds Removed it and updated the code.

neumannd · 2025-03-13T12:59:25Z

@observingClouds adapted CHANGELOG.md

observingClouds · 2025-03-13T13:14:04Z

It is done!! 🎉 Thanks for being patient with all my comments!

observingClouds · 2025-03-13T13:14:52Z

If you like I can immediately do the v0.0.3 release

neumannd · 2025-03-13T13:15:51Z

If you like I can immediately do the v0.0.3 release

@observingClouds Sounds good!

neumannd added the enhancement New feature or request label Dec 5, 2024

neumannd assigned observingClouds Dec 5, 2024

observingClouds self-requested a review February 8, 2025 11:46

observingClouds reviewed Feb 8, 2025

View reviewed changes

observingClouds reviewed Feb 10, 2025

View reviewed changes

slkspec/core.py Show resolved Hide resolved

doguskbilir mentioned this pull request Feb 17, 2025

Migrate to pyproject.toml #49

Merged

observingClouds reviewed Mar 7, 2025

View reviewed changes

neumannd added 24 commits March 13, 2025 13:56

fixed order recall retrieve

090dd40

updated pyslk Mock

980e990

fixed issues core.py

c084d3a

updated pre-commit hooks

22e85e2

black py310 linting done

00e3f0b

updated pre-commit hooks

a72a375

minor dcstring change conftest

d3acd9b

minor dcstring change conftest

82e7279

check preconfig hooks

d988dae

re-run pre-commit hooks for core.py

73a62d5

fixed issues in core.py

ce4f13d

corrected core.py

ec613eb

updated core and tests

34831a6

updated core and tests

1dc6a18

updated core and tests

c779097

final correction core.py

9236be6

fixed #9; fixed wrong counting of not-recalled files

3556132

throw exception when not all files retrieved; fixed #8

71dc9e0

throw exception when not all files retrieved; fixed #8

ec0891b

fixed issue in check of completeness of porcess

a90eb03

changed pyslk dependency: pyslk is not at PyPI

6489cc0

reduced complexity of core.py

ebd05d8

fixed minor issue in core.py

648cfc2

updated changelog

470c97f

observingClouds merged commit a52124a into observingClouds:main Mar 13, 2025
4 checks passed

neumannd mentioned this pull request Mar 13, 2025

Chown for folder permissions #30

Closed

Comments

Conversation

neumannd commented Dec 5, 2024

Uh oh!

neumannd commented Dec 5, 2024

Uh oh!

observingClouds commented Dec 6, 2024

Uh oh!

neumannd commented Dec 10, 2024

Uh oh!

observingClouds commented Dec 10, 2024

Uh oh!

neumannd commented Feb 6, 2025

Uh oh!

observingClouds commented Feb 8, 2025

Uh oh!

observingClouds left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

observingClouds Feb 8, 2025

Choose a reason for hiding this comment

Uh oh!

neumannd Feb 10, 2025

Choose a reason for hiding this comment

Uh oh!

neumannd Feb 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

neumannd commented Feb 25, 2025

Uh oh!

neumannd commented Feb 25, 2025

Uh oh!

florianziemen commented Feb 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

neumannd commented Feb 25, 2025

Uh oh!

observingClouds commented Feb 26, 2025

Uh oh!

neumannd commented Feb 26, 2025

Uh oh!

florianziemen commented Feb 26, 2025

Uh oh!

codecov-commenter commented Feb 27, 2025

Codecov Report

Uh oh!

neumannd commented Feb 27, 2025

Uh oh!

neumannd commented Feb 27, 2025

Uh oh!

neumannd commented Feb 27, 2025

Uh oh!

neumannd commented Mar 3, 2025

Uh oh!

neumannd commented Mar 3, 2025

Uh oh!

neumannd commented Mar 3, 2025

Uh oh!

observingClouds Mar 7, 2025

Choose a reason for hiding this comment

Uh oh!

neumannd commented Mar 10, 2025

Uh oh!

observingClouds commented Mar 12, 2025

Uh oh!

neumannd commented Mar 12, 2025

Uh oh!

neumannd commented Mar 13, 2025

Uh oh!

Uh oh!

observingClouds commented Mar 13, 2025

Uh oh!

observingClouds commented Mar 13, 2025

Uh oh!

neumannd commented Mar 13, 2025

florianziemen commented Feb 25, 2025 •

edited

Loading