Input data backup from betzy #712
Replies: 21 comments 54 replies
-
One way to identify missing files is to run tests with One current issue with this method on Betzy is NorESMhub/ccs_config_noresm#70. |
-
I have been doing step 6 semi-sporadically, @MichaelSchulzMETNO. Do you want a more regular update schedule?
-
No, I think it only matters which project they are stored under (that is, whom they are accounted to), but yes, the wrong group ownership can be a problem for access.
-
Potential show stopper: |
-
@TomasTorsvik that's interesting. When I try rsync from betzy to nird it always asks for 2FA which is VERY annoying! What trick do you use to circumvent it? |
-
Just for documentation: If someone feels responsible for those, please either adjust permissions (readable for the group |
-
Some very preliminary info regarding the amount of input data files on Is this high number expected?
SOURCE_PATH_EXCLUDE_LIST = [".svn/*", "*/.svn/*", "*.lock"]
Please suggest more file patterns for removal...
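For anyone suggesting more patterns, here is a minimal sketch of how such an exclude list could be applied to relative paths. It assumes shell-style matching via Python's `fnmatch`, where `*` also matches `/`; the actual backup script may match differently. `is_excluded` and `filter_paths` are hypothetical helper names, not taken from the script.

```python
from fnmatch import fnmatchcase

# Patterns quoted in this thread; further patterns would be appended here.
SOURCE_PATH_EXCLUDE_LIST = [".svn/*", "*/.svn/*", "*.lock"]


def is_excluded(rel_path, patterns=SOURCE_PATH_EXCLUDE_LIST):
    """Return True if rel_path matches any exclude pattern.

    Assumption: patterns are shell-style globs tested against the path
    relative to the inputdata root (fnmatch lets '*' cross '/').
    """
    return any(fnmatchcase(rel_path, pattern) for pattern in patterns)


def filter_paths(rel_paths, patterns=SOURCE_PATH_EXCLUDE_LIST):
    """Keep only the paths that no exclude pattern matches."""
    return [p for p in rel_paths if not is_excluded(p, patterns)]
```

Trying a candidate pattern against a handful of real paths with `is_excluded` before adding it to the list is a cheap way to check that it does not drop wanted files.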
-
I think it is copied to the research archive though. |
-
Hopefully the last post before copying... The following list is the files below the path Please confirm that these are the files that are supposed to be copied to
This would leave around |
-
Next question: |
-
How about |
-
Just for info: |
-
Next question: |
-
Just for info: |
-
There are two folders in /datalake/NS16001B, where the second is referred to from the www folder: cdl-ns16001b-noresminputdata |
-
The inputdata backup is now reachable via this web address: All data from https://ns9560k.web.sigma2.no/datapeak/inputdata/ should also be there at the new location (except for the empty folder |
-
Is there a separate discussion for Olivia? |
-
Small update: |
-
Can you explore with sigma2 how to do a regular automatic update then? |
-
Just for info: Looking a bit further, the main thing is the directory Please advise what's really needed in the backup. The script can exclude directory structures. |
-
@kjetilaas @maritsandstad can you clarify? |
-
goal: Save betzy NorESM input files to nird /projects/NS9560K/www/inputdata
steps:
1) Identify "NorESM input files" on betzy:/cluster/shared/noresm/inputdata of group owner noresm which are NOT in NCAR copy: /datalake/NS12077K/CESM-input-data
   Exclude some files: svn, lock, etc
2) Rsync those "NOT-in-NCAR NorESM input files" to /projects/NS9560K/www/inputdata
2b) identify problems with permissions - communicate with sigma2 to make all files readable for NS9560K
2c) rsync will only copy new files on www/inputdata
2d) Only the rsync script should add files on www/inputdata
3) Users on betzy can add files to this process by making files and directories owned by group "noresm"
4) Make a repository under NorESMhub with scripts for steps 1+2 @jgriesfeller
5) Run script regularly (tbd) (Jan)
5b) Manually correct files on /projects/NS9560K/www/inputdata on demand with Data management group
6) Update NCAR files now and then (Marit?)
7) Look at input files from other machines, on demand
8) Move eventually www/inputdata to a specific input data project
Please comment @maritsandstad @gold2718 @tylov @jgriesfeller @DirkOlivie @monsieuralok @matsbn @lisesg
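To make the identification step concrete, here is a rough sketch of the selection logic: walk the source tree, skip excluded patterns, skip files already present in the NCAR copy, and optionally keep only files with a given group id. It assumes both directory trees are visible from the machine running it (otherwise one would compare file listings instead). `relative_files`, `files_to_copy` and the `group_id` parameter are hypothetical names for illustration, not the actual script.

```python
import os
from fnmatch import fnmatchcase

# Exclude patterns as quoted elsewhere in this thread.
SOURCE_PATH_EXCLUDE_LIST = [".svn/*", "*/.svn/*", "*.lock"]


def relative_files(root):
    """Yield every regular file below root as a path relative to root."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            yield os.path.relpath(os.path.join(dirpath, name), root)


def files_to_copy(source_root, reference_root, group_id=None,
                  exclude=SOURCE_PATH_EXCLUDE_LIST):
    """List files under source_root that are absent from reference_root.

    Assumptions: files are compared by relative path only (no checksum),
    and group_id, if given, is the numeric gid of the owning group.
    """
    reference = set(relative_files(reference_root))
    selected = []
    for rel in relative_files(source_root):
        if any(fnmatchcase(rel, pattern) for pattern in exclude):
            continue  # svn metadata, lock files, etc.
        if rel in reference:
            continue  # already available in the NCAR copy
        if group_id is not None:
            if os.stat(os.path.join(source_root, rel)).st_gid != group_id:
                continue  # not owned by the requested group
        selected.append(rel)
    return sorted(selected)
```

One possible use is to write the resulting list to a text file and hand it to rsync via `--files-from`, with `--ignore-existing` so that files already on www/inputdata are not overwritten. The numeric gid for a group name can be looked up with `grp.getgrnam(name).gr_gid` on Unix systems.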