@Nasf-Fan Nasf-Fan (Contributor) commented Apr 18, 2025

Otherwise, it may mislead a subsequent crt_get_filtered_grp_rank_list() call into treating the IV root as absent from the rank list, which then fails the related IV operation with -DER_NONEXIST.

This may not be a perfect solution for the current cart IV logic, but as a temporary measure it keeps CR workable when some ranks are dead.

Add a new test case to verify this corner case.

Test-tag: pr test_daos_cat_recov_core

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

@github-actions

Ticket title is 'Aurora daos_ops: Pool rebuild is stuck with error (op:Reclaim fail)'
Status is 'In Progress'
Labels: 'ALCF,aurora_post_at,scrubbed_2.8'
https://daosio.atlassian.net/browse/DAOS-17135

@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-17135_1 branch from 498e164 to 25e41d1 Compare April 18, 2025 08:27
When the -f option is specified for the "dmg check repair" command, the
user/admin can handle subsequent inconsistencies of the same type with
the same action. In that scenario, specifying the inconsistency-class is
clearer than specifying the seq-num. The SYNOPSIS then becomes:

dmg [OPTIONS] check repair [repair-OPTIONS] <seq-num|inconsistency-class>
	<interact-opt|action>
...
[repair command options]
	-f, --for-all	Take the same action for all (potential) inconsistencies
			with the same class. If this option is specified, then
			"inconsistency-class" and "action" will be accepted for
			subsequent parameters; otherwise, "seq-num" and "interact-opt"
			will be used.

Enhance the test logic so that it works after c9745d8 landed.

Test-tag: test_daos_cat_recov_core

Signed-off-by: Fan Yong <[email protected]>
@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-17422 branch from cfdd691 to e92c895 Compare April 18, 2025 09:52
Otherwise, it may mislead a subsequent crt_get_filtered_grp_rank_list()
call into treating the IV root as absent from the rank list, which then
fails the related IV operation with -DER_NONEXIST.

This may not be a perfect solution for the current cart IV logic, but as
a temporary measure it keeps CR workable when some ranks are dead.

Add a new test case to verify this corner case.

Test-tag: pr test_daos_cat_recov_core

Signed-off-by: Fan Yong <[email protected]>
@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-17135_1 branch from 25e41d1 to 3828884 Compare April 18, 2025 09:54
@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-17422 branch from e92c895 to 3814730 Compare May 7, 2025 01:02
@Nasf-Fan Nasf-Fan (Contributor, Author) commented May 9, 2025

Replaced by #16357

@Nasf-Fan Nasf-Fan closed this May 9, 2025
@Nasf-Fan Nasf-Fan deleted the Nasf-Fan/DAOS-17135_1 branch May 29, 2025 06:19