You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
DAOS-18470 rebuild: re-schedule rebuild task after stopped (#17492)
* DAOS-18470 rebuild: re-schedule rebuild task after stopped
To fix some rebuild stop sequence problem for example -
1. kill rank 5, trigger rebuild task A
2. dmg pool rebuild stop
3. kill rank 6 trigger rebuild task B
After 3's rebuild done, rank 5 status keeps as DOWN, but rank 6 is DOWNOUT.
The problem is due to in rebuild task B it actually rebuilt both rank 5 and
rank 6, but when it completes it only sets rank 5's status to DOWNOUT
because rebuild B's task->dst_tgts only with rank 6.
This patch take a temporarily simple method that -
After the rebuild stopped, reschedule original rebuild task with delay
-1, so following's rebuild can be merge with that to a new rebuild with
merged task->dst_tgts (which include both rank 5 and 6 in that case).
In future can consider to refine rebuild task management further.
Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>
0 commit comments