Skip to content

Commit 0ab0243

Browse files
OjaswinMharshimogalapalli
authored andcommitted
ext4: avoid journaling sb update on error if journal is destroying
commit ce2f26e upstream. Presently we always BUG_ON if trying to start a transaction on a journal marked with JBD2_UNMOUNT, since this should never happen. However, while ltp running stress tests, it was observed that in case of some error handling paths, it is possible for update_super_work to start a transaction after the journal is destroyed eg: (umount) ext4_kill_sb kill_block_super generic_shutdown_super sync_filesystem /* commits all txns */ evict_inodes /* might start a new txn */ ext4_put_super flush_work(&sbi->s_sb_upd_work) /* flush the workqueue */ jbd2_journal_destroy journal_kill_thread journal->j_flags |= JBD2_UNMOUNT; jbd2_journal_commit_transaction jbd2_journal_get_descriptor_buffer jbd2_journal_bmap ext4_journal_bmap ext4_map_blocks ... ext4_inode_error ext4_handle_error schedule_work(&sbi->s_sb_upd_work) /* work queue kicks in */ update_super_work jbd2_journal_start start_this_handle BUG_ON(journal->j_flags & JBD2_UNMOUNT) Hence, introduce a new mount flag to indicate journal is destroying and only do a journaled (and deferred) update of sb if this flag is not set. Otherwise, just fallback to an un-journaled commit. Further, in the journal destroy path, we have the following sequence: 1. Set mount flag indicating journal is destroying 2. force a commit and wait for it 3. flush pending sb updates This sequence is important as it ensures that, after this point, there is no sb update that might be journaled so it is safe to update the sb outside the journal. (To avoid race discussed in 2d01ddc) Also, we don't need a similar check in ext4_grp_locked_error since it is only called from mballoc and AFAICT it would be always valid to schedule work here. Fixes: 2d01ddc ("ext4: save error info to sb through journal if available") Reported-by: Mahesh Kumar <[email protected]> Signed-off-by: Ojaswin Mujoo <[email protected]> Reviewed-by: Jan Kara <[email protected]> Link: https://patch.msgid.link/9613c465d6ff00cd315602f99283d5f24018c3f7.1742279837.git.ojaswin@linux.ibm.com Signed-off-by: Theodore Ts'o <[email protected]> Signed-off-by: Harshit Mogalapalli <[email protected]> Signed-off-by: Greg Kroah-Hartman <[email protected]> (cherry picked from commit eddca44ddf810e27f0c96913aa3cc92ebd679ddb) Signed-off-by: Harshit Mogalapalli <[email protected]>
1 parent b2cc32a commit 0ab0243

File tree

3 files changed

+25
-9
lines changed

3 files changed

+25
-9
lines changed

fs/ext4/ext4.h

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1827,7 +1827,8 @@ static inline int ext4_valid_inum(struct super_block *sb, unsigned long ino)
18271827
*/
18281828
enum {
18291829
EXT4_MF_MNTDIR_SAMPLED,
1830-
EXT4_MF_FC_INELIGIBLE /* Fast commit ineligible */
1830+
EXT4_MF_FC_INELIGIBLE, /* Fast commit ineligible */
1831+
EXT4_MF_JOURNAL_DESTROY /* Journal is in process of destroying */
18311832
};
18321833

18331834
static inline void ext4_set_mount_flag(struct super_block *sb, int bit)

fs/ext4/ext4_jbd2.h

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -521,6 +521,21 @@ static inline int ext4_journal_destroy(struct ext4_sb_info *sbi, journal_t *jour
521521
{
522522
int err = 0;
523523

524+
/*
525+
* At this point only two things can be operating on the journal.
526+
* JBD2 thread performing transaction commit and s_sb_upd_work
527+
* issuing sb update through the journal. Once we set
528+
* EXT4_JOURNAL_DESTROY, new ext4_handle_error() calls will not
529+
* queue s_sb_upd_work and ext4_force_commit() makes sure any
530+
* ext4_handle_error() calls from the running transaction commit are
531+
* finished. Hence no new s_sb_upd_work can be queued after we
532+
* flush it here.
533+
*/
534+
ext4_set_mount_flag(sbi->s_sb, EXT4_MF_JOURNAL_DESTROY);
535+
536+
ext4_force_commit(sbi->s_sb);
537+
flush_work(&sbi->s_sb_upd_work);
538+
524539
err = jbd2_journal_destroy(journal);
525540
sbi->s_journal = NULL;
526541

fs/ext4/super.c

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -719,9 +719,13 @@ static void ext4_handle_error(struct super_block *sb, bool force_ro, int error,
719719
* In case the fs should keep running, we need to writeout
720720
* superblock through the journal. Due to lock ordering
721721
* constraints, it may not be safe to do it right here so we
722-
* defer superblock flushing to a workqueue.
722+
* defer superblock flushing to a workqueue. We just need to be
723+
* careful when the journal is already shutting down. If we get
724+
* here in that case, just update the sb directly as the last
725+
* transaction won't commit anyway.
723726
*/
724-
if (continue_fs && journal)
727+
if (continue_fs && journal &&
728+
!ext4_test_mount_flag(sb, EXT4_MF_JOURNAL_DESTROY))
725729
schedule_work(&EXT4_SB(sb)->s_sb_upd_work);
726730
else
727731
ext4_commit_super(sb);
@@ -1306,7 +1310,6 @@ static void ext4_put_super(struct super_block *sb)
13061310
ext4_unregister_li_request(sb);
13071311
ext4_quotas_off(sb, EXT4_MAXQUOTAS);
13081312

1309-
flush_work(&sbi->s_sb_upd_work);
13101313
destroy_workqueue(sbi->rsv_conversion_wq);
13111314
ext4_release_orphan_info(sb);
13121315

@@ -1316,7 +1319,8 @@ static void ext4_put_super(struct super_block *sb)
13161319
if ((err < 0) && !aborted) {
13171320
ext4_abort(sb, -err, "Couldn't clean up the journal");
13181321
}
1319-
}
1322+
} else
1323+
flush_work(&sbi->s_sb_upd_work);
13201324

13211325
ext4_es_unregister_shrinker(sbi);
13221326
timer_shutdown_sync(&sbi->s_err_report);
@@ -4984,8 +4988,6 @@ static int ext4_load_and_init_journal(struct super_block *sb,
49844988
return 0;
49854989

49864990
out:
4987-
/* flush s_sb_upd_work before destroying the journal. */
4988-
flush_work(&sbi->s_sb_upd_work);
49894991
ext4_journal_destroy(sbi, sbi->s_journal);
49904992
return -EINVAL;
49914993
}
@@ -5676,8 +5678,6 @@ failed_mount8: __maybe_unused
56765678
sbi->s_ea_block_cache = NULL;
56775679

56785680
if (sbi->s_journal) {
5679-
/* flush s_sb_upd_work before journal destroy. */
5680-
flush_work(&sbi->s_sb_upd_work);
56815681
ext4_journal_destroy(sbi, sbi->s_journal);
56825682
}
56835683
failed_mount3a:

0 commit comments

Comments
 (0)