Skip to content

Commit d048945

Browse files
zhangyi089brauner
authored andcommitted
xfs: reserve blocks for truncating large realtime inode
When unaligned truncate down a big realtime file, xfs_truncate_page() only zeros out the tail EOF block, __xfs_bunmapi() should split the tail written extent and convert the later one that beyond EOF block to unwritten, but it couldn't work as expected now since the reserved block is zero in xfs_setattr_size(), this could expose stale data just after commit '943bc0882ceb ("iomap: don't increase i_size if it's not a write operation")'. If we truncate file that contains a large enough written extent: |< rxext >|< rtext >| ...WWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWWW ^ (new EOF) ^ old EOF Since we only zeros out the tail of the EOF block, and xfs_itruncate_extents()->..->__xfs_bunmapi() unmap the whole ailgned extents, it becomes this state: |< rxext >| ...WWWzWWWWWWWWWWWWW ^ new EOF Then if we do an extending write like this, the blocks in the previous tail extent becomes stale: |< rxext >| ...WWWzSSSSSSSSSSSSS..........WWWWWWWWWWWWWWWWW ^ old EOF ^ append start ^ new EOF Fix this by reserving XFS_DIOSTRAT_SPACE_RES blocks for big realtime inode. Signed-off-by: Zhang Yi <[email protected]> Link: https://lore.kernel.org/r/[email protected] Reviewed-by: Christoph Hellwig <[email protected]> Reviewed-by: Darrick J. Wong <[email protected]> Signed-off-by: Christian Brauner <[email protected]>
1 parent a7ca193 commit d048945

File tree

1 file changed

+14
-1
lines changed

1 file changed

+14
-1
lines changed

fs/xfs/xfs_iops.c

Lines changed: 14 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,8 @@
1717
#include "xfs_da_btree.h"
1818
#include "xfs_attr.h"
1919
#include "xfs_trans.h"
20+
#include "xfs_trans_space.h"
21+
#include "xfs_bmap_btree.h"
2022
#include "xfs_trace.h"
2123
#include "xfs_icache.h"
2224
#include "xfs_symlink.h"
@@ -811,6 +813,7 @@ xfs_setattr_size(
811813
struct xfs_trans *tp;
812814
int error;
813815
uint lock_flags = 0;
816+
uint resblks = 0;
814817
bool did_zeroing = false;
815818

816819
xfs_assert_ilocked(ip, XFS_IOLOCK_EXCL | XFS_MMAPLOCK_EXCL);
@@ -917,7 +920,17 @@ xfs_setattr_size(
917920
return error;
918921
}
919922

920-
error = xfs_trans_alloc(mp, &M_RES(mp)->tr_itruncate, 0, 0, 0, &tp);
923+
/*
924+
* For realtime inode with more than one block rtextsize, we need the
925+
* block reservation for bmap btree block allocations/splits that can
926+
* happen since it could split the tail written extent and convert the
927+
* right beyond EOF one to unwritten.
928+
*/
929+
if (xfs_inode_has_bigrtalloc(ip))
930+
resblks = XFS_DIOSTRAT_SPACE_RES(mp, 0);
931+
932+
error = xfs_trans_alloc(mp, &M_RES(mp)->tr_itruncate, resblks,
933+
0, 0, &tp);
921934
if (error)
922935
return error;
923936

0 commit comments

Comments
 (0)