Skip to content

Commit 9a244d5

Browse files
committed
bugfix, system content loss mask
1 parent 906ffe0 commit 9a244d5

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

mftcoder_accelerate/src/data/preprocess_data.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -224,7 +224,7 @@ def _tokenize_fields(self, data, data_type):
224224
sys_content_ids = self.pure_encode(system_marker + content_format(chat[0][CONTENT_COL]))
225225
chat = chat[1:]
226226
input_ids += sys_content_ids
227-
loss_mask += [1] * len(sys_content_ids)
227+
loss_mask += [0] * len(sys_content_ids)
228228

229229
for i, r in enumerate(chat):
230230
role = r[ROLE_COL]

0 commit comments

Comments
 (0)