Commit 32b2148

tested after rebasing, fix importance sampling bug
1 parent 3746f73 commit 32b2148

4 files changed: +4 -3 lines changed

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
 pandas>=1.4.1
 sentencepiece
-colossalai==0.4.7
+colossalai>=0.4.7
 prompt_toolkit

applications/ColossalChat/requirements.txt

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-transformers==4.39.3
+transformers>=4.39.3
 tqdm
 datasets==2.14.7
 loralib
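
Both requirements changes in this commit relax an exact pin (`==`) to a minimum version (`>=`). The sketch below illustrates the practical difference. `parse` and `satisfies` are hypothetical helpers that handle only plain dotted numeric versions; real installers follow the full version specifier rules, which this does not attempt to reproduce.

```python
def parse(version: str) -> tuple:
    """Turn '4.39.3' into (4, 39, 3). Plain numeric versions only."""
    return tuple(int(part) for part in version.split("."))


def satisfies(installed: str, spec: str) -> bool:
    """Check an installed version against an '==X' or '>=X' specifier.

    Hypothetical helper for illustration; not how pip is implemented.
    """
    if spec.startswith("=="):
        return parse(installed) == parse(spec[2:])
    if spec.startswith(">="):
        return parse(installed) >= parse(spec[2:])
    raise ValueError(f"unsupported specifier: {spec}")


# The old exact pin rejects any newer release; the new lower bound accepts it.
print(satisfies("4.40.0", "==4.39.3"))  # False
print(satisfies("4.40.0", ">=4.39.3"))  # True
# Older releases are still rejected by the lower bound.
print(satisfies("0.4.6", ">=0.4.7"))    # False
```

The trade-off is that a lower bound admits future releases that were not tested against this code, whereas an exact pin blocks them.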

applications/ColossalChat/rl_example.py

Lines changed: 1 addition & 1 deletion
@@ -152,7 +152,7 @@
 "-ei",
 "--eval-interval",
 type=int,
-default=100,
+default=-1,
 help="Interval for evaluation. Evaluate every ei training steps.",
 )
 parser.add_argument(
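
The hunk above changes the default of `--eval-interval` from 100 to -1. The diff does not show how the training loop consumes this value, so the sketch below assumes that a non-positive interval means "never evaluate". `build_parser` and `should_evaluate` are hypothetical names introduced for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Rebuild only the argument touched by this commit."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "-ei",
        "--eval-interval",
        type=int,
        default=-1,
        help="Interval for evaluation. Evaluate every ei training steps.",
    )
    return parser


def should_evaluate(step: int, eval_interval: int) -> bool:
    """Hypothetical check. Assumption: a non-positive interval disables evaluation."""
    return eval_interval > 0 and step % eval_interval == 0


# With no flag given, the new default applies and evaluation never triggers.
args = build_parser().parse_args([])
print(args.eval_interval)                        # -1
print(should_evaluate(100, args.eval_interval))  # False

# Passing the flag explicitly opts back in.
args = build_parser().parse_args(["-ei", "50"])
print(should_evaluate(100, args.eval_interval))  # True
```

Under this assumption, evaluation becomes opt-in: users who relied on the previous default would need to pass `-ei 100` to get the old behavior.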

colossalai/shardformer/modeling/qwen3.py

Lines changed: 1 addition & 0 deletions
@@ -273,6 +273,7 @@ def qwen3_for_causal_lm_forward(
 hidden_states: Optional[torch.FloatTensor] = None,
 stage_index: Optional[List[int]] = None,
 shard_config: ShardConfig = None,
+**kwargs
 ):
 r"""
 Args:
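
Adding `**kwargs` to a function signature lets it accept keyword arguments it does not declare, instead of raising `TypeError`. The commit does not say which caller passes the extra arguments, so the sketch below uses two hypothetical stand-in functions and a made-up `extra_flag` argument to show the general effect.

```python
from typing import Any, Dict, Optional


def forward_strict(hidden_states: Optional[list] = None) -> Dict[str, Any]:
    """Stand-in for a signature without **kwargs."""
    return {"hidden_states": hidden_states}


def forward_tolerant(hidden_states: Optional[list] = None, **kwargs: Any) -> Dict[str, Any]:
    """Stand-in for a signature with **kwargs: extras are collected and ignored."""
    return {"hidden_states": hidden_states, "ignored": sorted(kwargs)}


# extra_flag is hypothetical; it stands for any argument the callee does not declare.
call_args = {"hidden_states": [0.0], "extra_flag": True}

try:
    forward_strict(**call_args)
except TypeError as exc:
    print("strict:", type(exc).__name__)  # strict: TypeError

print("tolerant:", forward_tolerant(**call_args))
```

One consequence of this pattern is that misspelled keyword arguments are also silently swallowed, so it trades strictness for compatibility with callers that pass additional arguments.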
