Skip to content

Commit 8e5db1c

Browse files
committed
fix bug
1 parent 369d259 commit 8e5db1c

File tree

2 files changed

+4
-8
lines changed

2 files changed

+4
-8
lines changed

core/trainers/framework/runner.py

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -212,11 +212,9 @@ def _executor_dataloader_train(self, model_dict, context):
212212
if context["fleet_mode"].upper() == "PS":
213213
train_prog = context["model"][model_dict["name"]][
214214
"main_program"]
215-
print("condition 1")
216215
else:
217216
train_prog = context["model"][model_dict["name"]][
218217
"default_main_program"]
219-
print("condition 2")
220218
startup_prog = context["model"][model_dict["name"]][
221219
"startup_program"]
222220
with fluid.program_guard(train_prog, startup_prog):

models/rank/dnn/config.yaml

Lines changed: 4 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -114,15 +114,13 @@ runner:
114114
print_interval: 1
115115
phases: [phase1]
116116

117-
- name: local_ps_train
118-
class: local_cluster_train
117+
- name: single_multi_gpu_train
118+
class: train
119119
# num of epochs
120120
epochs: 1
121121
# device to run training or infer
122-
device: cpu
123-
selected_gpus: "0" # 选择多卡执行训练
124-
work_num: 1
125-
server_num: 1
122+
device: gpu
123+
selected_gpus: "0,1" # 选择多卡执行训练
126124
save_checkpoint_interval: 1 # save model interval of epochs
127125
save_inference_interval: 4 # save inference
128126
save_step_interval: 1

0 commit comments

Comments
 (0)