Skip to content

Commit 2475b15

Browse files
wangshankunGACLove
andauthored
Dev/seko talk v2.7 (#867)
Co-authored-by: gaclove <peng.gaoc@gmail.com>
1 parent bf7048a commit 2475b15

File tree

86 files changed

+1240
-648
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

86 files changed

+1240
-648
lines changed

configs/seko_talk/5090/seko_talk_5090_bf16.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@
1111
"sample_guide_scale": 1,
1212
"sample_shift": 5,
1313
"enable_cfg": false,
14-
"use_31_block": false,
14+
"use_31_block": true,
1515
"cpu_offload": true,
1616
"offload_granularity": "block",
1717
"offload_ratio": 1,

configs/seko_talk/5090/seko_talk_5090_int8.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@
1111
"sample_guide_scale": 1,
1212
"sample_shift": 5,
1313
"enable_cfg": false,
14-
"use_31_block": false,
14+
"use_31_block": true,
1515
"cpu_offload": true,
1616
"offload_granularity": "block",
1717
"offload_ratio": 1,

configs/seko_talk/5090/seko_talk_5090_int8_8gpu.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
"sample_guide_scale": 1,
1313
"sample_shift": 5,
1414
"enable_cfg": false,
15-
"use_31_block": false,
15+
"use_31_block": true,
1616
"cpu_offload": true,
1717
"offload_granularity": "block",
1818
"offload_ratio": 1,

configs/seko_talk/A800/seko_talk_A800_int8.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
"sample_shift": 5,
1313
"enable_cfg": false,
1414
"cpu_offload": false,
15-
"use_31_block": false,
15+
"use_31_block": true,
1616
"dit_quantized": true,
1717
"dit_quant_scheme": "int8-vllm",
1818
"adapter_quantized": true,

configs/seko_talk/A800/seko_talk_A800_int8_dist_2gpu.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
"sample_shift": 5,
1313
"enable_cfg": false,
1414
"cpu_offload": false,
15-
"use_31_block": false,
15+
"use_31_block": true,
1616
"dit_quantized": true,
1717
"dit_quant_scheme": "int8-vllm",
1818
"adapter_quantized": true,

configs/seko_talk/A800/seko_talk_A800_int8_dist_4gpu.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
"sample_shift": 5,
1313
"enable_cfg": false,
1414
"cpu_offload": false,
15-
"use_31_block": false,
15+
"use_31_block": true,
1616
"dit_quantized": true,
1717
"dit_quant_scheme": "int8-vllm",
1818
"adapter_quantized": true,

configs/seko_talk/A800/seko_talk_A800_int8_dist_8gpu.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
"sample_shift": 5,
1313
"enable_cfg": false,
1414
"cpu_offload": false,
15-
"use_31_block": false,
15+
"use_31_block": true,
1616
"dit_quantized": true,
1717
"dit_quant_scheme": "int8-vllm",
1818
"adapter_quantized": true,

configs/seko_talk/L40s/1gpu/seko_talk_bf16.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@
1111
"sample_guide_scale": 1.0,
1212
"sample_shift": 5,
1313
"enable_cfg": false,
14-
"use_31_block": false,
14+
"use_31_block": true,
1515
"cpu_offload": true,
1616
"offload_granularity": "block",
1717
"offload_ratio": 0.8,

configs/seko_talk/L40s/1gpu/seko_talk_fp8.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@
1111
"sample_guide_scale": 1.0,
1212
"sample_shift": 5,
1313
"enable_cfg": false,
14-
"use_31_block": false,
14+
"use_31_block": true,
1515
"t5_quantized": true,
1616
"t5_quant_scheme": "fp8-q8f",
1717
"dit_quantized": true,

configs/seko_talk/L40s/2gpu/seko_talk_bf16.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@
1111
"sample_guide_scale": 1.0,
1212
"sample_shift": 5,
1313
"enable_cfg": false,
14-
"use_31_block": false,
14+
"use_31_block": true,
1515
"cpu_offload": false,
1616
"t5_cpu_offload": true,
1717
"clip_cpu_offload": true,

0 commit comments

Comments
 (0)