Skip to content
This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Commit 43dfdc7

Browse files
committed
5/n torchchat init
1 parent cbadc92 commit 43dfdc7

File tree

2 files changed

+3
-2
lines changed

2 files changed

+3
-2
lines changed

torchchat/model.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -438,7 +438,7 @@ def build_model(self) -> nn.Module:
438438
return recipe.fusion_class(**modules)
439439

440440
def _replace_know_params(self, params):
441-
patterns = {"QuickGELUActivation()": QuickGELUActivation(), "False": False, "True": True}
441+
patterns = {"QuickGELUActivation()": QuickGELUActivation()}
442442
for key, value in params.items():
443443
if isinstance(value, Hashable) and value in patterns:
444444
params[key] = patterns[value]

torchchat/model_params/llava-1.5.json

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,6 @@
11
{
22
"model_type": "llava",
3+
"use_tiktoken": true,
34
"encoder": {
45
"tile_size": 336,
56
"patch_size": 14,
@@ -9,7 +10,7 @@
910
"out_indices": [
1011
23
1112
],
12-
"output_cls_projection": "False",
13+
"output_cls_projection": false,
1314
"max_num_tiles": 1,
1415
"in_channels": 3,
1516
"intermediate_act": "QuickGELUActivation()"

0 commit comments

Comments
 (0)