Commit c73f656

update readme and minor fix
1 parent d800cec commit c73f656

3 files changed: +4 -7 lines changed
README.md

Lines changed: 2 additions & 2 deletions
@@ -32,7 +32,7 @@
 <hr style="border: 2px solid gray;"></hr>

 ## Latest Updates
-- [2025.5.21] Models & code of PS3 and VILA-HD are released! We released two PS3 models (`PS3-1.5K-SigLIP` and `PS3-4K-SigLIP`) and two VILA-HD models (`VILA-HD-1.5K-8B-SigLIP` and `VILA-HD-4K-8B-SigLIP`).
+- [2025.6.4] Models & code of PS3 and VILA-HD are released! We released two PS3 models (`PS3-1.5K-SigLIP` and `PS3-4K-SigLIP`) and two VILA-HD models (`VILA-HD-1.5K-8B-SigLIP` and `VILA-HD-4K-8B-SigLIP`), and the corresponding training/inference code are also released.
 - [2025.4.22] Demo of VILA-HD is released! Welcome to give it a try. We are actively improving the model so any feedback is welcome!
 - [2025.4.4] Selected as conference highlight at CVPR 2025. See you in Nashville!
 - [2025.3.24] Initial paper release. Code and weights of PS3 and VILA-HD will be released very soon!
@@ -317,7 +317,7 @@ class PS3VisionModel(PS3PreTrainedModel):

 ## Training

-Coming soon.
+Please see `train/`.

 ## Using PS3 in Downstream MLLMs

train/src/open_clip/factory.py

Lines changed: 1 addition & 1 deletion
@@ -402,7 +402,7 @@ def _convert_ln(m):

 if checkpoint_path:
     logging.info(f'Loading pretrained {model_name} weights ({pretrained}).')
-    load_checkpoint(model, checkpoint_path, strict=False, weights_only=load_weights_only)
+    load_checkpoint(model, checkpoint_path, strict=True, weights_only=load_weights_only)
 else:
     error_str = (
         f'Pretrained weights ({pretrained}) not found for model {model_name}.'
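
Review note: the switch from `strict=False` to `strict=True` changes what happens when checkpoint keys and model keys disagree. Assuming `load_checkpoint` forwards `strict` to PyTorch's `load_state_dict` (not shown in this diff), a mismatch that was previously tolerated would now raise. The sketch below mimics only that key check in plain Python; `load_keys` and the parameter names are hypothetical and not part of this repository.

```python
def load_keys(model_keys, checkpoint_keys, strict=True):
    """Hypothetical stand-in for the key check done by torch.nn.Module.load_state_dict."""
    model_keys, checkpoint_keys = set(model_keys), set(checkpoint_keys)
    # Keys the model expects but the checkpoint does not provide.
    missing = sorted(model_keys - checkpoint_keys)
    # Keys the checkpoint provides but the model does not know.
    unexpected = sorted(checkpoint_keys - model_keys)
    if strict and (missing or unexpected):
        raise RuntimeError(f"missing keys: {missing}; unexpected keys: {unexpected}")
    return missing, unexpected


# Hypothetical parameter names, chosen only for illustration.
model_keys = ["visual.proj", "prompt_proj.fc1.weight"]
checkpoint_keys = ["visual.proj", "prompt_proj.weight"]

# Non-strict loading reports the mismatch but carries on.
missing, unexpected = load_keys(model_keys, checkpoint_keys, strict=False)

# Strict loading turns the same mismatch into an error.
try:
    load_keys(model_keys, checkpoint_keys, strict=True)
    strict_failed = False
except RuntimeError:
    strict_failed = True
```

In practice this means a checkpoint whose layer names differ from the current model definition fails at load time instead of silently leaving some weights at their initial values.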

train/src/open_clip/model.py

Lines changed: 1 addition & 4 deletions
@@ -470,10 +470,7 @@ def __init__(

         self.ps3 = vision_cfg.get("ps3", False) if isinstance(vision_cfg, dict) else vision_cfg.ps3
         if self.ps3:
-            if "ps3_feature_proj" in vision_cfg and vision_cfg["ps3_feature_proj"] == "mlp":
-                self.prompt_proj = Mlp(in_features=self.text.output_dim, hidden_features=self.visual.width, out_features=self.visual.width, norm_layer=nn.LayerNorm)
-            else:
-                self.prompt_proj = nn.Linear(self.text.output_dim, self.visual.width)
+            self.prompt_proj = Mlp(in_features=self.text.output_dim, hidden_features=self.visual.width, out_features=self.visual.width, norm_layer=nn.LayerNorm)

     def lock_image_tower(self, unlocked_groups=0, freeze_bn_stats=False):
         # lock image tower as per LiT - https://arxiv.org/abs/2111.07991
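
Review note: the context line reads `ps3` in a way that works for both a dict and an attribute-style config, while the removed branch used `"ps3_feature_proj" in vision_cfg` and `vision_cfg[...]`, which only work on dict-like objects. A minimal sketch of the difference, assuming a plain dataclass config; `VisionCfg` and `uses_ps3` are hypothetical names introduced for illustration only.

```python
from dataclasses import dataclass


@dataclass
class VisionCfg:
    """Hypothetical config object; only the `ps3` field comes from the diff."""
    ps3: bool = False


def uses_ps3(vision_cfg):
    # Same expression as the context line in the hunk above.
    return vision_cfg.get("ps3", False) if isinstance(vision_cfg, dict) else vision_cfg.ps3


# Works for a dict with the key, a dict without it, and an object config.
flags = [uses_ps3({"ps3": True}), uses_ps3({}), uses_ps3(VisionCfg(ps3=True))]

# A membership test like the one in the removed branch is not supported
# by a plain object such as this dataclass.
try:
    "ps3_feature_proj" in VisionCfg()
    membership_ok = True
except TypeError:
    membership_ok = False
```

With the branch removed, `prompt_proj` is always the `Mlp` variant whenever `ps3` is enabled, so the dict-only lookup is no longer needed.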
