Skip to content

Commit 80af3da

Browse files
authored
Merge pull request #885 from roboflow/florence2_speedups
merge lora into base model for 3x speedup
2 parents 539fd22 + e3f50c5 commit 80af3da

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

inference/models/transformers/transformers.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -252,6 +252,8 @@ def initialize_model(self):
252252
.to(self.dtype)
253253
)
254254

255+
self.model.merge_and_unload()
256+
255257
self.processor = self.processor_class.from_pretrained(
256258
model_load_id, revision=revision, cache_dir=cache_dir, token=token
257259
)

0 commit comments

Comments
 (0)