Feat: Support dynamic ONNX model loading from HF (maintaining zero-dependency) by Kagandi · Pull Request #46 · PrithivirajDamodaran/FlashRank

Kagandi · 2025-12-11T19:44:28Z

Description:

I’ve read your comments regarding the design philosophy of Flashrank—specifically the goal to keep the library curated, lightweight, and focused on "tiny and performant" models. I fully agree that Flashrank should not become a heavy wrapper for massive models.

However, I believe this PR strengthens that mission while improving maintainability:

Decoupling Code from Models: Currently, adding a new "tiny/performant" model requires a code change and a release by you. This PR allows the community to experiment with new lightweight ONNX models immediately.

Zero Dependencies: This uses the existing architecture and does not add new dependencies.

Strictly Lightweight (Proposed Safeguard): To ensure this feature doesn't violate the "Flashrank" ethos, I can implement a hard file-size limit (e.g., < 200MB) for custom loaded models. This guarantees that users cannot load massive/slow models, keeping the library true to its name while offering flexibility.

This change allows the library to remain "light and fast" while offloading the burden of constant model updates from the maintainers.

Changes

refactor

Extracted file download logic into a dedicated download_file() function to eliminate code duplication
Updated model preparation workflow to use the new reusable download function
Enhanced support for downloading both model archives and individual files from HuggingFace Hub

feat

Added support for downloading models and required tokenizer files from HuggingFace Hub
Implemented a fallback mechanism to download from HuggingFace when models are not found in the local model map
Added helper function _download_hf_model_files() to fetch models using HuggingFace URLs

fix

Added a check for token_type_ids presence in ONNX model inputs before using them
Improved robustness of model loading for different ONNX model architectures

chore

Bumped version to 0.3 in setup.py
Added hf_model_url for HuggingFace models

…el file download process

…mats

raphaeleduardo42 · 2026-01-06T11:11:48Z

Worked as intended with onnx-community/bge-reranker-v2-m3-ONNX

Kagandi added 7 commits December 10, 2025 15:27

Add check for token_type_ids presence in ONNX model inputs

b317079

Add support for downloading models and required files from Hugging Face

815b1fc

Bump version to 0.3 in setup.py

745c89b

refactor: file download logic into a reusable function and update mod…

b3de97d

…el file download process

docs: update README to include Hugging Face model support and examples

f9a2b71

docs: update README to describe the model as a fast modern model

134be31

fix: update model download logic to include additional ONNX model for…

29c336b

…mats

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat: Support dynamic ONNX model loading from HF (maintaining zero-dependency)#46

Feat: Support dynamic ONNX model loading from HF (maintaining zero-dependency)#46
Kagandi wants to merge 7 commits intoPrithivirajDamodaran:mainfrom
Kagandi:main

Kagandi commented Dec 11, 2025

Uh oh!

raphaeleduardo42 commented Jan 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Kagandi commented Dec 11, 2025

Description:

Changes

refactor

feat

fix

chore

Uh oh!

raphaeleduardo42 commented Jan 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants