Skip to content

Ensure that n_embedding_parameters is not None #3947

@ayush1298

Description

@ayush1298

Originally posted by @KennethEnevoldsen in #3837

Code for calculation:

If the model can be loaded from transformers/senetence-transformers then load the model and use below code:

import numpy as np
from transformers import AutoModel

model = AutoModel.from_pretrained(model_name, trust_remote_code=True)
emb = model[0].auto_model.get_input_embeddings()
meta.n_embedding_parameters = int(np.prod(emb.weight.shape))

If the model loaded using CrossEncoder then:

import numpy as np
from sentence_transformers import CrossEncoder

model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L6-v2")
emb = model.model.get_input_embeddings()
n_embedding_parameters = int(np.prod(emb.weight.shape))

List of models for which n_embedding_parameters is to be evaluated is as follows:

  • ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1
  • ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-7B-v1
  • BAAI/bge-visualized-base
  • BAAI/bge-visualized-m3
  • ByteDance/ListConRanker
  • Bytedance/Seed1.6-embedding
  • GritLM/GritLM-8x7B
  • NovaSearch/stella_en_400M_v5
  • QuanSun/EVA02-CLIP-B-16
  • QuanSun/EVA02-CLIP-L-14
  • QuanSun/EVA02-CLIP-bigE-14
  • QuanSun/EVA02-CLIP-bigE-14-plus
  • Salesforce/SFR-Embedding-Code-2B_R
  • Salesforce/blip2-opt-2.7b
  • Salesforce/blip2-opt-6.7b-coco
  • Snowflake/snowflake-arctic-embed-m-long
  • TencentBAC/Conan-embedding-v2
  • TomoroAI/tomoro-colqwen3-embed-4b
  • TomoroAI/tomoro-colqwen3-embed-8b
  • ai-sage/Giga-Embeddings-instruct
  • baseline/bm25s
  • consciousAI/cai-stellaris-text-embeddings
  • deepvk/USER2-base
  • deepvk/USER2-small
  • facebook/SONAR
  • facebook/dinov2-base
  • facebook/dinov2-giant
  • facebook/dinov2-large
  • facebook/dinov2-small
  • facebook/webssl-dino1b-full2b-224
  • facebook/webssl-dino2b-full2b-224
  • facebook/webssl-dino2b-heavy2b-224
  • facebook/webssl-dino2b-light2b-224
  • facebook/webssl-dino300m-full2b-224
  • facebook/webssl-dino3b-full2b-224
  • facebook/webssl-dino3b-heavy2b-224
  • facebook/webssl-dino3b-light2b-224
  • facebook/webssl-mae1b-full2b-224
  • facebook/webssl-mae300m-full2b-224
  • ibm-granite/granite-vision-3.3-2b-embedding
  • intfloat/mmE5-mllama-11b-instruct
  • jinaai/jina-clip-v1
  • jinaai/jina-colbert-v2
  • jinaai/jina-reranker-v2-base-multilingual
  • jxm/cde-small-v1
  • jxm/cde-small-v2
  • kakaobrain/align-base
  • laion/CLIP-ViT-B-16-DataComp.XL-s13B-b90K
  • laion/CLIP-ViT-B-32-DataComp.XL-s13B-b90K
  • laion/CLIP-ViT-B-32-laion2B-s34B-b79K
  • laion/CLIP-ViT-H-14-laion2B-s32B-b79K
  • laion/CLIP-ViT-L-14-DataComp.XL-s13B-b90K
  • laion/CLIP-ViT-L-14-laion2B-s32B-b82K
  • laion/CLIP-ViT-bigG-14-laion2B-39B-b160k
  • laion/CLIP-ViT-g-14-laion2B-s34B-b88K
  • malenia1/ternary-weight-embedding
  • microsoft/LLM2CLIP-Openai-B-16
  • microsoft/LLM2CLIP-Openai-L-14-224
  • microsoft/LLM2CLIP-Openai-L-14-336
  • nomic-ai/colnomic-embed-multimodal-3b
  • nomic-ai/colnomic-embed-multimodal-7b
  • nomic-ai/modernbert-embed-base
  • nomic-ai/nomic-embed-text-v1
  • nomic-ai/nomic-embed-text-v1-ablated
  • nomic-ai/nomic-embed-text-v1-unsupervised
  • nomic-ai/nomic-embed-text-v1.5
  • nomic-ai/nomic-embed-text-v2-moe
  • nomic-ai/nomic-embed-vision-v1.5
  • nyu-visionx/moco-v3-vit-b
  • nyu-visionx/moco-v3-vit-l
  • openai/clip-vit-base-patch16
  • openai/clip-vit-base-patch32
  • openai/clip-vit-large-patch14
  • samaya-ai/promptriever-llama2-7b-v1
  • samaya-ai/promptriever-llama3.1-8b-instruct-v1
  • samaya-ai/promptriever-llama3.1-8b-v1
  • vidore/colSmol-256M
  • vidore/colSmol-500M
  • vidore/colpali-v1.1
  • vidore/colpali-v1.2
  • vidore/colpali-v1.3
  • vidore/colqwen2-v1.0
  • vidore/colqwen2.5-v0.2
  • voyageai/voyage-multimodal-3

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions