Description
What do you want to do?
- Request a change to existing documentation
- Add new documentation
- Report a technical problem with the documentation
- Other
Tell us about your request.
The `pooling_mode` parameter in the Register Model API documentation (`_ml-commons-plugin/api/model-apis/register-model.md`) currently lists the supported values as `mean`, `mean_sqrt_len`, `max`, `weightedmean`, and `cls`.
A new `lasttoken` pooling mode is being added in ml-commons (opensearch-project/ml-commons#4711) to support decoder-only text embedding models (e.g., Qwen3-Embedding, GPT-style models), where the final non-padding token captures cumulative context through causal attention.
The documentation should be updated to:
- Add `lasttoken` to the list of supported `pooling_mode` values in the `model_config` object table
- Describe that `lasttoken` uses the last non-padding token's embedding, and is useful for decoder-only models where the final token captures cumulative context
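For reviewers who want a concrete picture of the behavior to document, here is a minimal sketch of last-token pooling next to mean pooling. It is an illustration only, not the ml-commons implementation: the function names `last_token_pool` and `mean_pool`, the list-based inputs, and the toy values are all assumptions made for this example.

```python
from typing import List, Sequence


def last_token_pool(
    token_embeddings: Sequence[Sequence[float]],
    attention_mask: Sequence[int],
) -> List[float]:
    """Return the embedding of the last non-padding token (hypothetical helper)."""
    if len(token_embeddings) != len(attention_mask):
        raise ValueError("embeddings and mask must have the same length")
    # Scan from the end so right-padded and left-padded inputs both work.
    for i in range(len(attention_mask) - 1, -1, -1):
        if attention_mask[i] == 1:
            return list(token_embeddings[i])
    raise ValueError("attention mask contains no real tokens")


def mean_pool(
    token_embeddings: Sequence[Sequence[float]],
    attention_mask: Sequence[int],
) -> List[float]:
    """Average the embeddings of all non-padding tokens, for comparison."""
    kept = [e for e, m in zip(token_embeddings, attention_mask) if m == 1]
    if not kept:
        raise ValueError("attention mask contains no real tokens")
    dim = len(kept[0])
    return [sum(e[d] for e in kept) / len(kept) for d in range(dim)]


# Toy input: three real tokens followed by one padding token.
embeddings = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [0.0, 0.0]]
mask = [1, 1, 1, 0]

last = last_token_pool(embeddings, mask)  # embedding of the third token
mean = mean_pool(embeddings, mask)        # average over the three real tokens
```

With causal attention, only the final real token has attended to every earlier token, which is why a decoder-only model's sentence embedding is taken from that position rather than averaged.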
Version: 3.4
What other resources are available?