Conversation

@irar2 irar2 (Contributor) commented Jan 12, 2026

This PR adds the ability to collect model information from the /v1/models API and store it in the endpoint's attributes.

Closes #466

Signed-off-by: irar2 <[email protected]>
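
For context, a typical /v1/models response follows the OpenAI list format (id/object/created/owned_by); the parent field is the vLLM-specific extension discussed in the review thread below, reported on LoRA adapters. Model names and timestamps here are illustrative only:

```json
{
  "object": "list",
  "data": [
    {
      "id": "base-model",
      "object": "model",
      "created": 1736640000,
      "owned_by": "vllm"
    },
    {
      "id": "my-lora-adapter",
      "object": "model",
      "created": 1736640000,
      "owned_by": "vllm",
      "parent": "base-model"
    }
  ]
}
```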
```go
// ModelInfo defines the model data returned from the /v1/models API.
type ModelInfo struct {
	ID     string `json:"id"`
	Parent string `json:"parent,omitempty"`
}
```
Collaborator:
The parent field is not part of the OpenAI standard; it is specific to vLLM and might not work with other model servers. I also don't think it's used (or should be used) anywhere. I recommend removing this field.

OpenAI standard here:
https://platform.openai.com/docs/api-reference/models/list

Collaborator:

A few comments:

  • If not present, omitempty kicks in, so I don't see the downside of keeping it (see the marshaling sketch after this comment).
  • For use cases that need the parent information for Base/LoRA relations: if it is not provided by model extraction, one must assume the base model name is provided elsewhere. There is currently no other source of truth...

I think it is fine to rely on vLLM-specific behavior for that:

  1. It can be treated as part of the "contract" (same as the case where other model servers are expected to provide the MSP metrics, even if under a different name).
  2. Configuration of data sources is per EPP, so you can always leave this disabled for other model servers. This is valid usage as long as we use a homogeneous model server in a pool (other code breaks as well when this is not the case...).
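
A minimal, self-contained sketch of the omitempty behavior referenced above (model names are hypothetical):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// ModelInfo mirrors the struct under review.
type ModelInfo struct {
	ID     string `json:"id"`
	Parent string `json:"parent,omitempty"`
}

func main() {
	// Base model from a server that does not report parent (the
	// OpenAI-standard shape): the field is omitted on marshal.
	base, _ := json.Marshal(ModelInfo{ID: "base-model"})
	fmt.Println(string(base)) // {"id":"base-model"}

	// LoRA adapter as reported by vLLM: parent names the base model.
	lora, _ := json.Marshal(ModelInfo{ID: "my-adapter", Parent: "base-model"})
	fmt.Println(string(lora)) // {"id":"my-adapter","parent":"base-model"}

	// On unmarshal, a missing parent simply leaves the field empty.
	var m ModelInfo
	_ = json.Unmarshal([]byte(`{"id":"base-model"}`), &m)
	fmt.Printf("%q\n", m.Parent) // ""
}
```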

@vMaroon vMaroon requested a review from nirrozenbaum January 13, 2026 10:19
@elevran elevran (Collaborator) commented Jan 14, 2026

/hold
This should go in post-v0.5.

@github-actions github-actions bot added the hold label Jan 14, 2026
@elevran elevran added this to the v0.6 milestone Jan 22, 2026
@elevran elevran moved this to In review in llm-d-inference-scheduler Jan 22, 2026
@elevran elevran removed the hold label Jan 26, 2026
```go
}

// NewModelExtractor returns a new model extractor.
func NewModelExtractor() (*ModelExtractor, error) {
```
Collaborator:
nit: at least in theory, the plugin could have a name...

Contributor Author (irar2):
What do you mean?

Collaborator:
ModelExtractor is a plugin, and a plugin has a type and an optional name. The code does not support setting a plugin name, and it should.
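
A hypothetical sketch of what this suggestion could look like; the type string and accessor names are assumptions, not the repository's actual plugin interface:

```go
package extractor

// ModelExtractorType is an assumed plugin type string, for illustration only.
const ModelExtractorType = "model-extractor"

// ModelExtractor carries a fixed plugin type and an optional instance name.
type ModelExtractor struct {
	name string
}

// NewModelExtractor accepts an optional name; an empty name falls back
// to the plugin type, so existing configurations keep working.
func NewModelExtractor(name string) (*ModelExtractor, error) {
	if name == "" {
		name = ModelExtractorType
	}
	return &ModelExtractor{name: name}, nil
}

// Type returns the plugin type.
func (m *ModelExtractor) Type() string { return ModelExtractorType }

// Name returns the configured plugin instance name.
func (m *ModelExtractor) Name() string { return m.name }
```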

```go
	}
}

ds := http.NewHTTPDataSource(cfg.Scheme, cfg.Path, cfg.InsecureSkipVerify, ModelsDataSourceType,
```
Collaborator:
Q: does NewHTTPDataSource validate the scheme?

Contributor Author (irar2):
No, there is only a check for whether it's https.

Collaborator:
Since we use the scheme passed in by the user, it should at least be sanitized to ensure it's one of a known set of acceptable values (e.g., "http" and "https"). This can be done in this PR, or scheme validation can be added to the HTTPDataSource separately.
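
A sketch of the suggested sanitization; the function name and error wording are assumptions, not part of the PR:

```go
package datasource

import (
	"fmt"
	"strings"
)

// validateScheme restricts a user-supplied URL scheme to a known set
// before it is handed to the HTTP data source.
func validateScheme(scheme string) (string, error) {
	s := strings.ToLower(strings.TrimSpace(scheme))
	switch s {
	case "http", "https":
		return s, nil
	default:
		return "", fmt.Errorf("unsupported scheme %q: expected \"http\" or \"https\"", scheme)
	}
}
```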

@elevran elevran (Collaborator) commented Feb 3, 2026

/lgtm
/approve
/hold

Overall looks good. Minor comments left, so placing a hold. Leaving it to your discretion whether to amend, or to cancel the hold and merge as-is.

@github-actions github-actions bot added the hold and lgtm labels Feb 3, 2026
github-actions bot previously approved these changes Feb 3, 2026

Labels

hold, lgtm ("Looks good to me", indicates that a PR is ready to be merged)

Projects

llm-d-inference-scheduler: In review

Development

Successfully merging this pull request may close these issues:

Enable collection of configured / loaded models in each inference serving endpoint

3 participants