[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs #112752

szabosteve · 2024-09-11T14:42:25Z

Overview

This PR expands the Tokenization properties section on the PUT trained models API doc page and the Infer trained model API doc page with the DeBERTa v2 tokenizer reference docs. The updates also effect the Inference processor reference docs.

Preview

PUT trained models - Tokenization properties
Infer trained model
Inference processor

github-actions · 2024-09-11T14:42:36Z

Documentation preview:

✨ Changed pages

elasticsearchmachine · 2024-09-11T14:42:48Z

Pinging @elastic/ml-core (Team:ML)

elasticsearchmachine · 2024-09-11T14:42:48Z

Pinging @elastic/es-docs (Team:Docs)

maxhniebergall

Looks good, thanks Istvan!

overall, I think that the deberta_v2 tokenizer is usable for any ML task, but it doesn't appear in all of the tasks in https://elasticsearch_bk_112752.docs-preview.app.elstc.co/guide/en/elasticsearch/reference/master/inference-processor.html#inference-processor-fill-mask-opt

I also noticed that page doesn't have a secontion for text_similarity, which is the main thing we will be using deberta for. I don't think that needs to be a part of this PR, but we should consider adding that.

docs/reference/ml/ml-shared.asciidoc

Co-authored-by: Max Hniebergall <[email protected]>

…ocs page.

szabosteve · 2024-09-12T08:30:25Z

Hey @maxhniebergall,
I've added DeBERTa to all the ML tasks on the inference pipeline reference doc page.
I also opened an issue to document text_similarity and add it to the page: https://github.com/elastic/search-docs-team/issues/188

maxhniebergall

LGTM

~~Just wondering why some of the new sections in docs/reference/ingest/processors/inference.asciidoc have span and some dont?~~

edit: I see that zero-shot classification doesn't support span https://github.com/elastic/elasticsearch/blob/main/x-pack/plugin/core/src/main/java/org/elasticsearch/xpack/core/ml/inference/trainedmodel/ZeroShotClassificationConfig.java#L145

elasticsearchmachine · 2024-10-07T08:25:14Z

💚 Backport successful

Status	Branch	Result
✅	8.x

…2752) Co-authored-by: Max Hniebergall <[email protected]>

…114203) Co-authored-by: Max Hniebergall <[email protected]>

…2752) Co-authored-by: Max Hniebergall <[email protected]>

[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs.

9bec5d9

szabosteve added >docs General docs changes :ml Machine learning Team:Docs Meta label for docs team v8.16.0 labels Sep 11, 2024

szabosteve requested a review from maxhniebergall September 11, 2024 14:42

elasticsearchmachine added the Team:ML Meta label for the ML team label Sep 11, 2024

[DOCS] Fixes parameter highlighting.

9aee005

mark-vieira added v9.0.0 and removed v8.16.0 labels Sep 11, 2024

maxhniebergall reviewed Sep 11, 2024

View reviewed changes

docs/reference/ml/ml-shared.asciidoc Outdated Show resolved Hide resolved

docs/reference/ml/ml-shared.asciidoc Outdated Show resolved Hide resolved

szabosteve and others added 3 commits September 12, 2024 09:31

Apply suggestions from code review

d2c5cab

Co-authored-by: Max Hniebergall <[email protected]>

[DOCS] Further edits.

1971dd2

[DOCS] Adds DeBERTA to the ML task types on the inference processor d…

4459277

…ocs page.

szabosteve requested a review from maxhniebergall September 12, 2024 08:30

maxhniebergall approved these changes Sep 18, 2024

View reviewed changes

szabosteve added auto-backport Automatically create backport pull requests when merged v8.16.0 labels Oct 7, 2024

szabosteve merged commit 57955cb into elastic:main Oct 7, 2024

szabosteve deleted the deberta-docs-update branch October 7, 2024 08:23

szabosteve mentioned this pull request Oct 7, 2024

[8.x] [DOCS] Adds DeBERTA v2 to the tokenizers list in API docs (#112752) #114203

Merged

szabosteve added a commit to szabosteve/elasticsearch that referenced this pull request Oct 7, 2024

[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs (elastic#11…

8076d54

…2752) Co-authored-by: Max Hniebergall <[email protected]>

elasticsearchmachine pushed a commit that referenced this pull request Oct 7, 2024

[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs (#112752) (#…

bca80f7

…114203) Co-authored-by: Max Hniebergall <[email protected]>

matthewabbott pushed a commit to matthewabbott/elasticsearch that referenced this pull request Oct 10, 2024

[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs (elastic#11…

89dbfe6

…2752) Co-authored-by: Max Hniebergall <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs #112752

[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs #112752

Uh oh!

szabosteve commented Sep 11, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Sep 11, 2024

Uh oh!

elasticsearchmachine commented Sep 11, 2024

Uh oh!

elasticsearchmachine commented Sep 11, 2024

Uh oh!

maxhniebergall left a comment

Uh oh!

Uh oh!

Uh oh!

szabosteve commented Sep 12, 2024

Uh oh!

maxhniebergall left a comment •

edited

Loading

Uh oh!

elasticsearchmachine commented Oct 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs #112752

[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs #112752

Uh oh!

Conversation

szabosteve commented Sep 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Preview

Uh oh!

github-actions bot commented Sep 11, 2024

Uh oh!

elasticsearchmachine commented Sep 11, 2024

Uh oh!

elasticsearchmachine commented Sep 11, 2024

Uh oh!

maxhniebergall left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

szabosteve commented Sep 12, 2024

Uh oh!

maxhniebergall left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

elasticsearchmachine commented Oct 7, 2024

💚 Backport successful

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

szabosteve commented Sep 11, 2024 •

edited

Loading

maxhniebergall left a comment •

edited

Loading