More V5 pipeline cleanup #43325

Rocketknight1 · 2026-01-16T15:01:51Z

More V5 pipeline cleanup, followup to #43256 and #43306:

feature-extraction renamed to text-embedding (keeping the old name as an alias)
image-feature-extraction renamed to image-embedding (keeping the old name as an alias)
question-answering and visual-question-answering removed
fill-mask removed
Updated the default text-generation and image-text-to-text models
Updated the migration guide to explain all of this!

HuggingFaceDocBuilderDev · 2026-01-16T15:14:57Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

vasqu · 2026-01-16T17:10:59Z

Just commenting before more other langs docs might get broken:

Can you approve [Docs] Fix other lang deprecations of a few pipelines #43292? Don't think I can merge without ✔️ atm
Don't forget the other languages :p

github-actions · 2026-01-16T18:15:10Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: afmoe, aimv2, albert, align, altclip, audio_spectrogram_transformer, autoformer, bamba, bart

Rocketknight1 · 2026-01-16T18:25:53Z

@vasqu I think this should be ready for review now! I chased down the references in the other language docs too

github-actions · 2026-01-16T18:50:47Z

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=43325&sha=3571cb

vasqu

Nice cleanup 🧹 mostly nits, but I think we need to take another look at the tests to properly rename some stuff there

vasqu · 2026-01-19T11:17:32Z

MIGRATION_GUIDE_V5.md

+`Text2TextGenerationPipeline`, as well as the related `SummarizationPipeline` and `TranslationPipeline`, were deprecated and will now be removed. The
+`question-answering` pipeline has also been removed. `pipeline` classes are intended as a high-level beginner-friendly API,


Suggested change

`Text2TextGenerationPipeline`, as well as the related `SummarizationPipeline` and `TranslationPipeline`, were deprecated and will now be removed. The

`question-answering` pipeline has also been removed. `pipeline` classes are intended as a high-level beginner-friendly API,

`question-answering` and `Text2TextGenerationPipeline`, including its related `SummarizationPipeline` and `TranslationPipeline`, were deprecated and will now be removed. `pipeline` classes are intended as a high-level beginner-friendly API,

More of a nit, the first 2 sentences just don't read super well

vasqu · 2026-01-19T11:18:14Z

MIGRATION_GUIDE_V5.md

-Similarly, the `image-to-text` pipeline has been removed. This pipeline was used for early image captioning models, but these
-no longer offer competitive performance. Instead, for image captioning tasks we recommend using a modern vision-language chat model
-via the `image-text-to-text` pipeline. For example:
+The above example can be adapted for translation or question answering simply by changing the prompt.


Suggested change

The above example can be adapted for translation or question answering simply by changing the prompt.

The above example can be adapted for other tasks, e.g. translation or question answering, simply by changing the prompt.

vasqu · 2026-01-19T11:20:21Z

MIGRATION_GUIDE_V5.md

+
+### Other changes
+
+- The `feature-extraction` pipeline has now been renamed to `text-embedding` and the `image-feature-extraction` pipeline has been renamed to `image-embedding`. The older names are still usable as aliases, so this should not impact your existing code.


Do we want to mark that these aliases won't be forever (we should deprecate them later on)

Unless we make changes on the Hub side, people will forever have feature-extraction models they'll want to run.

vasqu · 2026-01-19T11:22:44Z

src/transformers/__init__.py

-        "FillMaskPipeline",
        "ImageClassificationPipeline",
-        "ImageFeatureExtractionPipeline",
+        "ImageEmbeddingPipeline",


We should be able to build on top of #42564 for getting modality-specific embeddings, just as a note (to myself)

vasqu · 2026-01-19T11:27:01Z

src/transformers/pipelines/__init__.py

+        "impl": TextEmbeddingPipeline,
        "pt": (AutoModel,) if is_torch_available() else (),
        "default": {"model": ("distilbert/distilbert-base-cased", "6ea8117")},
-        "type": "multimodal",


So this was wrong?

vasqu · 2026-01-19T11:30:56Z

tests/models/bart/test_modeling_bart.py

-    @slow
-    def test_base_mask_filling(self):
-        pbase = pipeline(task="fill-mask", model="facebook/bart-base")
-        src_text = [" I went to the <mask>."]
-        results = [x["token_str"] for x in pbase(src_text)]
-        assert " bathroom" in results
-
-    @slow
-    def test_large_mask_filling(self):
-        plarge = pipeline(task="fill-mask", model="facebook/bart-large")
-        src_text = [" I went to the <mask>."]
-        results = [x["token_str"] for x in plarge(src_text)]
-        expected_results = [" bathroom", " gym", " wrong", " movies", " hospital"]
-        self.assertListEqual(results, expected_results)


These are essentially integration tests, can we rewrite those instead of removing

vasqu · 2026-01-19T11:33:38Z

tests/test_pipeline_mixin.py

-    @is_pipeline_test
-    def test_pipeline_fill_mask(self):
-        self.run_task_tests(task="fill-mask")


Only seeing tests removed but no renames / additions for the new naming? E.g. self.run_task_tests(task="text-embedding") should exist, no?

vasqu · 2026-01-19T11:34:03Z

tests/test_pipeline_mixin.py

    "document-question-answering": {"test": DocumentQuestionAnsweringPipelineTests},
-    "feature-extraction": {"test": FeatureExtractionPipelineTests},
-    "fill-mask": {"test": FillMaskPipelineTests},
+    "text-embedding": {"test": FeatureExtractionPipelineTests},


Nit: We should also rename FeatureExtractionPipelineTests (for image as well)

Rocketknight1 marked this pull request as ready for review January 16, 2026 15:03

Rocketknight1 changed the title ~~Rename the feature extraction pipelines and remove question-answering~~ More V5 pipeline cleanup Jan 16, 2026

Rocketknight1 added 13 commits January 16, 2026 17:42

Rename the feature extraction pipelines and remove question-answering

c44bcc2

make style

cf39acc

Remove more refs to the question-answering pipelines

5a2eb1a

Remove more refs to the question-answering pipelines

48b99db

More migration guide

e1f24b9

make fix-repo

aec8901

Correct the name imports in init.py

0a5662a

Cleanup lots of refs to the pipelines

6167ebb

make fix-repo

4610a2f

Remove a bunch of doc refs too

de9801d

Remove from not_doctested.txt

5de17b5

More cleanup of the QA pipelines

b1fc9ac

More cleanup of the QA pipelines

1373e63

Rocketknight1 force-pushed the more_pipeline_cleanup branch from de69c78 to 1373e63 Compare January 16, 2026 17:42

Rocketknight1 added 4 commits January 16, 2026 17:45

Catch more examples in the docs

f72ef88

Catch more examples in the docs

7aaa073

Remove refs to the argumenthandler too

d59dfa6

A few more refs to fill-mask

696f910

Rocketknight1 added 2 commits January 16, 2026 18:34

Remove references in the pipeline model mappings

88ab3f1

Remove references in the pipeline model mappings

3571cbe

vasqu reviewed Jan 19, 2026

View reviewed changes

		`Text2TextGenerationPipeline`, as well as the related `SummarizationPipeline` and `TranslationPipeline`, were deprecated and will now be removed. The
		`question-answering` pipeline has also been removed. `pipeline` classes are intended as a high-level beginner-friendly API,

	`Text2TextGenerationPipeline`, as well as the related `SummarizationPipeline` and `TranslationPipeline`, were deprecated and will now be removed. The
	`question-answering` pipeline has also been removed. `pipeline` classes are intended as a high-level beginner-friendly API,
	`question-answering` and `Text2TextGenerationPipeline`, including its related `SummarizationPipeline` and `TranslationPipeline`, were deprecated and will now be removed. `pipeline` classes are intended as a high-level beginner-friendly API,

	The above example can be adapted for translation or question answering simply by changing the prompt.
	The above example can be adapted for other tasks, e.g. translation or question answering, simply by changing the prompt.


		### Other changes

		- The `feature-extraction` pipeline has now been renamed to `text-embedding` and the `image-feature-extraction` pipeline has been renamed to `image-embedding`. The older names are still usable as aliases, so this should not impact your existing code.

More V5 pipeline cleanup #43325

Are you sure you want to change the base?

More V5 pipeline cleanup #43325

Conversation

Rocketknight1 commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Jan 16, 2026

Uh oh!

vasqu commented Jan 16, 2026

Uh oh!

github-actions bot commented Jan 16, 2026

Uh oh!

Rocketknight1 commented Jan 16, 2026

Uh oh!

github-actions bot commented Jan 16, 2026

Uh oh!

vasqu left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Rocketknight1 commented Jan 16, 2026 •

edited

Loading