Add granit speech doc #41360
base: main
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks, nice initiative! 🤗
nits but we can merge after
dataset = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")
wav = torch.tensor(dataset[0]["audio"]["array"]).unsqueeze(0)  # add batch dimension
Starting from datasets 4.0 this should directly use audio.get_all_samples() etc., see the doc.
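For reference, a minimal sketch of that pattern, assuming datasets >= 4.0 where the audio column decodes to a torchcodec AudioDecoder (attribute names taken from the datasets 4.0 docs; treat them as assumptions if your version differs):

```python
from datasets import load_dataset

dataset = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")

# datasets >= 4.0: the "audio" column is a torchcodec AudioDecoder,
# so samples are decoded on demand instead of read from ["array"]
samples = dataset[0]["audio"].get_all_samples()
wav = samples.data                    # torch.Tensor, shape (num_channels, num_samples)
sampling_rate = samples.sample_rate

# for mono audio, wav already has a leading channel dim of 1,
# so the explicit .unsqueeze(0) from the original snippet is not needed
```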
device = "cuda" if torch.cuda.is_available() else "cpu"

# Load model and processor
model_name = "ibm-granite/granite-speech-3.3-8b"
nit, but let's rather use model_id
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
from datasets import load_dataset

device = "cuda" if torch.cuda.is_available() else "cpu"
let's rather use device_map="auto" in from_pretrained
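A short sketch of that change (device_map="auto" needs accelerate installed; the rest of the snippet is unchanged):

```python
from transformers import AutoModelForSpeechSeq2Seq

# no manual cuda/cpu check: accelerate places the weights automatically
model = AutoModelForSpeechSeq2Seq.from_pretrained(
    "ibm-granite/granite-speech-3.3-8b", device_map="auto"
)
```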
processor = AutoProcessor.from_pretrained(model_name)
tokenizer = processor.tokenizer
model = AutoModelForSpeechSeq2Seq.from_pretrained(
    model_name, device_map=device, torch_dtype=torch.bfloat16
torch_dtype is deprecated! let's use dtype. Also dtype="auto" here, since "torch_dtype": "bfloat16" is in config.json.
What does this PR do?
Add a usage example for the Granite Speech models
@eustlb
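For context, a rough end-to-end version of the usage example this PR adds, with the review suggestions applied. The <|audio|> placeholder and the processor(prompt, wav, ...) call follow the pattern on the Granite Speech model card, so treat the exact prompt strings as assumptions rather than the PR's final code:

```python
import torch
from datasets import load_dataset
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

model_id = "ibm-granite/granite-speech-3.3-8b"
processor = AutoProcessor.from_pretrained(model_id)
tokenizer = processor.tokenizer
model = AutoModelForSpeechSeq2Seq.from_pretrained(
    model_id, device_map="auto", dtype="auto"
)

# pre-4.0 datasets style, as quoted in the PR diff
# (see the datasets 4.0 sketch earlier for the newer pattern)
dataset = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")
wav = torch.tensor(dataset[0]["audio"]["array"]).unsqueeze(0)  # add batch dimension

# prompt format assumed from the Granite Speech model card
chat = [{"role": "user", "content": "<|audio|>can you transcribe the speech into a written format?"}]
prompt = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)

inputs = processor(prompt, wav, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200)

# strip the prompt tokens and decode only the generated transcription
print(tokenizer.decode(outputs[0, inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```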