You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/docs/extraction/audio.md
+5-1Lines changed: 5 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -82,10 +82,13 @@ Use the following procedure to run the NIM locally.
82
82
.extract(
83
83
document_type="wav", # Ingestor should detect type automatically in most cases
84
84
extract_method="audio",
85
+
extract_audio_params={
86
+
"segment_audio": True,
87
+
},
85
88
)
86
89
)
87
90
```
88
-
91
+
To generate one extracted element for each sentence-like ASR segment, include `extract_audio_params={"segment_audio": True}` when calling `.extract(...)`. This option applies when audio extraction runs with a Parakeet NIM (either locally through Docker or remotely via NVCF) but has no effect when using the local Hugging Face Parakeet model.
89
92
90
93
!!! tip
91
94
@@ -117,6 +120,7 @@ Instead of running the pipeline locally, you can use NVCF to perform inference b
Set extract_audio_params={"segment_audio": True} to output sentence-like audio segments as distinct extracted elements. This setting applies only when audio extraction runs through a hosted Parakeet endpoint—such as the Parakeet ASR NIM or NVCF—and has no effect when using the local Hugging Face Parakeet model.
0 commit comments