Add audio text to text task #1692

MrShahzebKhoso · 2025-08-17T18:55:39Z

This PR introduces support for the Audio-Text-to-Text task in huggingface.js.

Added details to the sections of the audio-text-to-text task section in the packages/tasks/src/tasks/audio-text-to-text/ directory that contains about.md and data.ts.
Ensured consistency with existing task structure and documentation.

…paces, and use cases.

merveenoyan

thanks a lot!
only about.md part needs a bit of changes, we try to keep these pages high quality sources
also cc @Vaibhavs10 if you have time would be nice if you could review!

packages/tasks/src/tasks/audio-text-to-text/about.md

Co-authored-by: Merve Noyan <[email protected]>

Updated about.md as per the review.

MrShahzebKhoso · 2025-08-31T00:13:09Z

@merveenoyan Thank you very much for the guidance and valuable feedback. I have made all the changes.

Updated inference section. Added 3 different codes that can be run directly. The files are from URLs.
Added the Voxtral Examples in the inference section for more authenticity.
Added more details and more use cases for better understanding.
Added resources section in the about.md file. This contains links to the papers, blogs, datasets, and codes. These will be helpful to anyone trying to explore more.

Kindly let me know if you have any further suggestions. I really appreciate your time and input!

Vaibhavs10

gentle reminder @merveenoyan @MrShahzebKhoso to look at existing open PRs before opening a new one, in this case there was already an old task PR from @Deep-unlearning: #1212

That said, let's proceed with this one since it's from the community vs in-house.

Vaibhavs10

took a first pass, cc: @Deep-unlearning - could you take a look by Monday as well please.

packages/tasks/src/tasks/audio-text-to-text/about.md

Co-authored-by: vb <[email protected]>

pcuenca

LGTM with @Vaibhavs10's comments and the linter changes.

packages/tasks/src/tasks/audio-text-to-text/about.md

Co-authored-by: Pedro Cuenca <[email protected]>

MrShahzebKhoso · 2025-08-31T12:20:16Z

Thank you, @Vaibhavs10 and @pcuenca, for reviewing and considering this PR. I’ve made the requested updates and pushed the changes.

Added the two examples from Initial commit: Add task audio-text-to-text #1212.
Updated the resources section with additions from Initial commit: Add task audio-text-to-text #1212.
Revised about.md to align with task documentation standards.
Ensured consistency in formatting and descriptions.

Looking forward to your feedback and approval.

packages/tasks/src/tasks/audio-text-to-text/about.md

Deep-unlearning · 2025-09-01T08:28:58Z

Hi @MrShahzebKhoso, Thanks a lot!
I added some fixes for grammar and typos. Otherwise LGTM!

Co-authored-by: Steven Zheng <[email protected]>

MrShahzebKhoso · 2025-09-01T11:21:29Z

Hi @Deep-unlearning, thank you for reviewing and the fixes. Really appreciate the support.
I have committed these changes.

MrShahzebKhoso added 2 commits August 17, 2025 23:44

Add audio-text-to-text task with datasets, demo, models, datsasets, s…

cbb8c19

…paces, and use cases.

Update about.md

73cf5fe

MrShahzebKhoso requested review from SBrandeis, gary149, Wauplin, julien-c, pcuenca and ngxson as code owners August 17, 2025 18:55

MrShahzebKhoso added 6 commits August 21, 2025 18:42

Merge branch 'main' into add-audio-text-to-text-task

bc4dfe7

Merge branch 'main' into add-audio-text-to-text-task

fd95ece

Merge branch 'main' into add-audio-text-to-text-task

b031fef

Merge branch 'main' into add-audio-text-to-text-task

ce51d87

Merge branch 'main' into add-audio-text-to-text-task

bf06eee

Merge branch 'main' into add-audio-text-to-text-task

0f6538e

merveenoyan reviewed Aug 29, 2025

View reviewed changes

MrShahzebKhoso and others added 6 commits August 30, 2025 00:04

Update packages/tasks/src/tasks/audio-text-to-text/about.md

bd8733b

Co-authored-by: Merve Noyan <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

7930ccc

Co-authored-by: Merve Noyan <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

69a2b79

Co-authored-by: Merve Noyan <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

c91ccef

Co-authored-by: Merve Noyan <[email protected]>

Merge branch 'main' into add-audio-text-to-text-task

5dfde65

Update about.md

c50a69c

Updated about.md as per the review.

Vaibhavs10 reviewed Aug 31, 2025

View reviewed changes

MrShahzebKhoso and others added 6 commits August 31, 2025 13:37

Update packages/tasks/src/tasks/audio-text-to-text/about.md

5289f1b

Co-authored-by: vb <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

9c4e6a4

Co-authored-by: vb <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

e5438ec

Co-authored-by: vb <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

4db1edc

Co-authored-by: vb <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

e8f652e

Co-authored-by: vb <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

b9f1c48

Co-authored-by: vb <[email protected]>

This was referenced Aug 31, 2025

Add audio text to text #1691

Closed

Initial commit: Add task audio-text-to-text #1212

Closed

pcuenca approved these changes Aug 31, 2025

View reviewed changes

MrShahzebKhoso and others added 4 commits August 31, 2025 16:05

Update about.md

5fd050e

Update packages/tasks/src/tasks/audio-text-to-text/about.md

cdeaba0

Co-authored-by: Pedro Cuenca <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

a427dee

Co-authored-by: Pedro Cuenca <[email protected]>

Merge branch 'main' into add-audio-text-to-text-task

01efcb6

Deep-unlearning self-requested a review September 1, 2025 08:15

Deep-unlearning reviewed Sep 1, 2025

View reviewed changes

packages/tasks/src/tasks/audio-text-to-text/about.md Outdated Show resolved Hide resolved

Deep-unlearning reviewed Sep 1, 2025

View reviewed changes

packages/tasks/src/tasks/audio-text-to-text/about.md Outdated Show resolved Hide resolved

Deep-unlearning reviewed Sep 1, 2025

View reviewed changes

packages/tasks/src/tasks/audio-text-to-text/about.md Outdated Show resolved Hide resolved

Deep-unlearning reviewed Sep 1, 2025

View reviewed changes

packages/tasks/src/tasks/audio-text-to-text/about.md Outdated Show resolved Hide resolved

MrShahzebKhoso and others added 4 commits September 1, 2025 16:09

Update packages/tasks/src/tasks/audio-text-to-text/about.md

ce2fdef

Co-authored-by: Steven Zheng <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

69b82f0

Co-authored-by: Steven Zheng <[email protected]>

Update packages/tasks/src/tasks/audio-text-to-text/about.md

8096da6

Co-authored-by: Steven Zheng <[email protected]>

Merge branch 'main' into add-audio-text-to-text-task

7cf7f65

MrShahzebKhoso added 6 commits September 1, 2025 18:30

Merge branch 'main' into add-audio-text-to-text-task

ee2d130

Merge branch 'main' into add-audio-text-to-text-task

944ce81

Merge branch 'main' into add-audio-text-to-text-task

0b96556

Merge branch 'main' into add-audio-text-to-text-task

0a21659

Merge branch 'main' into add-audio-text-to-text-task

0087d9d

Merge branch 'main' into add-audio-text-to-text-task

26e63a7

Add audio text to text task #1692

Are you sure you want to change the base?

Add audio text to text task #1692

Uh oh!

Conversation

MrShahzebKhoso commented Aug 17, 2025

Uh oh!

merveenoyan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MrShahzebKhoso commented Aug 31, 2025

Uh oh!

Vaibhavs10 left a comment

Choose a reason for hiding this comment

Uh oh!

Vaibhavs10 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pcuenca left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MrShahzebKhoso commented Aug 31, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Deep-unlearning commented Sep 1, 2025

Uh oh!

MrShahzebKhoso commented Sep 1, 2025

Uh oh!

Uh oh!

pcuenca left a comment •

edited

Loading