Article suggests text-to-speech model output format is an image #25480

@chowieuk

Description

Existing documentation URL(s)

https://developers.cloudflare.com/workers-ai/models/aura-1/

Output

The binding returns a ReadableStream with the image in JPEG or PNG format (check the model's output schema).

What changes are you suggesting?

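I haven't worked with this model, but since it is a text-to-speech model, I assume the output is audio rather than an image. I'd suggest updating the Output section to describe an audio stream instead of a JPEG or PNG image.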
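To make the expected behavior concrete, here is a minimal sketch of how an audio stream from the binding might be returned from a Worker. It is untested against the real model and rests on several assumptions: the `AiBinding` interface is a hypothetical, pared-down stand-in for the Workers AI binding, the model ID `@cf/deepgram/aura-1` is inferred from the documentation URL, the `{ text }` input shape is a guess, and `audio/mpeg` is a placeholder MIME type (the model's output schema would give the real one).

```typescript
// Hypothetical, minimal shape of the Workers AI binding.
// Only the part this sketch needs; the real binding may differ.
interface AiBinding {
  run(model: string, inputs: { text: string }): Promise<ReadableStream<Uint8Array>>;
}

// Assumptions: model ID inferred from the docs URL, MIME type is a placeholder.
const MODEL = "@cf/deepgram/aura-1";
const AUDIO_MIME = "audio/mpeg";

// Pass the stream returned by the binding straight through as the response
// body, labelled as audio rather than as a JPEG or PNG image.
async function speak(ai: AiBinding, text: string): Promise<Response> {
  const audio = await ai.run(MODEL, { text });
  return new Response(audio, { headers: { "Content-Type": AUDIO_MIME } });
}
```

In a Worker, `speak(env.AI, "Hello")` could be returned directly from the `fetch` handler, assuming the binding is named `AI`.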

Additional information

No response
