Adding support for pronunciation_dict_id by bpanahij · Pull Request #56 · cartesia-ai/cartesia-python

bpanahij · 2025-11-08T04:01:35Z

Summary

Adds support for the pronunciation_dict_id parameter to TTS generation requests, enabling users to apply custom pronunciation dictionaries to their text-to-speech generations.

Changes

Added pronunciation_dict_id parameter to GenerationRequestParams and GenerationRequest types
- src/cartesia/tts/requests/generation_request.py:72-76
- src/cartesia/tts/types/generation_request.py:77-79
Added test coverage for both SSE and WebSocket interfaces
- tests/custom/test_client.py:434-455 - SSE test
- tests/custom/test_client.py:610-641 - WebSocket async test

Implementation Details

The pronunciation_dict_id parameter is:

Optional (typing_extensions.NotRequired in TypedDict, typing.Optional[str] in Pydantic model)
Available in both SSE and WebSocket generation methods
Applied on a per-generation basis, allowing different pronunciation dictionaries for different requests

Test Plan

✅ Added test_sse_pronunciation_dict() to verify SSE endpoint accepts the parameter
✅ Added test_ws_pronunciation_dict() to verify WebSocket endpoint accepts the parameter
✅ Both tests validate audio generation works correctly with the parameter

This enables users of the official Cartesia Python client to leverage custom pronunciation dictionaries when generating speech.

type: string A pronunciation dict ID to use for the generation. This will be applied to this TTS generation only. This enable the use of custom pronunciation dicts when using the official Cartesia python client.

…nciation_dict_id Adding support for pronunciation_dict_id

type: string A pronunciation dict ID to use for the generation. This will be applied to this TTS generation only. This enable the use of custom pronunciation dicts when using the official Cartesia python client.

Takes changes from #56 and adds support in bytes method and ws send wrapper methods. --------- Co-authored-by: Brian Johnson <brian@pjohnson.info>

bpanahij added 14 commits November 7, 2025 19:57

Adding support for pronunciation_dict_id

bfcb46a

type: string A pronunciation dict ID to use for the generation. This will be applied to this TTS generation only. This enable the use of custom pronunciation dicts when using the official Cartesia python client.

Merge pull request #1 from Tavus-Engineering/Adding-support-for-pronu…

160561e

…nciation_dict_id Adding support for pronunciation_dict_id

updating version

0f0303e

Adding pronunciation to requset

c247066

More

56acaf0

Fixing

75c9a84

Default to None

f265a47

Adding support for pronunciation_dict_id

654cd38

type: string A pronunciation dict ID to use for the generation. This will be applied to this TTS generation only. This enable the use of custom pronunciation dicts when using the official Cartesia python client.

updating version

33264ab

Adding pronunciation to requset

e679114

More

a63728d

Fixing

1434411

Default to None

cde214b

Merge branch 'main' into Adding-support-for-pronunciation_dict_id

bedfef2

noahlt mentioned this pull request Nov 13, 2025

pronunciation dict updates #57

Merged

noahlt added a commit that referenced this pull request Nov 13, 2025

pronunciation dict updates (#57)

9f3fac3

Takes changes from #56 and adds support in bytes method and ws send wrapper methods. --------- Co-authored-by: Brian Johnson <brian@pjohnson.info>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding support for pronunciation_dict_id#56

Adding support for pronunciation_dict_id#56
bpanahij wants to merge 14 commits intocartesia-ai:mainfrom
Tavus-Engineering:Adding-support-for-pronunciation_dict_id

bpanahij commented Nov 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bpanahij commented Nov 8, 2025

Summary

Changes

Implementation Details

Test Plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant