Skip to content

For ONVIF TTS audio proposal, to support device with TTS function#692

Merged
Peggy0422 merged 6 commits intoonvif:video/TTS-audio-clipfrom
Peggy0422:video/TTS-audio-clip
Dec 1, 2025
Merged

For ONVIF TTS audio proposal, to support device with TTS function#692
Peggy0422 merged 6 commits intoonvif:video/TTS-audio-clipfrom
Peggy0422:video/TTS-audio-clip

Conversation

@Peggy0422
Copy link
Copy Markdown

To support audio device with TTS function, several operations should be done, which are:

TTSCapabilities(Optional): Add complex type TTSCapabilities to the existing complex Type "AudioClipCapabilities" as optional, to indicate whether the device is capable of TTS function and the detailed configuration if so.
Parameter:

  1. MaxContentLength: the max length of the content in a text that device could convert into an audio clip;
  2. TTSLanguage: indicates what languages the device supports in TTS.
  3. TTSVoiceType: indicates what types of voice that the device supports in TTS.

Add “AddTTSAudioClip”and "AddTTSAudioClipResponse"element: To send a text, corresponding TTS configuration and audio clip configuration to device, for device converting the text to an audio clip based on the TTS configuration, and play it according to audio clip configuration in the future.
Parameter:

  1. Token(Optional): token for the audio clip.
  2. Configuration: audio clip configuration to add, reference to Configuration for AddAudioClip.
  3. TTSConfiguration: the configuration for the TTS audio clip to add, it specifies the audio content, language and voice type when device play this audio clip.
    Reponse:
  4. Token: Unique token of the TTS audio clip to be uploaded.

media2.wsdl

  1. Added AddTTSAudioClip request and AddTTSAudioClip response for sending a text, audio clip configuration, TTS configuration to the device;
  2. Added complex types "TTSAudio" for TTSConfiguration to support TTS function. It includes parameters: Content, Language, VoiceType.
  3. Updated AudioClipCapabilities with TTSCapabilities, and added complex types for TTSCapabilities to indicate the device supports TTS function and its corresponding configuration. Complex types TTSCapabilities includes MaxContentLength, TTSLanguage and TTSVoiceType.
  4. Added simpleType TTSLanguage and TTSVoiceType.

media2.xml and documentation

  1. Added detailed descriptions for AddTTSAudioClip operations, explaining their purpose, parameters, and responses.
  2. Updated audio clip Capabilities with TTSCapabilities.
    ONVIF-Media2-Service-Spec-TTS update.docx

1. Added AddTTSAudioClip request and AddTTSAudioClip response for sending a text and its TTS configuration to the device(1621-1652)(2036-2041)(2418-2422)(2935-2943).
2. Added complex types "TTS Audio" (1465-1485)for TTSConfiguration to support TTS function. It includes parameters Content, Language, VoiceType.
3. updated AudioClipCapabilities with TTSCapabilities(177-181), and added complex types for TTSCapabilities(201-220)to indicate the device supports TTS function and its corresponding configuration. 
complex types TTSCapabilities includes MaxContentLength, TTSLanguage and TTSVoiceType.
4. Added simpleType TTSLanguage(220-231) and TTSVoiceType(232-238).
1. Added detailed descriptions for AddTTSAudioClip operations, explaining their purpose, parameters, and responses.(2359-2416)
2. updated audio clip Capabilities with TTSCapabilities.(2698-2700)
update code line information for TTS function
correct some editorial errors
Updated the description of the AddTTSAudioClip operation to clarify the parameters and response. Updated the description of TTScapabilities.
@Peggy0422 Peggy0422 closed this Dec 1, 2025
@Peggy0422 Peggy0422 reopened this Dec 1, 2025
@Peggy0422 Peggy0422 merged commit 823e174 into onvif:video/TTS-audio-clip Dec 1, 2025
@ocampana-videotec
Copy link
Copy Markdown
Collaborator

@Peggy0422 shoudl we close this and just refer to #694 ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants