In the Conversations API, messages can contain input audio, but this definition is completely missing from the spec.