Agent speech output audio is interpreted as user speech when AudioSession configureAudio Output: 'speaker'