Skip to content
Discussion options

You must be logged in to vote

Looks like you're missing a / 32768.0 and make sure audio has only 1 dimension.

return np.frombuffer(out, np.int16).flatten().astype(np.float32) / 32768.0

Note that the results are expected to be disjointed and possibly have missing words because of the 5 second chunking.

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@Ca-ressemble-a-du-fake
Comment options

@Ca-ressemble-a-du-fake
Comment options

@elpidiovaldez
Comment options

@MarkoMilos
Comment options

Answer selected by elpidiovaldez
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants