Last week I tried to host flask + tacotron2 + waveglow. It was not possible to simultaneously call decoder.inference due to the https://github.com/NVIDIA/tacotron2/blob/ece7d3f5681bf8fe46a6c3e5293bf8c5aab6cbce/model.py#L437 overwrites previous request's decoder_sate. I've just sent PR. https://github.com/NVIDIA/tacotron2/pull/176