Good WER using attention-based encoder-decoder on WSJ: https://arxiv.org/abs/1811.02770?fbclid=IwAR12930xgp1Z9Fw1eJ72JkDWfGPevNZ7exP77z-Y3hkn4P1gzH5oWytGeeU