
GriffinLim. RuntimeError: Given transposed=1, weight of size [1026, 1, 1024], expected input[1, 160, 181] to have 1026 channels, but got 160 channels instead #30

@psydok


I ran "python inference.py" and got the following output:

2021-05-25 11:41:42.188662: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudart.so.11.0
2021-05-25 11:41:44.038910: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcuda.so.1
2021-05-25 11:41:44.067997: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:937] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2021-05-25 11:41:44.068584: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1733] Found device 0 with properties: 
pciBusID: 0000:00:04.0 name: Tesla T4 computeCapability: 7.5
coreClock: 1.59GHz coreCount: 40 deviceMemorySize: 14.75GiB deviceMemoryBandwidth: 298.08GiB/s
2021-05-25 11:41:44.068651: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudart.so.11.0
2021-05-25 11:41:44.071265: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcublas.so.11
2021-05-25 11:41:44.071371: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcublasLt.so.11
2021-05-25 11:41:44.073321: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcufft.so.10
2021-05-25 11:41:44.073801: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcurand.so.10
2021-05-25 11:41:44.073918: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcusolver.so.11'; dlerror: libcusolver.so.11: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/lib64-nvidia
2021-05-25 11:41:44.074408: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcusparse.so.11
2021-05-25 11:41:44.074614: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudnn.so.8
2021-05-25 11:41:44.074639: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1766] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform.
Skipping registering GPU devices...
2021-05-25 11:41:44.074929: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2021-05-25 11:41:44.075066: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1258] Device interconnect StreamExecutor with strength 1 edge matrix:
2021-05-25 11:41:44.075084: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1264]      
Starting
Text :  С трево+жным чу+вством беру+сь я+ за+ перо+
Checkpoint :  checkpoints/sova_fix/sova_fix_fastspeech_7788502_61k_steps.pyt
2021-05-25 11:42:00.754573: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:176] None of the MLIR Optimization Passes are enabled (registered 2)
2021-05-25 11:42:00.754979: I tensorflow/core/platform/profile_utils/cpu_utils.cc:114] CPU Frequency: 2199995000 Hz
TTS synthesis
predicting
Traceback (most recent call last):
  File "inference.py", line 259, in <module>
    main(sys.argv[1:])
  File "inference.py", line 217, in main
    wav = griffin_lim(m, stft, 30)
  File "/content/drive/MyDrive/FastSpeech2-1/dataset/audio_processing.py", line 239, in griffin_lim
    signal = stft_fn.inverse(magnitudes, angles).squeeze(1)
  File "/content/drive/MyDrive/FastSpeech2-1/utils/stft.py", line 122, in inverse
    padding=0
RuntimeError: Given transposed=1, weight of size [1026, 1, 1024], expected input[1, 160, 181] to have 1026 channels, but got 160 channels instead

I want to synthesize speech with the Griffin-Lim algorithm, but inference fails with the error above.
I have set "melgan_vocoder: True" in default.yaml.

I don't understand what is wrong. What does this error mean, and how can I fix it?
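
One possible reading of the shapes in the error message, as a rough sketch rather than a confirmed diagnosis. It assumes that `inverse()` in utils/stft.py stacks the real and imaginary parts (magnitude*cos(phase) and magnitude*sin(phase)) along the channel axis before the transposed convolution; that file is not shown here, so this is a guess. The helper names are hypothetical and only illustrate the arithmetic.

```python
# Sketch only. It assumes (not verified against utils/stft.py) that inverse()
# concatenates magnitude*cos(phase) and magnitude*sin(phase) along the channel
# axis before the transposed convolution. All helper names are hypothetical.

def linear_bins(filter_length):
    """Frequency bins of a one-sided STFT with the given filter length."""
    return filter_length // 2 + 1


def bins_from_stacked_channels(channels):
    """Bins implied by a channel count that stacks real and imaginary parts."""
    return channels // 2


weight_shape = (1026, 1, 1024)  # copied from the error message
input_shape = (1, 160, 181)     # copied from the error message

expected_bins = bins_from_stacked_channels(weight_shape[0])
received_bins = bins_from_stacked_channels(input_shape[1])

print(expected_bins)                 # 513
print(linear_bins(weight_shape[2]))  # 513
print(received_bins)                 # 80
```

Under that assumption, the inverse STFT expects a 513-bin linear magnitude spectrogram (filter length 1024), while the tensor passed to griffin_lim has 80 bins, a size typical of a mel spectrogram. If that is the case, the mel output would need to be mapped back to linear frequency bins before Griffin-Lim. Whether this repository provides such a step cannot be told from the log.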
