Monitoring metric by Yeongtae · Pull Request #284 · NVIDIA/tacotron2

Yeongtae · 2019-11-18T04:51:49Z

Summary
This PR adds monitoring metrics for the seq2seq TTS model, which can be viewed in TensorBoard.

Monitoring metric

Attention Alignment Diagonality (AAD):
AAD is defined as the length of the attention alignment path divided by the length of the diagonal path.
The attention alignment path is the line connecting the maximum values at each time-step in the attention weight matrix.
This metric indicates how well the model has learned the relationship between the encoder and the decoder.
Average max attention weight:
Like AAD, this metric indicates how well the model has learned the relationship between the encoder and the decoder.
log Mel Cepstral Distortion (log MCD):
Measures the acoustic similarity between the synthesized audio and the target audio.
f0 RMSE:
Measures the similarity of fundamental frequency (f0) between the synthesized audio and the target audio.
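
The two attention metrics can be illustrated with a short NumPy sketch. This is not the code from this PR: the function names are hypothetical, the path is taken through the per-step argmax as described above, and "average max attention weight" is assumed to mean the per-step maximum weight averaged over decoder steps. Padding is assumed to be stripped beforehand.

```python
import numpy as np


def attention_alignment_diagonality(attn):
    """AAD for one utterance (hypothetical helper, not the PR's code).

    attn: 2-D attention weights, shape (decoder_steps, encoder_steps),
    with padding already removed. Needs more than one row or column,
    otherwise the diagonal has zero length.
    """
    attn = np.asarray(attn, dtype=float)
    n_dec, n_enc = attn.shape
    # Encoder position with the largest weight at each decoder time-step.
    peaks = attn.argmax(axis=1)
    # Length of the polyline through the points (t, peaks[t]).
    steps = np.diff(peaks).astype(float)
    path_len = np.sqrt(1.0 + steps ** 2).sum()
    # Length of the straight line from (0, 0) to (n_dec - 1, n_enc - 1).
    diag_len = np.hypot(n_dec - 1, n_enc - 1)
    return float(path_len / diag_len)


def average_max_attention_weight(attn):
    """Mean over decoder steps of the largest weight in each step (assumed definition)."""
    attn = np.asarray(attn, dtype=float)
    return float(attn.max(axis=1).mean())
```

With this formulation a perfectly diagonal, one-hot alignment gives an AAD of 1.0 and an average max attention weight of 1.0; a path that stalls and then jumps moves AAD away from 1.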

Future work

The calculation method for f0 will be changed.
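
However f0 ends up being estimated, the comparison step only needs two time-aligned sequences. The sketch below shows one conventional way to compute the two audio metrics listed under Monitoring metric. It is an assumption-laden illustration, not this PR's implementation: the function names are hypothetical, frames are assumed to be already aligned (same length), unvoiced frames are assumed to be marked with f0 = 0, and the MCD uses the conventional dB formula. How the PR derives `log_MCD` from its inputs is not stated here, so that step is not reproduced.

```python
import numpy as np


def mel_cepstral_distortion(mcep_ref, mcep_syn):
    """Mean MCD in dB between two aligned (frames, coeffs) arrays.

    Conventional definition; assumed, not taken from the PR.
    """
    ref = np.asarray(mcep_ref, dtype=float)
    syn = np.asarray(mcep_syn, dtype=float)
    # Skip the 0th coefficient (overall energy), as is commonly done.
    diff = ref[:, 1:] - syn[:, 1:]
    per_frame = np.sqrt(2.0 * (diff ** 2).sum(axis=1))
    return float((10.0 / np.log(10.0)) * per_frame.mean())


def f0_rmse(f0_ref, f0_syn):
    """RMSE in Hz over frames that are voiced (f0 > 0) in both contours."""
    ref = np.asarray(f0_ref, dtype=float)
    syn = np.asarray(f0_syn, dtype=float)
    voiced = (ref > 0) & (syn > 0)
    if not voiced.any():
        # No frame is voiced in both signals, so the error is undefined.
        return float("nan")
    return float(np.sqrt(np.mean((ref[voiced] - syn[voiced]) ** 2)))
```

Restricting the f0 error to frames voiced in both signals keeps voicing mistakes from dominating the pitch error; whether the PR makes the same choice is not stated.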

Example

[Example image]

Yeongtae and others added 8 commits August 18, 2019 23:42

Merge pull request #5 from NVIDIA/master

11191fb

merge latest works

Removing numpy==1.13.0 from requirements.txt to resolve an error.

6adaf9f

Applying monitoring metrics to Tensorboard.

d2a3fe7

ㄴ attention alignment diagonality
ㄴ average max attention weight
ㄴ f0 RMSE
ㄴ MCD

Applying monitoring metrics to Tensorboard.

ce4ef6c

ㄴ re-implementing and solving errors.

Dataset preprocessing

26954c8

ㄴ Audio processing: trimming silence (if it is < 23 dB), pre-emphasis, amplitude normalization
ㄴ Removing short clips (if shorter than 14847 samples, which may be around percentile 0.10 of my own dataset)
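
The preprocessing steps named in this commit can be sketched with NumPy alone. This is a rough illustration under stated assumptions, not the commit's code: the helper names, the frame and hop sizes, the 0.97 pre-emphasis coefficient, and the choice to apply the length filter after trimming are all assumptions. Only the 23 dB and 14847-sample thresholds come from the commit message. Pre-emphasis is applied after trimming, matching the later commit in this PR.

```python
import numpy as np

TOP_DB = 23.0        # silence threshold from the commit message
MIN_SAMPLES = 14847  # short-clip threshold from the commit message
PREEMPH = 0.97       # assumed coefficient; not stated in the PR


def trim_silence(y, top_db=TOP_DB, frame=1024, hop=256):
    """Drop leading/trailing frames more than top_db below the loudest frame."""
    if len(y) < frame:
        return y
    n_frames = 1 + (len(y) - frame) // hop
    rms = np.array([np.sqrt(np.mean(y[i * hop:i * hop + frame] ** 2))
                    for i in range(n_frames)])
    ref = max(rms.max(), 1e-10)
    db = 20.0 * np.log10(np.maximum(rms, 1e-10) / ref)
    keep = np.nonzero(db > -top_db)[0]
    return y[keep[0] * hop:keep[-1] * hop + frame]


def pre_emphasis(y, coef=PREEMPH):
    """First-order high-pass filter: y[n] - coef * y[n - 1]."""
    return np.append(y[:1], y[1:] - coef * y[:-1])


def normalize(y, peak=1.0):
    """Scale so the largest absolute sample equals `peak`."""
    m = np.abs(y).max() if len(y) else 0.0
    return y if m == 0 else y * (peak / m)


def preprocess(y):
    """Trim, filter out short clips, pre-emphasize, normalize.

    Returns None for clips that are too short after trimming.
    """
    y = trim_silence(np.asarray(y, dtype=float))
    if len(y) < MIN_SAMPLES:
        return None
    return normalize(pre_emphasis(y))
```

For real data, `librosa.effects.trim` offers a comparable `top_db`-based trim; whether this PR uses it is not stated here.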

Fixing metric name errors on TensorBoard.

b590337

ㄴ Renaming the metric from MCD to log_MCD

Moving 'pre_emphasis' so that it is applied after 'trimming'.

cdf64f8

Merge remote-tracking branch 'origin/pr/6' into monitoring_metric

9a8ad53

# Conflicts:
#	train.py

Yeongtae mentioned this pull request Dec 17, 2019

Evaluating checkpoints #290

Closed

Yeongtae changed the title ~~Monitoring metric~~ Monitering metric Dec 23, 2019

Yeongtae changed the title ~~Monitering metric~~ Monitoring metric Dec 23, 2019

Yeongtae force-pushed the monitoring_metric branch from 7b85ecf to 0f6e1b4 Compare January 16, 2020 03:28

Dataset preprocessing

8fad645

ㄴ debugging

Yeongtae force-pushed the monitoring_metric branch from 0f6e1b4 to 8fad645 Compare January 16, 2020 06:18

CookiePPP mentioned this pull request Apr 18, 2020

reduction window is vital for the model to pick up alignment. #280

Open

Merge branch 'master' into monitoring_metric

9ccd41b

nullptr-0 mentioned this pull request Sep 25, 2024

monitoring metrics nullptr-0/tacotron2#2

Open

Yeongtae wants to merge 10 commits into NVIDIA:master from Yeongtae:monitoring_metric

2 participants
