Skip to content

GAN model cannot converge. #20

@Bleachzer

Description

@Bleachzer

I found that the GAN model cannot converge during training. I followed the data format in tests/fixtures folder to create the train_setences.tsv file and train_dialog.tsv file from the de-duplicated version of DailyDialog dataset which is proposed by Bahuleyan et al. in their GitHub https://github.com/HareeshBahuleyan/probabilistic_nlg/tree/master/dialog/data/DailyDial/de_duplicated. I did not change any code and the VAE model is well trained whereas the GAN model will early stop after training 5 epochs. The training loss for GAN model did not decrease and the GAN model can only produce some meaningless sentences.

Are there any suggestions for solving this?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions