persona-chat/transformer at master · machinereading/persona-chat

Name	Name	Last commit message	Last commit date
parent directory ..
docs	docs
environment	environment
model	model
.dockerignore	.dockerignore
.gitignore	.gitignore
LICENSE	LICENSE
README.md	README.md
Sample Responses.ipynb	Sample Responses.ipynb
agent.py	agent.py
build.py	build.py
config.py	config.py
eval_f1.py	eval_f1.py
eval_hits.py	eval_hits.py
get_trainer.py	get_trainer.py
interactive.py	interactive.py
requirements.txt	requirements.txt
train.py	train.py
wild.py	wild.py

Name

Last commit message

Last commit date

Sample Responses.ipynb

This is a clone of https://github.com/atselousov/transformer_chatbot

This project use Python3! Before use scripts, you should first do following things:

Download BPE vocabulary files and a checkpoint file

bash envirionment/prepare_environment.sh

Download datasets

python build.py

Reinstall spacy and Download spacy-en

pip uninstall spacy
pip install spacy=2.0.0
wget https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-2.0.0/en_core_web_sm-2.0.0.tar.gz
pip install --user en_core_web_sm-2.0.0.tar.gz
python -m spacy link en_core_web_sm en

Install dependencies

pip install -r requirements.txt

Download pretrained OpenAI-GPT weights

git clone https://github.com/openai/finetune-transformer-lm
mv finetune-transformer-lm/models/* parameters/*
mv parameters/params_shapes.json parameters/parameters_shapes.json
rm -rf finetune-transformer-lm

Train Commands

python train.py

To train from scratch (this initializes weights with pretrained OpenAI GPT), set load_last = False of trainer_config in config.py

To train from some checkpoint, set load_last = True and trained_checkpoint_path = <the path> of trainer_config in config.py

The trainer loads datasets from paths in train_datasets (valid_datasets) of trainer_config in config.py

We used following hyper-parameter settings in our re-implementation. (Modify this in config.py)

n_epochs: 80
batch_size: 160
batch_split: 64
lr: 6.25e-5
lr_warmup: 16000
lm_weight: 0.5
risk_weight: 0
n_jobs: 4
label_smoothing: 0.1
clip_grad: None
test_period: 1

Evaluation Commands

python eval_f1.py
python eval_hits.py

To evaluate learned model, modify checkpoint_path of model_config in config.py to the saved checkpoint.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

This is a clone of https://github.com/atselousov/transformer_chatbot

Train Commands

Evaluation Commands

FilesExpand file tree

transformer

Directory actions

More options

Directory actions

More options

Latest commit

History

transformer

Folders and files

parent directory

README.md

This is a clone of https://github.com/atselousov/transformer_chatbot

Train Commands

Evaluation Commands