- Normalize punctuation
- Tokenize (word and character level)
- Learn and apply Byte Pair Encoding (BPE)
- Generate Python code corpus
- Generate annotation corpus
- Shuffle data
- Split data into train/dev/test sets (see the shuffle-and-split sketch after this list)
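
The shuffle and split steps must keep each code line aligned with its annotation. Below is a minimal sketch of those two steps; the file names (`all.code`, `all.anno`, `split_data/...`) and the 90/5/5 ratio are illustrative assumptions, not the repo's actual configuration.

```python
# Minimal sketch of steps 6-7: shuffle parallel data, then split it.
# File names and split ratios are assumptions for illustration.
import os
import random

def shuffle_and_split(code_lines, anno_lines, dev_frac=0.05, test_frac=0.05, seed=42):
    """Shuffle code/annotation pairs together so alignment is preserved, then split."""
    pairs = list(zip(code_lines, anno_lines))
    random.Random(seed).shuffle(pairs)  # fixed seed for a reproducible split
    n = len(pairs)
    n_dev, n_test = int(n * dev_frac), int(n * test_frac)
    dev = pairs[:n_dev]
    test = pairs[n_dev:n_dev + n_test]
    train = pairs[n_dev + n_test:]
    return train, dev, test

if __name__ == "__main__":
    with open("all.code") as f_code, open("all.anno") as f_anno:
        code = f_code.read().splitlines()
        anno = f_anno.read().splitlines()
    train, dev, test = shuffle_and_split(code, anno)
    os.makedirs("split_data", exist_ok=True)
    for name, split in [("train", train), ("dev", dev), ("test", test)]:
        with open(f"split_data/{name}.code", "w") as fc, \
             open(f"split_data/{name}.anno", "w") as fa:
            fc.writelines(c + "\n" for c, _ in split)
            fa.writelines(a + "\n" for _, a in split)
```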
The shell script that drives hyperparameter tuning was generated with generate_sweep_commands.py. The data used for the coarse sweep is in the split_data directory.
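
The actual generate_sweep_commands.py is not reproduced here; the sketch below shows what such a generator might look like, emitting one `python train.py ...` line per point on a small grid. The grid values are assumptions, not the repo's real coarse-sweep ranges.

```python
# Hypothetical sketch in the spirit of generate_sweep_commands.py
# (the real script is not shown in this README). Emits one training
# command per grid point; redirect stdout to a shell script to use it.
import itertools

GRID = {  # assumed coarse-sweep ranges, not the repo's actual values
    "--hidden_units": [512, 1024],
    "--depth": [1, 2],
    "--embedding_size": [300, 500],
}

def main():
    keys = list(GRID)
    for values in itertools.product(*(GRID[k] for k in keys)):
        flags = " ".join(f"{k} {v}" for k, v in zip(keys, values))
        print(f"python train.py --cell_type 'lstm' --attention_type 'luong' {flags}")

if __name__ == "__main__":
    main()
```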
To train manually:
```bash
python train.py --cell_type 'lstm' \
                --attention_type 'luong' \
                --hidden_units 1024 \
                --depth 2 \
                --embedding_size 500 \
                --num_encoder_symbols 30000 \
                --num_decoder_symbols 30000 ...
```
To decode manually (`--model_path` points at a model checkpoint, e.g. model/translate.ckpt-100):
```bash
python decode.py --beam_width 5 \
                 --decode_batch_size 30 \
                 --model_path $PATH_TO_A_MODEL_CHECKPOINT \
                 --max_decode_step 300 \
                 --write_n_best False \
                 --decode_input $PATH_TO_DECODE_INPUT \
                 --decode_output $PATH_TO_DECODE_OUTPUT
```
The seq2seq model is adapted from JayParks's tf-seq2seq (https://github.com/JayParks/tf-seq2seq).
BLEU score calculation (reference file as the argument, system output on stdin):
```bash
perl multi-bleu-detok.perl data/dev.code < output${MODEL_NUM}_dev/output_train_${BEAM_WIDTH}
```
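
If the Moses perl script is not at hand, the sacrebleu Python package computes a comparable detokenized corpus BLEU. This is an assumed alternative, not part of the repo's pipeline, and the hypothesis path is hypothetical.

```python
# Assumed alternative to multi-bleu-detok.perl using sacrebleu
# (pip install sacrebleu); not part of this repo's pipeline.
import sacrebleu

with open("data/dev.code") as f:
    refs = f.read().splitlines()              # one reference per line
with open("output/dev_hypotheses.txt") as f:  # hypothetical output path
    hyps = f.read().splitlines()              # system output, aligned with refs

# corpus_bleu takes a list of hypotheses and a list of reference streams
bleu = sacrebleu.corpus_bleu(hyps, [refs])
print(f"BLEU = {bleu.score:.2f}")
```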