
Commit 9815442

Merge branch 'release_04' of github.com:ECP-CANDLE/Benchmarks into release_04

2 parents: 2583a46 + f134c0f
File tree

9 files changed (+68, -683 lines)


Pilot1/Attn/attn_bin_working_jan7_h5.py

Lines changed: 0 additions & 550 deletions
This file was deleted.

Pilot1/Attn/attn_bin_working_jan7_h5.sh

Lines changed: 0 additions & 51 deletions
This file was deleted.

Pilot1/Attn/attn_bsub.sh

Lines changed: 0 additions & 57 deletions
This file was deleted.

Pilot1/Attn/cmd1.sh

Lines changed: 0 additions & 17 deletions
This file was deleted.

Pilot1/Attn/cmd2.sh

Lines changed: 0 additions & 5 deletions
This file was deleted.

examples/histogen/README.md

Lines changed: 21 additions & 0 deletions
@@ -2,6 +2,27 @@
 
 ## Usage
 
+The CANDLE-ized versions of the codes can simply be run without any command-line arguments; the default settings are read from the corresponding `default_model` file.
+When needed, the CANDLE versions also use the `fetch_file` methods, which store the data in the top-level `Data/Examples` directory.
+Any keyword in the `default_model` file can be overridden with the corresponding command-line argument.
+The original codes and workflow below are preserved for comparison.
+New package dependencies are now included in the top-level install instructions.
+
+# CANDLE workflow
+
+Sample images (the trained models will be downloaded automatically).
+```
+python sample_baseline_pytorch.py
+```
+Training pipeline
+```
+python train_vqvae_baseline_pytorch.py -e 1
+python extract_baseline_pytorch.py
+python train_pixelsnail_baseline_pytorch.py
+```
+
+# Original workflow
+
 Sample histology images from a trained histology image model.
 
 1. Download trained models into `checkpoint` folder.
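
For context on the usage notes above: a `default_model` file is a plain config file of keyword/value pairs, and any of those keywords can instead be supplied as a command-line flag. A hypothetical sketch (the keys and values here are assumptions for illustration, not taken from the repository):

```
[Global_Params]
model_name = 'histogen'
epochs = 2
batch_size = 16
```

With such a file, `python train_vqvae_baseline_pytorch.py -e 1` would override the `epochs` keyword while leaving the other defaults in place.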

examples/image-vae/README.md

Lines changed: 15 additions & 0 deletions
@@ -1,3 +1,18 @@
+## Usage
+
+The CANDLE-ized versions of the codes can simply be run without any command-line arguments; the default settings are read from the corresponding `default_model` file.
+When needed, the CANDLE versions also use the `fetch_file` methods, which store the data in the top-level `Data/Examples` directory.
+Any keyword in the `default_model` file can be overridden with the corresponding command-line argument.
+The original codes and workflow below are preserved for comparison.
+New package dependencies are now included in the top-level install instructions.
+
+# CANDLE workflow
+
+```
+python image_vae_baseline_pytorch.py
+python sample_baseline_pytorch.py
+```
+
 # Image VAE
 
 2D images are a relatively unexplored representation for molecular learning tasks. We create a molecular generator and embedding based on 2D depictions of molecules. We use a variational autoencoder (VAE) to encode 2D images of molecules into a latent space of dimension 512, and with a Gaussian prior we sample the space and decode directly to images. A modified ResNet is used to encode molecular depictions to the latent space. A decoder is created by performing the inverse operations of ResNet (i.e., running the blocks in reverse order and replacing convolutional layers with deconvolution (transpose convolution) layers). One can embed molecules in this space using only the encoder, or, by generating random Gaussian noise, decode a latent vector into a molecular image. In the latent space, generation can also be steered through interpolation or epsilon-sampling. VAEs are prone to mode collapse and exploding gradients. Mode collapse occurs when enforcing a normal prior on the latent space causes learning to "collapse" and the model ceases to learn. Exploding gradients occur when the gradients become so large that the optimization routine becomes unstable and, again, learning ceases. To avoid mode collapse, we use KL-divergence loss annealing, which slowly ramps up the weight of the normal prior on the latent space as the model learns to reconstruct the encoded images better; this essentially enforces that the decoder and encoder learn at similar rates. To avoid exploding gradients, we use gradient clipping, which limits the magnitude of any particular optimization step and enforces a slow and gradual learning process.
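
A minimal sketch of the two stabilizers this paragraph describes, KL-divergence loss annealing and gradient clipping. This is illustrative code under stated assumptions (a model returning `(recon_x, mu, logvar)`, a loader yielding image batches, a linear 10-epoch ramp, a clipping norm of 1.0), not the benchmark's implementation:

```
# Sketch of KL-divergence loss annealing and gradient clipping for a VAE
# (illustrative only; not the benchmark's implementation).
import torch
import torch.nn.functional as F

def vae_loss(recon_x, x, mu, logvar, kl_weight):
    # Reconstruction term plus the KL term against the unit-Gaussian prior,
    # with the KL contribution scaled by the annealing weight.
    recon = F.mse_loss(recon_x, x, reduction="sum")
    kld = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl_weight * kld

def train_epoch(model, loader, optimizer, epoch, anneal_epochs=10, max_norm=1.0):
    # Ramp the KL weight linearly from 0 to 1 over `anneal_epochs` epochs so
    # reconstruction is learned before the prior is fully enforced.
    kl_weight = min(1.0, epoch / float(anneal_epochs))
    for x in loader:
        optimizer.zero_grad()
        recon_x, mu, logvar = model(x)  # assumed model interface
        loss = vae_loss(recon_x, x, mu, logvar, kl_weight)
        loss.backward()
        # Clip the gradient norm so no single step can destabilize training.
        torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=max_norm)
        optimizer.step()
```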

examples/rnagen/README.md

Lines changed: 15 additions & 0 deletions
@@ -1,3 +1,18 @@
+## Usage
+
+The CANDLE-ized versions of the codes can simply be run without any command-line arguments; the default settings are read from the corresponding `default_model` file.
+When needed, the CANDLE versions also use the `fetch_file` methods, which store the data in the top-level `Data/Examples` directory.
+Any keyword in the `default_model` file can be overridden with the corresponding command-line argument.
+The original codes and workflow below are preserved for comparison.
+New package dependencies are now included in the top-level install instructions.
+
+# CANDLE workflow
+
+```
+python rnagen_baseline_keras2.py
+python rnagen_baseline_keras2.py --plot
+```
+
 # Improving cancer type classifier with synthetic data
 
 We demonstrate the value of generator models in boosting the performance of predictive models.

examples/rnngen/README.md

Lines changed: 17 additions & 3 deletions
@@ -1,8 +1,22 @@
 # RNN Generator
 Based 99.98% on the model from [1].
 
+## Usage
 
-# How to use Molecular Generator Code
+The CANDLE-ized versions of the codes can simply be run without any command-line arguments; the default settings are read from the corresponding `default_model` file.
+When needed, the CANDLE versions also use the `fetch_file` methods, which store the data in the top-level `Data/Examples` directory.
+Any keyword in the `default_model` file can be overridden with the corresponding command-line argument.
+The original codes and workflow below are preserved for comparison.
+
+# CANDLE workflow
+
+This will automatically download the models needed and run with the `autosave.model.pt` checkpoint set in the `default_model` file.
+
+```
+python infer_rnngen_baseline_pytorch.py
+```
+
+# Original workflow
 
 ## Python dependencies
 
@@ -38,6 +52,6 @@ python infer.py -i mosesrun/ --logdir pilot1/ -o p1_poor.txt -n 10000 -vr --mode
 
 
 # References:
-1. Gupta, A., Müller, A., Huisman, B., Fuchs, J., Schneider, P., Schneider, G. (2018). Generative Recurrent Networks for De Novo Drug Design Molecular Informatics 37(1-2)https://dx.doi.org/10.1002/minf.201700111
-2. Polykovskiy, D., Zhebrak, A., Sanchez-Lengeling, B., Golovanov, S., Tatanov, O., Belyaev, S., Kurbanov, R., Artamonov, A., Aladinskiy, V., Veselov, M., Kadurin, A., Nikolenko, S., Aspuru-Guzik, A., Zhavoronkov, A. (2018). Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Modelshttps://arxiv.org/abs/1811.12823
+1. Gupta, A., Müller, A., Huisman, B., Fuchs, J., Schneider, P., Schneider, G. (2018). Generative Recurrent Networks for De Novo Drug Design. Molecular Informatics 37(1-2). https://dx.doi.org/10.1002/minf.201700111
+2. Polykovskiy, D., Zhebrak, A., Sanchez-Lengeling, B., Golovanov, S., Tatanov, O., Belyaev, S., Kurbanov, R., Artamonov, A., Aladinskiy, V., Veselov, M., Kadurin, A., Nikolenko, S., Aspuru-Guzik, A., Zhavoronkov, A. (2018). Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models. https://arxiv.org/abs/1811.12823
 
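
As with the other examples, the checkpoint named in the rnngen `default_model` file can be overridden from the command line. A hypothetical invocation (both the keyword name `model` and the checkpoint file name are placeholders for illustration, not confirmed by this diff):

```
python infer_rnngen_baseline_pytorch.py --model pilot1.model.pt
```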
