Fix ST1 Readme. Multiple fixes to examples.

jmohdyusof · jmohdyusof · commit b9b3870e3e4f · 2022-05-25T17:28:08.000-06:00
diff --git a/Pilot1/ST1/README.md b/Pilot1/ST1/README.md
@@ -1,9 +1,9 @@
 # Simple transformers for classification and regression using SMILE string input
 
 ## Introduction
-The ST1 benchmark represent two versions of a simple transformer, one that can perform regression and the other classification. We chose the transformer architecture to see if we could train directly on SMILE strings. This benchmark brings novel capability to the suite of Pilot1 benchmarks in two ways. First, the featureization of a small molecule is simple its SMILE string. The secone novel aspect to the set of Pilot1 benchmarks is that the model is based on the Transformer architecture, albeit this benchmark is a simpler version of the large Transformer models that train on billions and greater parameters.
+The ST1 benchmark represent two versions of a simple transformer, one that can perform regression and the other classification. We chose the transformer architecture to see if we could train directly on SMILE strings. This benchmark brings novel capability to the suite of Pilot1 benchmarks in two ways. First, the featureization of a small molecule is simply its SMILE string. The second novel aspect to the set of Pilot1 benchmarks is that the model is based on the Transformer architecture, albeit this benchmark is a simpler version of the large Transformer models that train on billions and greater parameters.
 
-Both the original code and the CANDLE versions are available. The original examples are retained and can be run as noted below. The CANDLE versions make use of the common network design in smiles_transformer.py, and implement the models in `sct_baseline2_keras.py` and `srt_baseline_keras2.py`, for classification and regression, respectively. 
+Both the original code and the CANDLE versions are available. The original examples are retained and can be run as noted below. The CANDLE versions make use of the common network design in `smiles_transformer.py`, and implement the models in `sct_baseline_keras2.py` and `srt_baseline_keras2.py`, for classification and regression, respectively. 
 
 The example classification problem takes as input SMILE strings and trains a model to predict whether or not a compound is 'drug-like' based on Lipinski criteria. The example regression problem takes as input SMILE strings and trains a model to predict the molecular weight. Data are freely downloadable and automatically downloaded by the CANDLE versions.
 
@@ -12,8 +12,10 @@ For the CANDLE versions, all the relevant arguments are contained in the respect
 class_default_model.txt
 python sct_baseline_keras2.py
 
+```
 and
 
+```
 regress_default_model.txt
 python srt_baseline_keras2.py
 ```
@@ -23,12 +25,16 @@ The original code demonstrating a simple transformer regressor and a simple tran
 ```
 smiles_regress_transformer.py
 
+```
 and
 
+```
 smiles_class_transformer.py
 ```
 
-The example data sets are the same as for the CANDLE versions, and allow one to predict whether a small molecule is "drug-like" based on Lipinski criteria (classification problem), or predict the molecular weight (regression) from a SMILE string as input. The example data sets are downloadable using the information in the regress_default_model.txt or class_default_model.txt files. These data files must be downloaded manually and specified on the command line for execution.
+The example data sets are the same as for the CANDLE versions, and allow one to predict whether a small molecule is "drug-like" based on Lipinski criteria (classification problem), or predict the molecular weight (regression) from a SMILE string as input.
+The example data sets are downloadable using the information in the `regress_default_model.txt` or `class_default_model.txt` files.
+These data files must be downloaded manually and specified on the command line for execution.
 
 ```
 # for regression
@@ -45,8 +51,10 @@ To run the models
 ```
 CUDA_VISIBLE_DEVICES=1 python smiles_class_transformer.py --in_train chm.lipinski.trn.csv --in_vali chm.lipinski.val.csv --ep 25
 
+```
 or
 
+```
 CUDA_VISIBLE_DEVICES=0 python smiles_regress_transformer.py --in_train chm.weight.trn.csv --in_vali chm.weight.val.csv --ep 25
 ```
 The model with the best validation loss is saved in the .h5 dumps. Log files contain the trace. Regression output should look something like this.
diff --git a/examples/IGTD/Scripts/Examples_Of_Table_To_Image_Conversion.py b/examples/IGTD/Scripts/Examples_Of_Table_To_Image_Conversion.py
@@ -7,7 +7,7 @@
 num_col = 30    # Number of pixel columns in image representation
 num = num_row * num_col  # Number of features to be included for analysis, which is also the total number of pixels in image representation
 save_image_size = 3  # Size of pictures (in inches) saved during the execution of IGTD algorithm.
-max_step = 10000    # The maximum number of iterations to run the IGTD algorithm, if it does not converge.
+max_step = 1000    # The maximum number of iterations to run the IGTD algorithm, if it does not converge.
 val_step = 300  # The number of iterations for determining algorithm convergence. If the error reduction rate is smaller than a pre-set threshold for val_step itertions, the algorithm converges.
 
 # Import the example data and linearly scale each feature so that its minimum and maximum values are 0 and 1, respectively.
diff --git a/examples/IGTD/Scripts/Prediction_Modeling_Functions.py b/examples/IGTD/Scripts/Prediction_Modeling_Functions.py
@@ -1,16 +1,16 @@
-from keras import backend
-from keras import optimizers
-from keras.models import Model, load_model
-from keras.layers import Input, Dense, Dropout, concatenate, Conv2D, BatchNormalization, ReLU, MaxPooling2D, \
+from tensorflow.keras import backend
+from tensorflow.keras import optimizers
+from tensorflow.keras.models import Model, load_model
+from tensorflow.keras.layers import Input, Dense, Dropout, concatenate, Conv2D, BatchNormalization, ReLU, MaxPooling2D, \
     Flatten, AlphaDropout
-from keras.callbacks import ModelCheckpoint, CSVLogger, ReduceLROnPlateau, EarlyStopping
+from tensorflow.keras.callbacks import ModelCheckpoint, CSVLogger, ReduceLROnPlateau, EarlyStopping
 from scipy import stats
 from sklearn.metrics import r2_score, mean_squared_error, mean_absolute_error, roc_auc_score, accuracy_score, \
     matthews_corrcoef
 
 import configparser
 import numpy as np
-import keras
+import tensorflow.keras as keras
 import os
 import pandas as pd
 import shutil
diff --git a/examples/chemrep/convert_smiles.py b/examples/chemrep/convert_smiles.py
@@ -1,12 +1,9 @@
 from __future__ import print_function
 
 import os
-import sys
 import logging
 
 file_path = os.path.dirname(os.path.realpath(__file__))
-lib_path2 = os.path.abspath(os.path.join(file_path, '..', '..', 'common'))
-sys.path.append(lib_path2)
 
 import candle
 
diff --git a/examples/darts/advanced/example_setup.py b/examples/darts/advanced/example_setup.py
@@ -3,8 +3,8 @@
 
 
 file_path = os.path.dirname(os.path.realpath(__file__))
-lib_path2 = os.path.abspath(os.path.join(file_path, '..', '..', '..', 'common'))
-sys.path.append(lib_path2)
+#lib_path2 = os.path.abspath(os.path.join(file_path, '..', '..', '..', 'common'))
+#sys.path.append(lib_path2)
 
 
 import candle
diff --git a/examples/darts/uno/example_setup.py b/examples/darts/uno/example_setup.py
@@ -3,8 +3,8 @@
 
 
 file_path = os.path.dirname(os.path.realpath(__file__))
-lib_path2 = os.path.abspath(os.path.join(file_path, '..', '..', '..', 'common'))
-sys.path.append(lib_path2)
+#lib_path2 = os.path.abspath(os.path.join(file_path, '..', '..', '..', 'common'))
+#sys.path.append(lib_path2)
 
 
 import candle
diff --git a/examples/rnagen/rnagen.py b/examples/rnagen/rnagen.py
@@ -148,7 +148,7 @@ def with_prefix(x):
         df1 = df_sample_source.merge(df_source, on='Source', how='left').drop('Source', axis=1)
         logger.info('Embedding RNAseq data source into features: %d additional columns', df1.shape[1] - 1)
 
-    df2 = df.drop('Sample', 1)
+    df2 = df.drop('Sample', axis=1)
     if add_prefix:
         df2 = df2.add_prefix('rnaseq.')
 
diff --git a/examples/rnagen/rnagen_baseline_keras2.py b/examples/rnagen/rnagen_baseline_keras2.py
@@ -183,7 +183,7 @@ def with_prefix(x):
         df1 = df_sample_source.merge(df_source, on='Source', how='left').drop('Source', axis=1)
         logger.info('Embedding RNAseq data source into features: %d additional columns', df1.shape[1] - 1)
 
-    df2 = df.drop('Sample', 1)
+    df2 = df.drop('Sample', axis=1)
     if add_prefix:
         df2 = df2.add_prefix('rnaseq.')
 
diff --git a/examples/xform-smiles/regress_default_model.txt b/examples/xform-smiles/regress_default_model.txt
@@ -16,3 +16,5 @@ vocab_size = 40000
 epochs = 400
 batch_size = 32
 loss = 'mean_squared_error'
+optimizer = 'adam'
+learning_rate = 0.00001
diff --git a/examples/xform-smiles/sct_baseline_keras.py b/examples/xform-smiles/sct_baseline_keras.py
@@ -1,7 +1,6 @@
 # Setup
 
 import os
-import sys
 # import gzip
 
 # import math
@@ -15,8 +14,6 @@
 from tensorflow.keras.callbacks import ModelCheckpoint, CSVLogger, ReduceLROnPlateau, EarlyStopping
 
 file_path = os.path.dirname(os.path.realpath(__file__))
-lib_path = os.path.abspath(os.path.join(file_path, '..', '..', 'common'))
-sys.path.append(lib_path)
 
 import candle
 import smiles_transformer as st
@@ -63,6 +60,8 @@ def run(params):
 
     model.load_weights('smile_class.autosave.model.h5')
 
+    return history
+
 
 def main():
     params = initialize_parameters()
diff --git a/examples/xform-smiles/smiles_class_transformer.py b/examples/xform-smiles/smiles_class_transformer.py
@@ -3,7 +3,6 @@
 import pandas as pd
 # import numpy as np
 import os
-import sys
 # import gzip
 import argparse
 
@@ -24,8 +23,6 @@
 from tensorflow.keras.preprocessing import text
 
 file_path = os.path.dirname(os.path.realpath(__file__))
-lib_path = os.path.abspath(os.path.join(file_path, '..', '..', 'common'))
-sys.path.append(lib_path)
 
 psr = argparse.ArgumentParser(description='input csv file')
 psr.add_argument('--in_train', default='in_train')
diff --git a/examples/xform-smiles/smiles_regress_transformer.py b/examples/xform-smiles/smiles_regress_transformer.py
@@ -3,7 +3,6 @@
 import pandas as pd
 # import numpy as np
 import os
-import sys
 # import gzip
 import argparse
 
@@ -26,8 +25,6 @@
 
 
 file_path = os.path.dirname(os.path.realpath(__file__))
-lib_path = os.path.abspath(os.path.join(file_path, '..', '..', 'common'))
-sys.path.append(lib_path)
 
 psr = argparse.ArgumentParser(description='input csv file')
 psr.add_argument('--in_train', default='in_train')
diff --git a/examples/xform-smiles/smiles_transformer.py b/examples/xform-smiles/smiles_transformer.py
@@ -1,7 +1,6 @@
 from __future__ import print_function
 
 import os
-import sys
 
 import pandas as pd
 
@@ -13,8 +12,6 @@
 from tensorflow.keras import backend as K
 
 file_path = os.path.dirname(os.path.realpath(__file__))
-lib_path2 = os.path.abspath(os.path.join(file_path, '..', '..', 'common'))
-sys.path.append(lib_path2)
 
 import candle
 
diff --git a/examples/xform-smiles/srt_baseline_keras.py b/examples/xform-smiles/srt_baseline_keras.py
@@ -1,7 +1,6 @@
 # Setup
 
 import os
-import sys
 # import gzip
 
 # import math
@@ -11,12 +10,9 @@
 # import matplotlib.pyplot as plt
 
 from tensorflow.keras import backend as K
-import tensorflow.keras.optimizers as optimizers
 from tensorflow.keras.callbacks import ModelCheckpoint, CSVLogger, ReduceLROnPlateau, EarlyStopping
 
 file_path = os.path.dirname(os.path.realpath(__file__))
-lib_path = os.path.abspath(os.path.join(file_path, '..', '..', 'common'))
-sys.path.append(lib_path)
 
 import candle
 import smiles_transformer as st
@@ -52,18 +48,22 @@ def run(params):
 
     model = st.transformer_model(params)
 
-    optimizer = optimizers.deserialize({'class_name': params['optimizer'], 'config': {}})
+    kerasDefaults = candle.keras_default_config()
+
+    optimizer = candle.build_optimizer(params['optimizer'], params['learning_rate'], kerasDefaults)
+
+# optimizer = optimizers.deserialize({'class_name': params['optimizer'], 'config': {}})
 
     # I don't know why we set base_lr. It doesn't appear to be used.
     # if 'base_lr' in params and params['base_lr'] > 0:
     #     base_lr = params['base_lr']
     # else:
     #     base_lr = K.get_value(optimizer.lr)
 
-    if 'learning_rate' in params and params['learning_rate'] > 0:
-        K.set_value(optimizer.lr, params['learning_rate'])
-        print('Done setting optimizer {} learning rate to {}'.format(
-            params['optimizer'], params['learning_rate']))
+#     if 'learning_rate' in params and params['learning_rate'] > 0:
+#         K.set_value(optimizer.lr, params['learning_rate'])
+#         print('Done setting optimizer {} learning rate to {}'.format(
+#             params['optimizer'],params['learning_rate']))
 
     model.compile(loss='mean_squared_error',
                   optimizer=optimizer,
@@ -76,15 +76,17 @@ def run(params):
     reduce_lr = ReduceLROnPlateau(monitor='val_loss', factor=0.75, patience=20, verbose=1, mode='auto', epsilon=0.0001, cooldown=3, min_lr=0.000000001)
     early_stop = EarlyStopping(monitor='val_loss', patience=100, verbose=1, mode='auto')
 
-    model.fit(x_train, y_train,
-              batch_size=params['batch_size'],
-              epochs=params['epochs'],
-              verbose=1,
-              validation_data=(x_val, y_val),
-              callbacks=[checkpointer, csv_logger, reduce_lr, early_stop])
+    history = model.fit(x_train, y_train,
+                        batch_size=params['batch_size'],
+                        epochs=params['epochs'],
+                        verbose=1,
+                        validation_data=(x_val, y_val),
+                        callbacks=[checkpointer, csv_logger, reduce_lr, early_stop])
 
     model.load_weights('smile_regress.autosave.model.h5')
 
+    return history
+
 
 def main():
     params = initialize_parameters()