You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -257,27 +257,27 @@ The default `VwPolicy` is initialized with some default arguments. The default e
257
257
258
258
The `VwPolicy`'s data files can be stored and examined or used to do [off policy evaluation](https://vowpalwabbit.org/docs/vowpal_wabbit/python/latest/tutorials/off_policy_evaluation.html) for hyper parameter tuning.
259
259
260
-
The way to do this is to set a log file path to `vw_logs` on chain creation:
260
+
The way to do this is to set a log file path to `rl_logs` on chain creation:
261
261
262
-
`picker = learn_to_pick.PickBest.create(vw_logs=<path to log FILE>, [...])`
262
+
`picker = learn_to_pick.PickBest.create(rl_logs=<path to log FILE>, [...])`
263
263
264
264
### Advanced featurization options
265
265
266
266
#### auto_embed
267
267
268
268
By default the input that is wrapped (`ToSelectFrom`, `BasedOn`) is not tampered with. This might not be sufficient featurization, so based on how complex the scenario is you can set auto-embeddings to ON
This will produce more complex embeddings and featurizations of the inputs, likely accelerating RL learning, albeit at the cost of increased runtime.
273
273
274
-
By default, [sbert.net's sentence_transformers's ](https://www.sbert.net/docs/pretrained_models.html#model-overview)`all-mpnet-base-v2` model will be used for these embeddings but you can set a different embeddings model by initializing feature_embedder with a different model. You could also set an entirely different embeddings encoding object, as long as it has an `encode()` function that returns a list of the encodings.
274
+
By default, [sbert.net's sentence_transformers's ](https://www.sbert.net/docs/pretrained_models.html#model-overview)`all-mpnet-base-v2` model will be used for these embeddings but you can set a different embeddings model by initializing featurizer with a different model. You could also set an entirely different embeddings encoding object, as long as it has an `encode()` function that returns a list of the encodings.
275
275
276
276
```python
277
277
from sentence_transformers import SentenceTransformer
@@ -325,12 +325,12 @@ class RLLoop(Generic[TEvent]):
325
325
- metrics (Optional[Union[MetricsTrackerRollingWindow, MetricsTrackerAverage]]): Tracker for metrics, can be set to None.
326
326
327
327
Initialization Attributes:
328
-
- feature_embedder (Embedder): Embedder used for the `BasedOn` and `ToSelectFrom` inputs.
328
+
- featurizer (Featurizer): Featurizer used for the `BasedOn` and `ToSelectFrom` inputs.
329
329
- model_save_dir (str, optional): Directory for saving the VW model. Default is the current directory.
330
330
- reset_model (bool): If set to True, the model starts training from scratch. Default is False.
331
331
- vw_cmd (List[str], optional): Command line arguments for the VW model.
332
332
- policy (Type[VwPolicy]): Policy used by the chain.
333
-
- vw_logs (Optional[Union[str, os.PathLike]]): Path for the VW logs.
333
+
- rl_logs (Optional[Union[str, os.PathLike]]): Path for the VW logs.
334
334
- metrics_step (int): Step for the metrics tracker. Default is -1. If set without metrics_window_size, average metrics will be tracked, otherwise rolling window metrics will be tracked.
335
335
- metrics_window_size (int): Window size for the metrics tracker. Default is -1. If set, rolling window metrics will be tracked.
Text Embedder class that embeds the `BasedOn` and `ToSelectFrom` inputs into a format that can be used by the learning policy
47
+
Text Featurizer class that embeds the `BasedOn` and `ToSelectFrom` inputs into a format that can be used by the learning policy
48
48
49
49
Attributes:
50
50
model name (Any, optional): The type of embeddings to be used for feature representation. Defaults to BERT SentenceTransformer.
@@ -259,7 +259,7 @@ class PickBest(base.RLLoop[PickBestEvent]):
259
259
RLLoop
260
260
261
261
Attributes:
262
-
feature_embedder (PickBestFeatureEmbedder, optional): Is an advanced attribute. Responsible for embedding the `BasedOn` and `ToSelectFrom` inputs. If omitted, a default embedder is utilized.
262
+
featurizer (PickBestFeaturizer, optional): Is an advanced attribute. Responsible for embedding the `BasedOn` and `ToSelectFrom` inputs. If omitted, a default embedder is utilized.
0 commit comments