Skip to content

Commit 16e64fa

Browse files
committed
Fix links
1 parent ec87120 commit 16e64fa

File tree

3 files changed

+10
-10
lines changed

3 files changed

+10
-10
lines changed

articles/machine-learning/how-to-auto-train-nlp-models.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -232,7 +232,7 @@ Named entity recognition (NER)|`"eng"` <br> `"deu"` <br> `"mul"`| English&nbsp
232232

233233
[!INCLUDE [cli v2](includes/machine-learning-cli-v2.md)]
234234

235-
You can specify your dataset language in the featurization section of your configuration YAML file. BERT is also used in the featurization process of automated ML experiment training, learn more about [BERT integration and featurization in automated ML (SDK v1)](./v1/how-to-configure-auto-features.md#bert-integration-in-automated-ml).
235+
You can specify your dataset language in the featurization section of your configuration YAML file. BERT is also used in the featurization process of automated ML experiment training, learn more about [BERT integration and featurization in AutoML (SDK v1)](./v1/how-to-configure-auto-features.md#bert-integration-in-automl).
236236

237237
```azurecli
238238
featurization:
@@ -243,7 +243,7 @@ featurization:
243243

244244
[!INCLUDE [sdk v2](includes/machine-learning-sdk-v2.md)]
245245

246-
You can specify your dataset language with the `set_featurization()` method. BERT is also used in the featurization process of automated ML experiment training, learn more about [BERT integration and featurization in automated ML (SDK v1)](./v1/how-to-configure-auto-features.md?view=azureml-api-1&preserve-view=true#bert-integration-in-automated-ml).
246+
You can specify your dataset language with the `set_featurization()` method. BERT is also used in the featurization process of automated ML experiment training, learn more about [BERT integration and featurization in AutoML (SDK v1)](./v1/how-to-configure-auto-features.md?view=azureml-api-1&preserve-view=true#bert-integration-in-automl).
247247

248248
```python
249249
text_classification_job.set_featurization(dataset_language='eng')

articles/machine-learning/v1/how-to-auto-train-nlp-models.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -189,7 +189,7 @@ Multi-class text classification| `'eng'` <br> `'deu'` <br> `'mul'`| English
189189
Named entity recognition (NER)| `'eng'` <br> `'deu'` <br> `'mul'`| English&nbsp;BERT&nbsp;[cased](https://huggingface.co/bert-base-cased) <br> [German BERT](https://huggingface.co/bert-base-german-cased)<br> [Multilingual BERT](https://huggingface.co/bert-base-multilingual-cased) <br><br>For all other languages, automated ML applies multilingual BERT
190190

191191

192-
You can specify your dataset language in your `FeaturizationConfig`. BERT is also used in the featurization process of automated ML experiment training, learn more about [BERT integration and featurization in automated ML](how-to-configure-auto-features.md#bert-integration-in-automated-ml).
192+
You can specify your dataset language in your `FeaturizationConfig`. BERT is also used in the featurization process of automated ML experiment training, learn more about [BERT integration and featurization in AutoML](how-to-configure-auto-features.md#bert-integration-in-automl).
193193

194194
```python
195195
from azureml.automl.core.featurization import FeaturizationConfig

articles/machine-learning/v1/how-to-configure-auto-features.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -143,13 +143,13 @@ Supported customizations include:
143143
|--|--|
144144
|**Column purpose update**|Override the autodetected feature type for the specified column.|
145145
|**Transformer parameter update** |Update the parameters for the specified transformer. Currently supports *Imputer* (mean, most frequent, and median) and *HashOneHotEncoder*.|
146-
|**Drop columns** |Specifies columns to drop from being featurized.|
146+
|**Drop columns*** |Specifies columns to drop from being featurized.|
147147
|**Block transformers**| Specifies block transformers to be used in the featurization process.|
148148

149149
>[!NOTE]
150-
> The **drop columns** functionality is deprecated as of SDK version 1.19. Drop columns from your dataset as part of data cleansing, before consuming it in your AutoML experiment.
150+
> *The **drop columns** functionality is deprecated as of SDK version 1.19. Drop columns from your dataset as part of data cleansing, before consuming it in your AutoML experiment.
151151
152-
Create the `FeaturizationConfig` object by using API calls:
152+
You can create the `FeaturizationConfig` object by using API calls:
153153

154154
```python
155155
featurization_config = FeaturizationConfig()
@@ -241,7 +241,7 @@ Output
241241

242242
### Scaling and normalization
243243

244-
To understand scaling and normalization, and the selected algorithm with its hyperparameter values, use `fitted_model.steps`.
244+
To understand scaling/normalization and the selected algorithm with its hyperparameter values, use `fitted_model.steps`.
245245

246246
The following sample output is from running `fitted_model.steps` for a chosen run:
247247

@@ -334,9 +334,9 @@ If the underlying model doesn't support the `predict_proba()` function or the fo
334334

335335
## BERT integration in AutoML
336336

337-
[BERT](https://techcommunity.microsoft.com/t5/azure-ai/how-bert-is-integrated-into-azure-automated-machine-learning/ba-p/1194657) is used in the featurization layer of AutoML. In this layer, if a column contains free text or other types of data like timestamps or simple numbers, then featurization is applied accordingly.
337+
[Bidirectional Encoder Representations from Transformers (BERT)](https://techcommunity.microsoft.com/t5/azure-ai/how-bert-is-integrated-into-azure-automated-machine-learning/ba-p/1194657) is used in the featurization layer of AutoML. In this layer, if a column contains free text or other types of data like timestamps or simple numbers, then featurization is applied accordingly.
338338

339-
For BERT, the model is fine-tuned and trained utilizing the user-provided labels. From here, document embeddings are output as features alongside others, like timestamp-based features, day of week.
339+
For BERT, the model is fine-tuned and trained by utilizing the user-provided labels. From here, document embeddings are output as features alongside others, like timestamp-based features, day of week.
340340

341341
Learn how to [Set up AutoML to train a natural language processing model with Python](how-to-auto-train-nlp-models.md).
342342

@@ -346,7 +346,7 @@ In order to invoke BERT, set `enable_dnn: True` in your `automl_settings` and us
346346

347347
AutoML takes the following steps for BERT.
348348

349-
1. **Preprocesses and tokenizes all text columns**. For example, the `StringCast` transformer can be found in the final model's featurization summary. An example of how to produce the model's featurization summary can be found in [this Jupyter notebook](https://github.com/Azure/azureml-examples/blob/v1-archive/v1/python-sdk/tutorials/automl-with-azureml/classification-text-dnn/auto-ml-classification-text-dnn.ipynb).
349+
1. **Preprocesses and tokenizes all text columns**. For example, the `StringCast` transformer can be found in the final model's featurization summary. An example of how to produce the model's featurization summary can be found in this [Jupyter notebook](https://github.com/Azure/azureml-examples/blob/v1-archive/v1/python-sdk/tutorials/automl-with-azureml/classification-text-dnn/auto-ml-classification-text-dnn.ipynb).
350350

351351
1. **Concatenates all text columns into a single text column**, hence the `StringConcatTransformer` in the final model.
352352

0 commit comments

Comments
 (0)