Commit 91c3ec2

Merge pull request #112169 from raviskolli/main
NLP documentation for support of Huggingface models
2 parents 76e5f7a + 6413e98 commit 91c3ec2

1 file changed: +20 -14 lines changed

articles/machine-learning/how-to-auto-train-nlp-models.md

Lines changed: 20 additions & 14 deletions
@@ -343,23 +343,29 @@ AutoML NLP allows you to provide a list of models and combinations of hyperparam
 
 All the pre-trained text DNN models currently available in AutoML NLP for fine-tuning are listed below:
 
-* bert_base_cased
-* bert_large_uncased
-* bert_base_multilingual_cased
-* bert_base_german_cased
-* bert_large_cased
-* distilbert_base_cased
-* distilbert_base_uncased
-* roberta_base
-* roberta_large
-* distilroberta_base
-* xlm_roberta_base
-* xlm_roberta_large
-* xlnet_base_cased
-* xlnet_large_cased
+* bert-base-cased
+* bert-large-uncased
+* bert-base-multilingual-cased
+* bert-base-german-cased
+* bert-large-cased
+* distilbert-base-cased
+* distilbert-base-uncased
+* roberta-base
+* roberta-large
+* distilroberta-base
+* xlm-roberta-base
+* xlm-roberta-large
+* xlnet-base-cased
+* xlnet-large-cased
 
 Note that the large models are larger than their base counterparts. They are typically more performant, but they take up more GPU memory and time for training. As such, their SKU requirements are more stringent: we recommend running on ND-series VMs for the best results.
 
+## Supported model algorithms - HuggingFace (preview)
+
+With the new backend that runs on [Azure Machine Learning pipelines](concept-ml-pipelines.md), you can additionally use any text/token classification model from the HuggingFace Hub for [Text Classification](https://huggingface.co/models?pipeline_tag=text-classification&library=transformers), [Token Classification](https://huggingface.co/models?pipeline_tag=token-classification&sort=trending) which is part of the transformers library (such as microsoft/deberta-large-mnli). You may also find a curated list of models in [Azure Machine Learning model registry](concept-foundation-models.md?view=azureml-api-2&preserve-view=true) that have been validated with the pipeline components.
+
+Using any HuggingFace model will trigger runs using pipeline components. If both legacy and HuggingFace models are used, all runs/trials will be triggered using components.
+
 ## Supported hyperparameters
 
 The following table describes the hyperparameters that AutoML NLP supports.
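
This diff makes two changes: the built-in model names switch from underscores to hyphens, and a new section says that using any HuggingFace model moves every trial onto pipeline components. The sketch below illustrates both rules in plain Python. It is not part of the Azure Machine Learning SDK: `LEGACY_MODELS`, `normalize_model_name`, and `uses_pipeline_components` are hypothetical names, and the sketch assumes that any name outside the built-in list refers to a HuggingFace Hub model.

```python
# Hypothetical helpers for illustration; none of these names come from the Azure ML SDK.

# The 14 built-in ("legacy") model names, in the hyphenated form this commit introduces.
LEGACY_MODELS = frozenset({
    "bert-base-cased", "bert-large-uncased", "bert-base-multilingual-cased",
    "bert-base-german-cased", "bert-large-cased", "distilbert-base-cased",
    "distilbert-base-uncased", "roberta-base", "roberta-large",
    "distilroberta-base", "xlm-roberta-base", "xlm-roberta-large",
    "xlnet-base-cased", "xlnet-large-cased",
})


def normalize_model_name(name: str) -> str:
    """Map an old underscore-style built-in name to its hyphenated form.

    Names that do not match a built-in model (for example a Hub ID such as
    "microsoft/deberta-large-mnli") are returned unchanged.
    """
    candidate = name.replace("_", "-")
    return candidate if candidate in LEGACY_MODELS else name


def uses_pipeline_components(model_names) -> bool:
    """Return True if at least one requested model is not a built-in model.

    Assumption: a non-built-in name is a HuggingFace Hub model. Per the added
    section, one such model is enough for all trials to run on components.
    """
    return any(
        normalize_model_name(name) not in LEGACY_MODELS for name in model_names
    )
```

For example, `uses_pipeline_components(["roberta-base", "microsoft/deberta-large-mnli"])` evaluates to `True` because the second name is not in the built-in list, while a sweep over only built-in names evaluates to `False`.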
