This repository was archived by the owner on Jun 3, 2025. It is now read-only.

Commit e29e74e

HF typos (#106)
HuggingFace typos. Should be Hugging Face. Multiple places.

* Update question-answering.mdx
* Update text-classification.mdx
* Update question-answering.mdx
* Update token-classification.mdx
* Update text-classification.mdx
* Update sparsify-a-model.mdx
* Update supported-integrations.mdx
* Update question-answering.mdx

Co-authored-by: Robert Shaw <[email protected]>
1 parent: bb08a95 · commit: e29e74e

5 files changed (+35, -35 lines)

src/content/get-started/sparsify-a-model.mdx

Lines changed: 1 addition & 1 deletion
@@ -11,7 +11,7 @@ index: 4000
 SparseML enables you to create a sparse model from scratch. The library contains state-of-the-art sparsification algorithms, including pruning, distillation, and quantization techniques.
 
 These algorithms are built on top of sparsification recipes, enabling easy integration into custom ML training pipelines to sparsify most neural networks.
-Additionally, SparseML integrates with popular ML repositories like HuggingFace Transformers and Ultralytics YOLO. With these integrations, creating a recipe and passing it to a CLI is all you need to sparsify a model.
+Additionally, SparseML integrates with popular ML repositories like Hugging Face Transformers and Ultralytics YOLO. With these integrations, creating a recipe and passing it to a CLI is all you need to sparsify a model.
 
 Aside from sparsification algorithms, SparseML contains generic export pathways for performant deployments.
 These export pathways ensure the model saves in the correct format and rewrites the inference graphs for performance, such as quantized operator folding.

src/content/get-started/sparsify-a-model/supported-integrations.mdx

Lines changed: 12 additions & 12 deletions
@@ -1,16 +1,16 @@
 ---
 title: "Supported Integrations"
-metaTitle: "Sparsifying a model for SparseML Integrations"
+metaTitle: "Sparsifying a Model for SparseML Integrations"
 metaDescription: "Sparsify a model with SparseML and recipes for smaller, faster, and cheaper model inferences in deployment"
 githubURL: "https://github.com/neuralmagic/docs/blob/main/src/content/get-started/sparsify-a-model/supported-integrations.mdx"
 index: 1000
 ---
 
-# Sparsifying a model for SparseML Integrations
+# Sparsifying a Model for SparseML Integrations
 
 This page walks through an example of creating a sparsification recipe to prune a dense model from scratch and applying a recipe to a supported integration.
 
-SparseML has pre-made integrations with many popular model repositories, such as with HuggingFace Transformers and Ultralytics YOLOv5.
+SparseML has pre-made integrations with many popular model repositories, such as with Hugging Face Transformers and Ultralytics YOLOv5.
 For these integrations, a sparsification recipe is all you need, and you can apply state-of-the-art sparsification algorithms, including
 pruning, distillation, and quantization, with a single command line call.
 
@@ -31,17 +31,17 @@ Important hyperparameters that need to be set are the following:
 - The layers to prune and their target sparsity levels
 - The number of epochs for pruning
 - The frequency of pruning
-- The length of time to fine tune after pruning
-- The the learning rates to (LR) for pruning and finetuning
+- The length of time to fine-tune after pruning
+- The learning rates to (LR) for pruning and fine-tuning
 
 The proper hyperparameter values will differ for different model architectures, training schemes, and domains, but there is some general intuition for safe starting values.
 The following are reasonably default values to start with:
 - The final sparsity is set to 80% sparsity applied globally across all layers.
 - The running frequency is set to pruning once per epoch (up to a few times per epoch for shorter schedules).
 - The number of pruning epochs is set to 1/3 the original training epochs.
-- The number of finetuning epochs is set to 1/4 the original epochs.
+- The number of fine-tuning epochs is set to 1/4 the original epochs.
 - The pruning LR is set to the midrange from the model's training start and final LRs.
-- The finetuning LRs cycle from the pruning LR to the final LR is used for training.
+- The fine-tuning LRs cycle from the pruning LR to the final LR is used for training.
 
 SparseML conveniently encodes these hyperparameters into a YAML-based **Recipe** file. The rest of the system parses the arguments in the YAML file to set the parameters of the algorithm.
 
@@ -76,16 +76,16 @@ In this recipe:
 - `GlobalMagnitudePruningModifier` applies gradual magnitude pruning globally across all the prunable parameters/weights in a model.
 - `GlobalMagnitudePruningModifier` starts at 5% sparsity at epoch 0 and gradually ramps up to 80% sparsity at epoch 30, pruning at the start of each epoch.
 - `SetLearningRateModifier` sets the pruning LR to 0.05 (midpoint between the original 0.1 and 0.001 training LRs).
-- `LearningRateFunctionModifier` cycles the finetuning LR from the pruning LR to 0.001 with a cosine curve (0.001 was the final original training LR).
-- `EpochRangeModifier` expands the training time to continue finetuning for an additional 20 epochs after pruning has ended.
-- 30 pruning epochs and 20 finetuning epochs were chosen based on a 90 epoch training schedule -- be sure to adjust based on the number of epochs used for the initial training for your use case.
+- `LearningRateFunctionModifier` cycles the fine-tuning LR from the pruning LR to 0.001 with a cosine curve (0.001 was the final original training LR).
+- `EpochRangeModifier` expands the training time to continue fine-tuning for an additional 20 epochs after pruning has ended.
+- 30 pruning epochs and 20 fine-tuning epochs were chosen based on a 90 epoch training schedule -- be sure to adjust based on the number of epochs used for the initial training for your use case.
 
 ## Quantization and Quantization Recipes
 
 A quantization recipe systematically reduces the precision for weights and activations within a neural network, generally from `FP32` to `INT8`. Running a quantized
 model increases speed and reduces memory consumption while sacrificing very little in terms of accuracy.
 
-Quantization aware training (QAT) is the standard algorithm. With QAT, fake quantization operators are injected into the graph before quantizable nodes for activations, and weights are wrapped with fake quantization operators.
+Quantization-aware training (QAT) is the standard algorithm. With QAT, fake quantization operators are injected into the graph before quantizable nodes for activations, and weights are wrapped with fake quantization operators.
 The fake quantization operators interpolate the weights and activations down to INT8 on the forward pass but enable a full update of the weights at FP32 on the backward pass.
 The updates to the weights at FP32 throughout the training process allow the model to adapt to the loss of information from quantization on the forward pass.
 QAT generally guarantees better recovery for a given model compared with post-training quantization (PTQ), where training is not used.
 
@@ -124,7 +124,7 @@ Note the `model` is used here as a general placeholder; to determine the name of
 ## Pruning plus Quantization Recipe
 
 To create a pruning and quantization recipe, the pruning and quantization recipes are merged from the previous sections.
-Quantization is added after pruning and finetuning are complete such that the training cycles end with it.
+Quantization is added after pruning and fine-tuning are complete such that the training cycles end with it.
 This prevents stability issues from lacking precision when pruning and utilizing larger LRs.
 
 Combining the two previous recipes creates the following new recipe.yaml file:

src/content/use-cases/natural-language-processing/question-answering.mdx

Lines changed: 7 additions & 7 deletions
@@ -1,12 +1,12 @@
 ---
 title: "Question Answering"
 metaTitle: "NLP Question Answering"
-metaDescription: "Question Answering with HuggingFace Transformers and SparseML to create cheaper and more performant NLP models"
+metaDescription: "Question Answering with Hugging Face Transformers and SparseML to create cheaper and more performant NLP models"
 githubURL: "https://github.com/neuralmagic/docs/blob/main/src/content/use-cases/natural-language-processing/question-answering.mdx"
 index: 1000
 ---
 
-# Question Answering with HuggingFace Transformers and SparseML
+# Question Answering with Hugging Face Transformers and SparseML
 
 This page explains how to create and deploy a sparse Transformer for Question Answering.
 
@@ -19,7 +19,7 @@ This integration enables you to create a sparse model in two ways:
 - **Sparse Transfer Learning** - fine-tune a sparse model (or use one of our [sparse pre-trained models](https://sparsezoo.neuralmagic.com/?domain=nlp&sub_domain=question_answering)) on your own private dataset.
 
 Each option is useful in different situations:
-- **Sparsification from Scratch** enables you to create a sparse version of any model (even those not in the SparseZoo), but requires hand-tuning the hyperparameters of the Sparsification algorithm.
+- **Sparsification from Scratch** enables you to create a sparse version of any model (even those not in the SparseZoo), but requires hand-tuning the hyperparameters of the sparsification algorithm.
 - **Sparse Transfer Learning** is the easiest path to creating a sparse model trained on your data. Simply pull a pre-sparsified model and transfer learning recipe from the SparseZoo and fine-tune on your data with a single command.
 
 ## Installation Requirements
@@ -53,15 +53,15 @@ sparseml.transformers.question_answering \
   --recipe zoo:nlp/question_answering/bert-base/pytorch/huggingface/squad/pruned-aggressive_98
 ```
 
-The SparseML train script is a wrapper around a [HuggingFace script](https://huggingface.co/docs/transformers/run_scripts),
-and usage for most arguments follows the HuggingFace. The most important arguments for SparseML are:
+The SparseML train script is a wrapper around a [Hugging Face script](https://huggingface.co/docs/transformers/run_scripts),
+and usage for most arguments follows the Hugging Face. The most important arguments for SparseML are:
 
 - `--model_name_or_path` indicates which model to start the pruning process from. It can be a SparseZoo stub, HF model identifier, or a path to a local model.
 - `--recipe` points to recipe file containing the sparsification hyperparamters. It can be a SparseZoo stub or a local file. For more on creating a recipe see [here](/user-guide/recipes/creating).
 - `--dataset_name` indicates that we should fine tune on the SQuAD dataset.
 
-To utilize a custom dataset, use the `--train_file` and `--validation_file` arguments. To use a dataset from the HuggingFace hub, use `--dataset_name`.
-See the [HF Docs](https://huggingface.co/docs/transformers/run_scripts#run-a-script) for more details.
+To utilize a custom dataset, use the `--train_file` and `--validation_file` arguments. To use a dataset from the Hugging Face hub, use `--dataset_name`.
+See the [Hugging Face Docs](https://huggingface.co/docs/transformers/run_scripts#run-a-script) for more details.
 
 Run the following to see the full list of options:
 ```bash

src/content/use-cases/natural-language-processing/text-classification.mdx

Lines changed: 8 additions & 8 deletions
@@ -1,12 +1,12 @@
 ---
 title: "Text Classification"
 metaTitle: "NLP Text Classification"
-metaDescription: "Text Classification with HuggingFace Transformers and SparseML to create cheaper and more performant NLP models"
+metaDescription: "Text Classification with Hugging Face Transformers and SparseML to create cheaper and more performant NLP models"
 githubURL: "https://github.com/neuralmagic/docs/blob/main/src/content/use-cases/natural-language-processing/text-classification.mdx"
 index: 2000
 ---
 
-# Text Classification with HuggingFace Transformers and SparseML
+# Text Classification with Hugging Face Transformers and SparseML
 
 This page explains how to create and deploy a sparse Transformer for Text Classification.
 
@@ -19,7 +19,7 @@ This integration enables you to create a sparse model in two ways:
 - **Sparse Transfer Learning** - fine-tune a sparse model (or use one of our [sparse pre-trained models](https://sparsezoo.neuralmagic.com/?domain=nlp&sub_domain=text_classification)) on your own private dataset.
 
 Each option is useful in different situations:
-- **Sparsification from Scratch** enables you to create a sparse version of any model (even those not in the SparseZoo), but requires hand-tuning the hyperparameters of the Sparsification algorithm.
+- **Sparsification from Scratch** enables you to create a sparse version of any model (even those not in the SparseZoo), but requires hand-tuning the hyperparameters of the sparsification algorithm.
 - **Sparse Transfer Learning** is the easiest path to creating a sparse model trained on your data. Simply pull a pre-sparsified model and transfer learning recipe from the SparseZoo and fine-tune on your data with a single command.
 
 ## Installation Requirements
@@ -52,15 +52,15 @@ sparseml.transformers.text_classification \
   --recipe zoo:nlp/text_classification/bert-base/pytorch/huggingface/mnli/12layer_pruned90-none
 ```
 
-The SparseML train script is a wrapper around a [HuggingFace script](https://huggingface.co/docs/transformers/run_scripts), and
-usage for most arguments follows the HuggingFace. The most important arguments for SparseML are:
-- `model_name_or_path`: specifies starting model. It can be a SparseZoo stub, HF model identifier, or a local directory
+The SparseML train script is a wrapper around a [Hugging Face script](https://huggingface.co/docs/transformers/run_scripts), and
+usage for most arguments follows the Hugging Face. The most important arguments for SparseML are:
+- `model_name_or_path`: specifies starting model. It can be a SparseZoo stub, Hugging Face model identifier, or a local directory
 with `model.pt`, `tokenizer.json` and `config.json`
 - `recipe`: recipe containing the training hyperparamters (SparseZoo stub or a local file)
 - `task_name`: specifies the sentiment analysis task for the MNLI dataset
 
-To utilize a custom dataset, use the `--train_file` and `--validation_file` arguments. To use a dataset from the HuggingFace hub, use `--dataset_name`.
-See the [HF Docs](https://huggingface.co/docs/transformers/run_scripts#run-a-script) for more details.
+To utilize a custom dataset, use the `--train_file` and `--validation_file` arguments. To use a dataset from the Hugging Face hub, use `--dataset_name`.
+See the [Hugging Face Docs](https://huggingface.co/docs/transformers/run_scripts#run-a-script) for more details.
 
 Run the following to see the full list of options:
 ```bash

src/content/use-cases/natural-language-processing/token-classification.mdx

Lines changed: 7 additions & 7 deletions
@@ -1,12 +1,12 @@
 ---
 title: "Token Classification"
 metaTitle: "NLP Token Classification"
-metaDescription: "Token Classification with HuggingFace Transformers and SparseML to create cheaper and more performant NLP models"
+metaDescription: "Token Classification with Hugging Face Transformers and SparseML to create cheaper and more performant NLP models"
 githubURL: "https://github.com/neuralmagic/docs/blob/main/src/content/use-cases/natural-language-processing/text-classification.mdx"
 index: 3000
 ---
 
-# Token Classification with HuggingFace Transformers and SparseML
+# Token Classification with Hugging Face Transformers and SparseML
 
 This page explains how to create and deploy a sparse Transformer for Token Classification.
 
@@ -52,15 +52,15 @@ sparseml.transformers.token_classification \
   --recipe zoo:nlp/token_classification/bert-base/pytorch/huggingface/conll2003/12layer_pruned80_quant-none-vnni
 ```
 
-The SparseML train script is a wrapper around a [HuggingFace script](https://huggingface.co/docs/transformers/run_scripts),
-and usage for most arguments follows the HuggingFace. The most important arguments for SparseML are:
+The SparseML train script is a wrapper around a [Hugging Face script](https://huggingface.co/docs/transformers/run_scripts),
+and usage for most arguments follows the Hugging Face. The most important arguments for SparseML are:
 
-- `--model_name_or_path` indicates which model to start the pruning process from. It can be a SparseZoo stub, HF model identifier, or a path to a local model.
+- `--model_name_or_path` indicates which model to start the pruning process from. It can be a SparseZoo stub, Hugging Face model identifier, or a path to a local model.
 - `--recipe` points to recipe file containing the sparsification hyperparamters. It can be a SparseZoo stub or a local file. For more on creating a recipe see [here](/user-guide/recipes/creating).
 - `--dataset_name` indicates that we should fine tune on the CoNLL-2003 dataset.
 
-To utilize a custom dataset, use the `--train_file` and `--validation_file` arguments. To use a dataset from the HuggingFace hub, use `--dataset_name`.
-See the [HF Docs](https://huggingface.co/docs/transformers/run_scripts#run-a-script) for more details.
+To utilize a custom dataset, use the `--train_file` and `--validation_file` arguments. To use a dataset from the Hugging Face hub, use `--dataset_name`.
+See the [Hugging Face Docs](https://huggingface.co/docs/transformers/run_scripts#run-a-script) for more details.
 
 Run the following to see the full list of options:
 ```bash

0 commit comments
