Skip to content

Commit 0f179ef

Browse files
authored
Merge pull request #279983 from MicrosoftDocs/main
7/3 11:00 AM IST Publish
2 parents a363f71 + 8ec6949 commit 0f179ef

File tree

44 files changed

+670
-458
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

44 files changed

+670
-458
lines changed

articles/ai-studio/how-to/deploy-models-timegen-1.md

Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -47,6 +47,42 @@ You can deploy TimeGEN-1 as a serverless API with pay-as-you-go billing. Nixtla
4747
- An [Azure AI Studio project](../how-to/create-projects.md).
4848
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, visit [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
4949

50+
### Pricing information
51+
52+
#### Estimate the number of tokens needed
53+
54+
Before you create a deployment, it's useful to estimate the number of tokens that you plan to use and be billed for.
55+
One token corresponds to one data point in your input dataset or output dataset.
56+
57+
Suppose you have the following input time series dataset:
58+
59+
| Unique_id | Timestamp | Target Variable | Exogenous Variable 1 | Exogenous Variable 2 |
60+
|:---------:|:-------------------:|:---------------:|:--------------------:|:--------------------:|
61+
| BE | 2016-10-22 00:00:00 | 70.00 | 49593.0 | 57253.0 |
62+
| BE | 2016-10-22 01:00:00 | 37.10 | 46073.0 | 51887.0 |
63+
64+
To determine the number of tokens, multiply the number of rows (in this example, two) and the number of columns used for forecasting—not counting the unique_id and timestamp columns (in this example, three) to get a total of six tokens.
65+
66+
Given the following output dataset:
67+
68+
| Unique_id | Timestamp | Forecasted Target Variable |
69+
|:---------:|:-------------------:|:--------------------------:|
70+
| BE | 2016-10-22 02:00:00 | 46.57 |
71+
| BE | 2016-10-22 03:00:00 | 48.57 |
72+
73+
You can also determine the number of tokens by counting the number of data points returned after data forecasting. In this example, the number of tokens is two.
74+
75+
#### Estimate the pricing
76+
77+
There are four pricing meters, as described in the following table:
78+
79+
| Pricing Meter | Description |
80+
|-----------------------------------------------|-----------------------------------------------------------------------------------------|
81+
| paygo-inference-input-tokens | Costs associated with the tokens used as input for inference when *finetune_steps* = 0 |
82+
| paygo-inference-output-tokens | Costs associated with the tokens used as output for inference when *finetune_steps* = 0 |
83+
| paygo-finetuned-model-inference-input-tokens | Costs associated with the tokens used as input for inference when *finetune_steps* > 0 |
84+
| paygo-finetuned-model-inference-output-tokens | Costs associated with the tokens used as output for inference when *finetune_steps* > 0 |
85+
5086
### Create a new deployment
5187

5288
These steps demonstrate the deployment of TimeGEN-1. To create a deployment:

articles/aks/create-node-pools.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ This article shows you how to create one or more node pools in an AKS cluster.
3131
The following limitations apply when you create AKS clusters that support multiple node pools:
3232

3333
* See [Quotas, virtual machine size restrictions, and region availability in Azure Kubernetes Service (AKS)](quotas-skus-regions.md).
34-
* You can delete system node pools if you have another system node pool to take its place in the AKS cluster. Otherwise, you cannot delete the system node pool.
34+
* You can delete the system node pool if you have another system node pool to take its place in the AKS cluster. Otherwise, you cannot delete the system node pool.
3535
* System pools must contain at least one node, and user node pools may contain zero or more nodes.
3636
* The AKS cluster must use the Standard SKU load balancer to use multiple node pools. This feature isn't supported with Basic SKU load balancers.
3737
* The AKS cluster must use Virtual Machine Scale Sets for the nodes.
@@ -60,7 +60,7 @@ The following limitations apply when you create AKS clusters that support multip
6060
--name $CLUSTER_NAME \
6161
--vm-set-type VirtualMachineScaleSets \
6262
--node-count 2 \
63-
--generate-ssh-keys \
63+
--location $LOCATION \
6464
--load-balancer-sku standard \
6565
--generate-ssh-keys
6666
```

articles/app-service/manage-custom-dns-buy-domain.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -71,7 +71,7 @@ For pricing information on App Service domains, visit the [App Service Pricing p
7171
| Setting | Description |
7272
| -------- | ----------- |
7373
| **Auto renewal** | Your App Service domain is registered to you at one-year increments. Enable auto renewal so that your domain registration doesn't expire and that you retain ownership of the domain. Your Azure subscription is automatically charged the yearly domain registration fee at the time of renewal. If you leave it disabled, you must [renew it manually](#renew-the-domain). |
74-
| **Privacy protection** | Enabled by default. Privacy protection hides your domain registration contact information from the WHOIS database. Privacy protection is already included in the yearly domain registration fee. To opt out, select **Disable**. |
74+
| **Privacy protection** | Enabled by default. Privacy protection hides your domain registration contact information from the WHOIS database and is already included in the yearly domain registration fee. To opt out, select **Disable**. Privacy protection is not supported in following top-level domains (TLDs): co.uk, in, org.uk, co.in, and nl. |
7575

7676
1. Select **Next: Tags** and set the tags you want for your App Service domain. Tagging isn't required for using App Service domains, but is a [feature in Azure that helps you manage your resources](../azure-resource-manager/management/tag-resources.md).
7777

articles/azure-cache-for-redis/cache-best-practices-performance.md

Lines changed: 10 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn how to test the performance of Azure Cache for Redis.
55
author: flang-msft
66
ms.service: cache
77
ms.topic: conceptual
8-
ms.date: 06/19/2023
8+
ms.date: 07/01/2024
99
ms.author: franlanglois
1010
---
1111

@@ -17,7 +17,7 @@ Fortunately, several tools exist to make benchmarking Redis easier. Two of the m
1717

1818
## How to use the redis-benchmark utility
1919

20-
1. Install open source Redis server to a client VM you can use for testing. The redis-benchmark utility is built into the open source Redis distribution. Follow the [Redis documentation](https://redis.io/docs/latest/operate/oss_and_stack/install/install-redis/) for instructions on how to install the open source image.
20+
1. Install open source Redis server to a client virtual machines (VMs) you can use for testing. The redis-benchmark utility is built into the open source Redis distribution. Follow the [Redis documentation](https://redis.io/docs/latest/operate/oss_and_stack/install/install-redis/) for instructions on how to install the open source image.
2121

2222
1. The client VM used for testing should be _in the same region_ as your Azure Cache for Redis instance.
2323

@@ -49,6 +49,10 @@ Fortunately, several tools exist to make benchmarking Redis easier. Two of the m
4949

5050
- On the Premium tier, scaling out, clustering, is typically recommended before scaling up. Clustering allows Redis server to use more vCPUs by sharding data. Throughput should increase roughly linearly when adding shards in this case.
5151

52+
- On _C0_ and _C1_ Standard caches, while internal Defender scanning is running on the VMs, you might see short spikes in server load not caused by an increase in cache requests. You see higher latency for requests while internal Defender scans are run on these tiers a couple of times a day. Caches on the _C0_ and _C1_ tiers only have a single core to multitask, dividing the work of serving internal Defender scanning and Redis requests. You can reduce the effect by scaling to a higher tier offering with multiple CPU cores, such as _C2_.
53+
54+
The increased cache size on the higher tiers helps address any latency concerns. Also, at the _C2_ level, you have support for as many as 2,000 client connections.
55+
5256
## Redis-benchmark examples
5357

5458
**Pre-test setup**:
@@ -88,7 +92,7 @@ redis-benchmark -h yourcache.region.redisenterprise.cache.azure.net -p 10000 -a
8892

8993
## Example performance benchmark data
9094

91-
The following tables show the maximum throughput values that were observed while testing various sizes of Standard, Premium, Enterprise, and Enterprise Flash caches. We used `redis-benchmark` from an IaaS Azure VM against the Azure Cache for Redis endpoint. The throughput numbers are only for GET commands. Typically, SET commands have a lower throughput. These numbers are optimized for throughput. Real-world throughput under acceptable latency conditions may be lower.
95+
The following tables show the maximum throughput values that were observed while testing various sizes of Standard, Premium, Enterprise, and Enterprise Flash caches. We used `redis-benchmark` from an IaaS Azure VM against the Azure Cache for Redis endpoint. The throughput numbers are only for GET commands. Typically, SET commands have a lower throughput. These numbers are optimized for throughput. Real-world throughput under acceptable latency conditions might be lower.
9296

9397
The following configuration was used to benchmark throughput for the Basic, Standard, and Premium tiers:
9498

@@ -144,7 +148,7 @@ redis-benchmark -h yourcache.region.redisenterprise.cache.azure.net -p 10000 -a
144148
145149
#### Enterprise Cluster Policy
146150

147-
| Instance | Size | vCPUs | Expected network bandwidth (Mbps)| GET requests per second without SSL (1-kB value size) | GET requests per second with SSL (1-kB value size) |
151+
| Instance | Size | vCPUs | Expected network bandwidth (Mbps)| `GET` requests per second without SSL (1-kB value size) | `GET` requests per second with SSL (1-kB value size) |
148152
|:---:| --- | ---:|---:| ---:| ---:|
149153
| E10 | 12 GB | 4 | 4,000 | 300,000 | 207,000 |
150154
| E20 | 25 GB | 4 | 4,000 | 680,000 | 480,000 |
@@ -156,7 +160,7 @@ redis-benchmark -h yourcache.region.redisenterprise.cache.azure.net -p 10000 -a
156160

157161
#### OSS Cluster Policy
158162

159-
| Instance | Size | vCPUs | Expected network bandwidth (Mbps)| GET requests per second without SSL (1-kB value size) | GET requests per second with SSL (1-kB value size) |
163+
| Instance | Size | vCPUs | Expected network bandwidth (Mbps)| `GET` requests per second without SSL (1-kB value size) | `GET` requests per second with SSL (1-kB value size) |
160164
|:---:| --- | ---:|---:| ---:| ---:|
161165
| E10 | 12 GB | 4 | 4,000 | 1,400,000 | 1,000,000 |
162166
| E20 | 25 GB | 4 | 4,000 | 1,200,000 | 900,000 |
@@ -170,7 +174,7 @@ redis-benchmark -h yourcache.region.redisenterprise.cache.azure.net -p 10000 -a
170174

171175
In addition to scaling up by moving to larger cache size, you can boost performance by [scaling out](cache-how-to-scale.md#how-to-scale-up-and-out---enterprise-and-enterprise-flash-tiers). In the Enterprise tiers, scaling out is called increasing the _capacity_ of the cache instance. A cache instance by default has capacity of two--meaning a primary and replica node. An Enterprise cache instance with a capacity of four indicates that the instance was scaled out by a factor of two. Scaling out provides access to more memory and vCPUs. Details on how many vCPUs are used by the core Redis process at each cache size and capacity can be found at the [Enterprise tiers best practices page](cache-best-practices-enterprise-tiers.md#sharding-and-cpu-utilization). Scaling out is most effective when using the OSS cluster policy.
172176

173-
The following tables show the GET requests per second at different capacities, using SSL and a 1-kB value size.
177+
The following tables show the `GET` requests per second at different capacities, using SSL and a 1-kB value size.
174178

175179
#### Scaling out - Enterprise cluster policy
176180

0 commit comments

Comments
 (0)