You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cosmos-db/analytical-store-introduction.md
+11-5Lines changed: 11 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,6 +17,11 @@ Azure Cosmos DB analytical store is a fully isolated column store for enabling l
17
17
18
18
Azure Cosmos DB transactional store is schema-agnostic, and it allows you to iterate on your transactional applications without having to deal with schema or index management. In contrast to this, Azure Cosmos DB analytical store is schematized to optimize for analytical query performance. This article describes in detailed about analytical storage.
19
19
20
+
> [!NOTE]
21
+
> Synapse Link for Gremlin API is now in preview. You can enable Synapse Link in your new or existing graphs using Azure CLI. For more information on how to configure it, click [here](configure-synapse-link.md).
22
+
23
+
24
+
20
25
## Challenges with large-scale analytics on operational data
21
26
22
27
The multi-model operational data in an Azure Cosmos DB container is internally stored in an indexed row-based "transactional store". Row store format is designed to allow fast transactional reads and writes in the order-of-milliseconds response times, and operational queries. If your dataset grows large, complex analytical queries can be expensive in terms of provisioned throughput on the data stored in this format. High consumption of provisioned throughput in turn, impacts the performance of transactional workloads that are used by your real-time applications and services.
@@ -220,9 +225,10 @@ df = spark.read\
220
225
221
226
There are two types of schema representation in the analytical store. These types define the schema representation method for all containers in the database account and have tradeoffs between the simplicity of query experience versus the convenience of a more inclusive columnar representation for polymorphic schemas.
222
227
223
-
* Well-defined schema representation, default option for SQL (CORE) API accounts.
228
+
* Well-defined schema representation, default option for SQL (CORE) and Gremlin API accounts.
224
229
* Full fidelity schema representation, default option for Azure Cosmos DB API for MongoDB accounts.
225
230
231
+
226
232
#### Well-defined schema representation
227
233
228
234
The well-defined schema representation creates a simple tabular representation of the schema-agnostic data in the transactional store. The well-defined schema representation has the following considerations:
@@ -325,7 +331,7 @@ Here's a map of all the property data types and their suffix representations in
325
331
326
332
##### Working with the MongoDB `_id` field
327
333
328
-
the MongoDB `_id` field is fundamental to every collection in MongoDB and originally has a hexadecimal representation. As you can see in the table above, `Full Fidelity Schema` will preserve its characteristics, creating a challenge for its vizualiation in Azure Synapse Analytics. For correct visualization, you must convert the `_id` datatype as below:
334
+
the MongoDB `_id` field is fundamental to every collection in MongoDB and originally has a hexadecimal representation. As you can see in the table above, `Full Fidelity Schema` will preserve its characteristics, creating a challenge for its visualization in Azure Synapse Analytics. For correct visualization, you must convert the `_id` datatype as below:
329
335
330
336
###### Spark
331
337
@@ -353,7 +359,7 @@ FROM OPENROWSET('CosmosDB',
353
359
HTAP) WITH (_id VARCHAR(1000)) as HTAP
354
360
```
355
361
356
-
#### Full fidelity schema for SQL API accounts
362
+
#### Full fidelity schema for SQL or Gremlin API accounts
357
363
358
364
It's possible to use full fidelity Schema for SQL (Core) API accounts, instead of the default option, by setting the schema type when enabling Synapse Link on a Cosmos DB account for the first time. Here are the considerations about changing the default schema representation type:
359
365
@@ -447,7 +453,7 @@ Synapse Link, and analytical store by consequence, has different compatibility l
447
453
448
454
### Backup Polices
449
455
450
-
There two possible backup polices and to understand how to use them, two details about Cosmos DB backups are very important:
456
+
There are two possible backup polices and to understand how to use them, the following details about Cosmos DB backups are very important:
451
457
452
458
* The original container is restored without analytical store in both backup modes.
453
459
* Cosmos DB doesn't support containers overwrite from a restore.
@@ -462,7 +468,7 @@ Now let's see how to use backup and restores from the analytical store perspecti
462
468
463
469
#### Restoring a container with TTTL < ATTL
464
470
465
-
When `transactional TTL` is smaller than `analytical TTL`, some data only exists in analytical store and won't be in the restored container. Again your have two possible situations:
471
+
When `transactional TTL` is smaller than `analytical TTL`, some data only exists in analytical store and won't be in the restored container. Again, you have two possible situations:
466
472
* To use the restored container as a replacement for the original container. In this case, when you enable Synapse Link at container level, only the data that was in transactional store will be included in the new analytical store. But please note that the analytical store of the original container remains available for queries as long as the original container exists. You may want to change your application to query both.
467
473
* To use the restored container as a data source to backfill or update the data in the original container:
468
474
* Analytical store will automatically reflect the data operations for the data that is in transactional store.
Copy file name to clipboardExpand all lines: articles/cosmos-db/analytical-store-private-endpoints.md
+6-1Lines changed: 6 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ description: Learn how to set up managed private endpoints for Azure Cosmos DB a
4
4
author: AnithaAdusumilli
5
5
ms.service: cosmos-db
6
6
ms.topic: how-to
7
-
ms.date: 03/02/2021
7
+
ms.date: 09/29/2022
8
8
ms.author: anithaa
9
9
10
10
---
@@ -17,6 +17,11 @@ In this article, you will learn how to set up managed private endpoints for Azur
17
17
> [!NOTE]
18
18
> If you are using Private DNS Zones for Cosmos DB and wish to create a Synapse managed private endpoint to the analytical store sub-resource, you must first create a DNS zone for the analytical store (`privatelink.analytics.cosmos.azure.com`) linked to your Cosmos DB's virtual network.
19
19
20
+
21
+
> [!NOTE]
22
+
> Synapse Link for Gremlin API is now in preview. You can enable Synapse Link in your new or existing graphs using Azure CLI. For more information on how to configure it, click [here](configure-synapse-link.md).
23
+
24
+
20
25
## Enable a private endpoint for the analytical store
21
26
22
27
### Set up Azure Synapse Analytics workspace with a managed virtual network and data-exfiltration
Copy file name to clipboardExpand all lines: articles/cosmos-db/configure-custom-partitioning.md
+10-4Lines changed: 10 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,28 +1,34 @@
1
1
---
2
2
title: Configure custom partitioning to partition analytical store data (Preview)
3
-
description: Learn how to trigger custom partitioning from Azure Synapse Spark notebook using Azure Synapse link for Azure Cosmos DB. It explains the configuration options.
3
+
description: Learn how to trigger custom partitioning from Azure Synapse Spark notebook using Azure Synapse Link for Azure Cosmos DB. It explains the configuration options.
4
4
author: Rodrigossz
5
5
ms.service: cosmos-db
6
6
ms.topic: conceptual
7
-
ms.date: 11/02/2021
7
+
ms.date: 09/29/2022
8
8
ms.author: rosouz
9
9
ms.custom: ignite-fall-2021
10
10
---
11
11
12
12
# Configure custom partitioning to partition analytical store data (Preview)
Custom partitioning enables you to partition analytical store data, on fields that are commonly used as filters in analytical queries, resulting in improved query performance.
16
17
To learn more about custom partitioning, see [what is custom partitioning](custom-partitioning-analytical-store.md) article.
17
18
18
-
To use custom partitioning, you must enable Azure Synapse Link on your Azure Cosmos DB account. To learn more, see [how to configure Azure Synapse Link](configure-synapse-link.md). Custom partitioning execution can be triggered from Azure Synapse Spark notebook using Azure Synapse link for Azure Cosmos DB.
19
+
To use custom partitioning, you must enable Azure Synapse Link on your Azure Cosmos DB account. To learn more, see [how to configure Azure Synapse Link](configure-synapse-link.md). Custom partitioning execution can be triggered from Azure Synapse Spark notebook using Azure Synapse Link for Azure Cosmos DB.
19
20
20
21
> [!IMPORTANT]
21
22
> Custom partitioning feature is currently in public preview. This preview version is provided without a service level agreement, and it's not recommended for production workloads. Certain features might not be supported or might have constrained capabilities. For more information, see [Supplemental Terms of Use for Microsoft Azure Previews](https://azure.microsoft.com/support/legal/preview-supplemental-terms/).
22
23
23
24
> [!NOTE]
24
25
> Azure Cosmos DB accounts should have Azure Synapse Link enabled to take advantage of custom partitioning. Custom partitioning is currently supported for Azure Synapse Spark 2.0 only.
25
26
27
+
28
+
> [!NOTE]
29
+
> Synapse Link for Gremlin API is now in preview. You can enable Synapse Link in your new or existing graphs using Azure CLI. For more information on how to configure it, click [here](configure-synapse-link.md).
30
+
31
+
26
32
## Trigger a custom partitioning job
27
33
28
34
Partitioning can be triggered from an Azure Synapse Spark notebook using Azure Synapse Link. You can schedule it to run as a background job, once or twice a day, or it can be executed more often if needed. You can also choose one or more fields from the dataset as the analytical store partition key.
[Azure Synapse Link for Azure Cosmos DB](synapse-link.md) is a cloud-native hybrid transactional and analytical processing (HTAP) capability that enables you to run near real-time analytics over operational data in Azure Cosmos DB. Synapse Link creates a tight seamless integration between Azure Cosmos DB and Azure Synapse Analytics.
16
16
17
+
> [!NOTE]
18
+
> Synapse Link for Gremlin API is now in preview. You can enable Synapse Link in your new or existing Graphs using Azure CLI.
19
+
17
20
Azure Synapse Link is available for Azure Cosmos DB SQL API or for Azure Cosmos DB API for Mongo DB accounts. Use the following steps to run analytical queries with the Azure Synapse Link for Azure Cosmos DB:
18
21
19
22
*[Enable Azure Synapse Link for your Azure Cosmos DB accounts](#enable-synapse-link)
@@ -23,11 +26,11 @@ Azure Synapse Link is available for Azure Cosmos DB SQL API or for Azure Cosmos
23
26
*[Improve performance with Best Practices](#best)
24
27
*[Use Azure Synapse serverless SQL pool to analyze and visualize data in Power BI](#analyze-with-powerbi)
25
28
26
-
You can also checkout the training module on how to [configure Azure Synapse Link for Azure Cosmos DB.](/training/modules/configure-azure-synapse-link-with-azure-cosmos-db/)
29
+
You can also check the training module on how to [configure Azure Synapse Link for Azure Cosmos DB.](/training/modules/configure-azure-synapse-link-with-azure-cosmos-db/)
27
30
28
31
## <aid="enable-synapse-link"></a>Enable Azure Synapse Link for Azure Cosmos DB accounts
29
32
30
-
The first step to use Synapse Link is to enable it for your Azure Cosmos DB database account. This is an one time operation.
33
+
The first step to use Synapse Link is to enable it for your Azure Cosmos DB database account.
31
34
32
35
> [!NOTE]
33
36
> If you want to use customer-managed keys with Azure Synapse Link, you must configure your account's managed identity in your Azure Key Vault access policy before enabling Synapse Link on your account. To learn more, see how to [Configure customer-managed keys using Azure Cosmos DB accounts' managed identities](how-to-setup-cmk.md#using-managed-identity) article.
@@ -70,6 +73,15 @@ Use `--enable-analytical-storage true` for both **create** or **update** operati
70
73
*[Create a new Azure Cosmos DB account with Synapse Link enabled](/cli/azure/cosmosdb#az-cosmosdb-create-optional-parameters)
71
74
*[Update an existing Azure Cosmos DB account to enable Synapse Link](/cli/azure/cosmosdb#az-cosmosdb-update-optional-parameters)
72
75
76
+
##### Use Azure CLI to enable Synapse Link for Azure Synapse Link for Gremlin API account.
77
+
Synapse Link for Gremlin API is now in preview. You can enable Synapse Link in your new or existing graphs using Azure CLI. Use the CLI command below to enable Synapse Link for your Gremlin API account:
For existing Gremlin API accounts, replace `create` with `update`.
84
+
73
85
#### PowerShell
74
86
75
87
Use `EnableAnalyticalStorage true` for both **create** or **update** operations. You also need to choose the representation schema type. For SQL API accounts you can use `--analytical-storage-schema-type` with the values `FullFidelity` or `WellDefined`. For MongoDB API accounts, always use `-AnalyticalStorageSchemaType FullFidelity`.
@@ -124,6 +136,16 @@ The following options enable Synapse Link in a container by using Azure CLI by s
124
136
*[Create an Azure Cosmos DB MongoDB collection](/cli/azure/cosmosdb/mongodb/collection#az-cosmosdb-mongodb-collection-create-examples)
125
137
*[Create or update an Azure Cosmos DB SQL API container](/cli/azure/cosmosdb/sql/container#az-cosmosdb-sql-container-create)
126
138
139
+
##### Use Azure CLI to enable Synapse Link for Azure Synapse Link for Gremlin API Graphs
140
+
141
+
Synapse Link for Gremlin API is now in preview. You can enable Synapse Link in your new or existing Graphs using Azure CLI. Use the CLI command below to enable Synapse Link for your Gremlin API graphs:
For existing graphs, replace `create` with `update`.
148
+
127
149
#### PowerShell
128
150
129
151
The following options enable Synapse Link in a container by using Azure CLI by setting the `-AnalyticalStorageTtl` property.
@@ -244,7 +266,7 @@ You can find samples to get started with Azure Synapse Link on [GitHub](https://
244
266
245
267
To learn more, see the following docs:
246
268
247
-
*Checkout the training module on how to [configure Azure Synapse Link for Azure Cosmos DB.](/training/modules/configure-azure-synapse-link-with-azure-cosmos-db/)
269
+
*Check the training module on how to [configure Azure Synapse Link for Azure Cosmos DB.](/training/modules/configure-azure-synapse-link-with-azure-cosmos-db/)
248
270
249
271
*[Azure Cosmos DB analytical store overview.](analytical-store-introduction.md)
Copy file name to clipboardExpand all lines: articles/cosmos-db/synapse-link-frequently-asked-questions.yml
+5-5Lines changed: 5 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,7 @@ metadata:
6
6
ms.author: rosouz
7
7
ms.service: cosmos-db
8
8
ms.topic: faq
9
-
ms.date: 04/06/2022
9
+
ms.date: 09/29/2022
10
10
ms.custom: synapse-cosmos-db
11
11
title: Frequently asked questions about Azure Synapse Link for Azure Cosmos DB
12
12
summary: |
@@ -21,7 +21,7 @@ sections:
21
21
- question: |
22
22
Is Azure Synapse Link supported for all Azure Cosmos DB APIs?
23
23
answer: |
24
-
Azure Synapse Link is supported for the Azure Cosmos DB SQL (Core) API and for the Azure Cosmos DB API for MongoDB.
24
+
Azure Synapse Link is supported for the Azure Cosmos DB SQL (Core) API and for the Azure Cosmos DB API for MongoDB. The support for Gremlin API is in preview now.
25
25
26
26
- question: |
27
27
Is Azure Synapse Link supported for multi-region Azure Cosmos DB accounts?
@@ -139,12 +139,12 @@ sections:
139
139
- question: |
140
140
Can I access analytical store with Azure Cosmos DB SDKs?
141
141
answer: |
142
-
No, you can't access analytical store with the Azure Cosmos DB SDKs. You need to use Azure Synapse analitics, Spark or SQL serverless pools.
142
+
No, you can't access analytical store with the Azure Cosmos DB SDKs. You need to use Azure Synapse Analytics, Spark or SQL serverless pools.
143
143
144
144
- question: |
145
145
Can I access analytical store with Azure Cosmos DB REST APIs?
146
146
answer: |
147
-
No, you can't access analytical store with the Azure Cosmos DB REST APIs. You need to use Azure Synapse analitics, Spark or SQL serverless pools.
147
+
No, you can't access analytical store with the Azure Cosmos DB REST APIs. You need to use Azure Synapse Analytics, Spark or SQL serverless pools.
148
148
149
149
- question: |
150
150
Is analytical store supported by Terraform?
@@ -196,7 +196,7 @@ sections:
196
196
Yes, Azure Synapse Link supports configuring customer-managed keys using your Azure Cosmos DB account's managed identity. You can seamlessly encrypt the data across transactional and analytical stores using the same customer-managed keys in an automatic and transparent manner. To learn more, see [configuring customer-managed keys using Azure Cosmos DB accounts' managed identities](how-to-setup-cmk.md#using-managed-identity) article.
197
197
198
198
- question: |
199
-
Does Azure Synapse link provide network isolation with Azure Synapse managed virtual networks?
199
+
Does Azure Synapse Link provide network isolation with Azure Synapse managed virtual networks?
200
200
answer: |
201
201
Yes, You can control network access to the data in the transactional and analytical stores independently. Network isolation is done using separate managed private endpoints for each store, within managed virtual networks in Azure Synapse workspaces. To learn more, see how to [Configure private endpoints for analytical store](analytical-store-private-endpoints.md) article.
0 commit comments