This repository was archived by the owner on May 27, 2025. It is now read-only.

Commit 6531515

update docs with correct links and remove unnecessary zip file
1 parent b6c6d0e commit 6531515

3 files changed

+24
-16
lines changed

docs/DEPLOYMENT-GUIDE.md

Lines changed: 23 additions & 15 deletions
@@ -10,27 +10,33 @@ The deployment process requires the following tools to be installed:
1010
* cut - a standard linux utility
1111
* sed - a standard linux utility
1212
* [curl](https://curl.se) - command line data transfer
13+
* [docker desktop](https://docs.docker.com/get-started/get-docker)
1314
* [helm](https://helm.sh/docs/intro/install) - k8s package manager
1415
* [jq](https://jqlang.github.io/jq/download) >= v1.6
1516
* [kubectl](https://kubernetes.io/docs/tasks/tools) - k8s command line tool
1617
* [kubelogin](https://github.com/Azure/kubelogin) - client-go credential (exec) plugin implementing azure authentication
1718
* [yq](https://github.com/mikefarah/yq?tab=readme-ov-file#install) >= v4.40.7 - yaml file parser
1819

19-
TIP: If you open this repository inside a devcontainer (i.e. VSCode Dev Containers or Codespaces), all required tools for deployment will already be available. Opening a devcontainer using VS Code requires <a href="https://docs.docker.com/engine/install/" target="_blank" >Docker to be installed</a>.
20+
TIP: If you open this repository as a devcontainer (i.e. VSCode Dev Containers or Codespaces), all required tools for deployment will already be available. Opening a devcontainer using VS Code requires <a href="https://docs.docker.com/engine/install/" target="_blank" >Docker to be installed</a>.
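Outside a devcontainer, it can help to confirm that the tools listed above are on the `PATH` before starting a deployment. The following is a minimal sketch, not part of the repository: `missing_tools` is a hypothetical helper name, the binary names are assumed from the tool list (with docker desktop assumed to provide `docker`), and the version constraints for `jq` and `yq` are not checked.

```shell
# Sketch: report which required CLI tools are missing from PATH.
# `missing_tools` is a hypothetical helper, not part of this repository.
missing_tools() {
  for tool in "$@"; do
    # `command -v` exits non-zero when the tool cannot be found.
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Binary names assumed from the tool list in the deployment guide.
missing=$(missing_tools cut sed curl docker helm jq kubectl kubelogin yq)
if [ -n "$missing" ]; then
  printf 'Missing tools:\n%s\n' "$missing"
else
  echo "All required tools found."
fi
```

The check only reports missing tools and does not exit with an error, so it is safe to paste into an interactive shell.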
2021

2122
The setup/deployment process has been mostly automated with a shell script and Bicep files (infrastructure as code). Azure CLI will deploy all necessary Azure resources using these Bicep files. The deployment is configurable using values defined in `infra/deploy.parameters.json`. To the utmost extent, we have provided default values but users are still expected to modify some values.
2223

2324

2425
#### RBAC Permissions
25-
You will need the following <a href="https://learn.microsoft.com/en-us/azure/role-based-access-control/overview">Azure Role Based Access </a>permissions to deploy the GraphRAG solution accelerator. By default, Azure resources will be deployed with <a href="https://learn.microsoft.com/en-us/entra/identity/managed-identities-azure-resources/overview">Azure Managed Identities </a>in place, keeping with security best practices. Due to this enhanced security configuration, higher level permissions are required in order to deploy the necessary Azure resources:
26+
You will need the following <a href="https://learn.microsoft.com/en-us/azure/role-based-access-control/overview">Azure Role Based Access </a>permissions to deploy the GraphRAG solution accelerator. By default, Azure resources will be deployed with <a href="https://learn.microsoft.com/en-us/entra/identity/managed-identities-azure-resources/overview">Azure Managed Identities </a>, following security best practices. Due to this enhanced security configuration, higher level permissions are required in order to deploy the necessary Azure resources:
2627
| Permission | Scope |
2728
| :--- | ---: |
2829
Contributor | Subscription
2930
Role Based Access Control (RBAC) Administrator | Subscription
3031
Owner | Resource Group
3132

3233
#### Resource Providers
33-
The Azure subscription that you deploy this solution accelerator in will require the `Microsoft.OperationsManagement`, ` Microsoft.Compute` and `Microsoft.AlertsManagement` resource providers to be registered.
34+
The Azure subscription that you deploy this solution accelerator in requires several resource providers to be registered (if they aren't already). They include:
35+
36+
* `Microsoft.OperationsManagement`
37+
* `Microsoft.Compute`
38+
* `Microsoft.AlertsManagement`
39+
3440
This can be accomplished via the [Azure Portal](https://learn.microsoft.com/en-us/azure/azure-resource-manager/management/resource-providers-and-types#azure-portal) or with the following [Azure CLI](https://learn.microsoft.com/en-us/azure/azure-resource-manager/management/resource-providers-and-types#azure-cli) commands:
3541

3642
```shell
@@ -46,14 +52,14 @@ az provider show --namespace Microsoft.Compute -o table
4652

4753
## Installation
4854

49-
### 1. Azure OpenAI Quota
55+
### 1. Azure OpenAI
5056
As a prerequisite to deployment, you will either need access to an already deployed Azure OpenAI (AOAI) resource or have available quota. If an existing AOAI resource is not used, the deployment code in this accelerator will deploy an AOAI resource with some default model choices.
5157
Documentation on how to deploy an AOAI service can be found [here](https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/create-resource?pivots=web-portal).
5258

5359
Take note of the model deployment name and model name.
5460
Note that the AOAI instance **must** be in the same subscription that you plan to deploy this solution accelerator in.
5561

56-
As a starting point, we recommend the following quota thresholds be setup for this solution accelerator to run.
62+
As a starting point, we recommend the following quota thresholds be used for this solution accelerator to run.
5763
| Model Name | TPM Threshold |
5864
| :--- | ---: |
5965
gpt-4 turbo | 80K
@@ -88,22 +94,22 @@ In the `deploy.parameters.json` file, provide values for the following required
8894
`RESOURCE_GROUP` | <my_resource_group> | Yes | The resource group that GraphRAG will be deployed in. Will get created automatically if the resource group does not exist.
8995
`GRAPHRAG_API_BASE` | https://<my_openai_name>.openai.azure.com | No | An existing Azure OpenAI service endpoint.
9096
`GRAPHRAG_API_VERSION` | 2023-03-15-preview | No | OpenAI API version.
91-
`GRAPHRAG_LLM_MODEL` | gpt-4 | No | Name of the gpt-4 turbo model.
92-
`GRAPHRAG_LLM_MODEL_VERSION` | turbo-2024-04-09 | No | Model version of the gpt-4 turbo model. Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
93-
`GRAPHRAG_LLM_DEPLOYMENT_NAME` | gpt-4 | No | Deployment name of the gpt-4 turbo model.
97+
`GRAPHRAG_LLM_MODEL` | gpt-4 | No | Name of the Azure OpenAI LLM model to use (or deploy).
98+
`GRAPHRAG_LLM_MODEL_VERSION` | turbo-2024-04-09 | No | Model version of the LLM model to use (or deploy). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
99+
`GRAPHRAG_LLM_DEPLOYMENT_NAME` | gpt-4 | No | Deployment name of the LLM model to use (or deploy).
94100
`GRAPHRAG_LLM_MODEL_QUOTA` | 80 | No | TPM quota of the LLM model in units of 1000 (i.e. 10 = 10,000 TPM). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
95101
`GRAPHRAG_EMBEDDING_MODEL` | text-embedding-ada-002 | No | Name of the Azure OpenAI embedding model.
96-
`GRAPHRAG_EMBEDDING_MODEL_VERSION` | 2 | No | Model version of the Azure OpenAI embedding model. Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
97-
`GRAPHRAG_EMBEDDING_DEPLOYMENT_NAME` | text-embedding-ada-002 | No | Deployment name of the Azure OpenAI embedding model.
102+
`GRAPHRAG_EMBEDDING_MODEL_VERSION` | 2 | No | Model version of the embedding model to use (or deploy). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
103+
`GRAPHRAG_EMBEDDING_DEPLOYMENT_NAME` | text-embedding-ada-002 | No | Deployment name of the embedding model to use (or deploy).
98104
`GRAPHRAG_EMBEDDING_MODEL_QUOTA` | 300 | No | TPM quota of the embedding model in units of 1000 (i.e. 10 = 10,000 TPM). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
99105
`GRAPHRAG_IMAGE` | graphrag:backend | No | The name and tag of the graphrag docker image in the container registry. Will default to `graphrag:backend` and be hosted at `<my_container_registry_name>.azurecr.io/graphrag:backend`.
100106
`CONTAINER_REGISTRY_LOGIN_SERVER` | <container_registry_name>.azurecr.io | No | Endpoint of an existing Azure Container Registry where the `GRAPHRAG_IMAGE` docker image is hosted. If not provided, a unique name will be generated (recommended).
101-
`COGNITIVE_SERVICES_AUDIENCE` | `https://cognitiveservices.azure.com/.default` | No | Endpoint for cognitive services identity authorization. Should be defined for deployments in other Azure clouds.
102-
`APIM_NAME` | <auto_generated_unique_name> | No | Hostname of the API. Must be a globally unique name. The API will be accessible at `https://<APIM_NAME>.azure-api.net`.
103-
`APIM_TIER` | Developer | No | The [APIM tier](https://azure.microsoft.com/en-us/pricing/details/api-management) to use. Can be either `Developer` or `StandardV2`.
107+
`COGNITIVE_SERVICES_AUDIENCE` | `https://cognitiveservices.azure.com/.default` | No | Endpoint for cognitive services identity authorization. Should be explicitly set for deployments in other Azure clouds.
108+
`APIM_NAME` | <auto_generated_unique_name> | No | Hostname of the graphrag API. Must be a globally unique name. The API will be available at `https://<APIM_NAME>.azure-api.net`.
109+
`APIM_TIER` | Developer | No | The [APIM tier](https://azure.microsoft.com/en-us/pricing/details/api-management) to use. Can be either `Developer` or `StandardV2`. `StandardV2` costs more but will deploy faster.
104110
`RESOURCE_BASE_NAME` | | No | Suffix to apply to all azure resource names. If not provided a unique suffix will be generated.
105111
`AISEARCH_ENDPOINT_SUFFIX` | `search.windows.net` | No | Suffix to apply to AI search endpoint. Should be overridden for deployments in other Azure clouds.
106-
`AISEARCH_AUDIENCE` | `https://search.azure.com/` | No | Audience for AAD for AI Search. Should be overridden for deployments in other Azure clouds.
112+
`AISEARCH_AUDIENCE` | `https://search.azure.com/` | No | AAD audience for AI Search. Should be overridden for deployments in other Azure clouds.
107113

108114
### 5. Deploy solution accelerator to the resource group
109115
```shell
@@ -113,5 +119,7 @@ bash deploy.sh -p deploy.parameters.json
113119
```
114120
When deploying for the first time, it may take ~40-50 minutes to deploy all resources. In cases where a deployment error may occur (e.g. not enough quota), subsequent runs of this command will be faster if you rerun the deployment using the same resource group.
115121

122+
TIP: The choice of `APIM_TIER` is a major contributing factor to the overall deployment time.
123+
116124
### 6. Use GraphRAG
117-
Once the deployment has finished, check out our [`Quickstart`](../notebooks/1-Quickstart.ipynb) notebook for a demonstration of how to use the GraphRAG API. To access the API documentation, visit `<APIM_gateway_url>/manpage/docs` in your browser. You can find the `APIM_gateway_url` by looking in the Azure Portal for the deployed APIM instance.
125+
Once the deployment has finished, check out our [`Quickstart`](../notebooks/) notebook for a demonstration of how to use the GraphRAG API. To access the API documentation, visit `<APIM_gateway_url>/manpage/docs` in your browser. You can find the `APIM_gateway_url` by looking in the settings of the deployed APIM instance.
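Two of the conventions in the guide above are easy to get wrong: the `*_MODEL_QUOTA` parameters are given in units of 1000 tokens per minute, and the API documentation lives under `/manpage/docs` on the APIM gateway. The sketch below illustrates both. The helper names `quota_to_tpm` and `api_docs_url` are hypothetical, and it assumes the gateway URL has the form `https://<APIM_NAME>.azure-api.net` as described for the `APIM_NAME` parameter.

```shell
# Hypothetical helper: convert a *_MODEL_QUOTA value (units of 1000) to TPM.
quota_to_tpm() {
  echo $(( $1 * 1000 ))
}

# Hypothetical helper: build the API docs URL from an APIM_NAME value,
# assuming the gateway is served at https://<APIM_NAME>.azure-api.net.
api_docs_url() {
  printf 'https://%s.azure-api.net/manpage/docs\n' "$1"
}

quota_to_tpm 80    # default GRAPHRAG_LLM_MODEL_QUOTA, prints 80000
quota_to_tpm 300   # default GRAPHRAG_EMBEDDING_MODEL_QUOTA, prints 300000
```

If the deployed APIM instance uses a custom domain, take the gateway URL from the instance settings instead of deriving it from `APIM_NAME`.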

infra/managed-app/README.md

Lines changed: 1 addition & 1 deletion
@@ -93,4 +93,4 @@ There are two deployment options to consider when deploying a managed app. As an
9393
* 1-click Deployment Button
9494
If `mainTemplate.json` is hosted somewhere publicly (e.g. on GitHub), a deployment button can be created that deploys the app when clicked, like the example below.
9595
96-
[![Deploy to Azure](https://aka.ms/deploytoazurebutton)](https://portal.azure.com/#create/Microsoft.Template/uri/https%3A%2F%2Fraw.githubusercontent.com%2FAzure-Samples%2Fgraphrag-accelerator%2Frefs%2Fheads%2Fharjit-managed-app%2Finfra%2FmainTemplate.json)
96+
[![Deploy to Azure](https://aka.ms/deploytoazurebutton)](https://portal.azure.com/#create/Microsoft.Template/uri/https%3A%2F%2Fraw.githubusercontent.com%2FAzure-Samples%2Fgraphrag-accelerator%2Frefs%2Fheads%2Fmain%2Finfra%2Fmanaged-app%2FmainTemplate.json)
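The button link above is the portal prefix `https://portal.azure.com/#create/Microsoft.Template/uri/` followed by the percent-encoded raw URL of `mainTemplate.json`. A minimal sketch of building such a link is shown below. The `urlencode` and `deploy_button_link` helpers are hypothetical names, and the encoder assumes an ASCII-only URL.

```shell
# Hypothetical helper: percent-encode every character outside the
# unreserved set (A-Z a-z 0-9 . _ ~ -). Assumes ASCII-only input.
urlencode() (
  LC_ALL=C
  s=$1
  out=
  while [ -n "$s" ]; do
    rest=${s#?}        # everything after the first character
    c=${s%"$rest"}     # the first character
    s=$rest
    case $c in
      [A-Za-z0-9._~-]) out=$out$c ;;
      *) out=$out$(printf '%%%02X' "'$c") ;;
    esac
  done
  printf '%s\n' "$out"
)

# Hypothetical helper: build the portal deployment link for a template URL.
deploy_button_link() {
  printf 'https://portal.azure.com/#create/Microsoft.Template/uri/%s\n' "$(urlencode "$1")"
}

deploy_button_link 'https://raw.githubusercontent.com/Azure-Samples/graphrag-accelerator/refs/heads/main/infra/managed-app/mainTemplate.json'
```

Encoding the whole URL, including `:` and `/`, keeps the template location as a single path segment of the portal link.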
-29.4 KB
Binary file not shown.