docs updates (#216)

sr-remsha · web-flow · commit a99fed75b301 · 2024-12-20T13:39:05.000+01:00
diff --git a/docs/Roles and Access Control/1.overview.md b/docs/Roles and Access Control/1.overview.md
@@ -41,7 +41,9 @@ In the previous example, we gave access to `chat-gpt-35-turbo model` for users w
         "limits": {
             "chat-gpt-35-turbo": { //model name
                 "minute": "100000", //number of tokens per minute
-                "day": "10000000" //number of tokens per day
+                "day": "10000000", //number of tokens per day
+                "week": "10000000", //number of tokens per week
+                "month": "10000000" //number of tokens per month
             }
         }
     }
diff --git a/docs/Roles and Access Control/2.chat-users.md b/docs/Roles and Access Control/2.chat-users.md
@@ -31,7 +31,9 @@ In the system configuration, you can then add rules and restrictions to manage a
                 "limits": {
                     "chat-gpt-35-turbo": {
                         "minute": "200000",
-                        "day": "10000000"
+                        "day": "10000000",
+                        "week": "10000000",
+                        "month": "10000000"
                     }
                 }
             }
diff --git a/docs/Roles and Access Control/3.API Keys.md b/docs/Roles and Access Control/3.API Keys.md
@@ -27,7 +27,9 @@ To create and configure access control for API keys:
             "limits": {
                 "chat-gpt-35-turbo": {
                 "minute": "100000", //number of tokens per minute
-                "day": "10000000" //number of tokens per day
+                "day": "10000000", //number of tokens per day
+                "week": "10000000", //number of tokens per week
+                "month": "10000000" //number of tokens per month
                 }
             }
         }
diff --git a/docs/privacy.md b/docs/privacy.md
@@ -0,0 +1,42 @@
+# PII Compliance & Privacy
+
+## Introduction
+
+There are two scenarios in which personal information can be exposed when working with DIAL. 
+
+* The first scenario takes place when an application uploads sensitive data, such as conversations or files, to DIAL via API. In this case, DIAL can capture and save conversation data in application audit logs.
+* The second scenario occurs when users interact with DIAL Chat, specifically when engaging with language models or applications. In this case, DIAL can capture and save conversation data in both audit logs and the user’s BLOB storage.
+
+To help you comply with applicable Personally Identifiable Information (PII) regulations, DIAL lets you customize your data management strategy so that your data handling practices meet the required standards.
+
+DIAL allows you to choose which logs to store (if any), determine which data to retain, and implement the necessary policies in your file storage to manage sensitive resources effectively.
+
+## Applications Audit Logs
+
+When a user interacts with DIAL applications programmatically using API keys, DIAL captures and records all conversation data in a designated audit log. 
+
+To prevent DIAL from storing such logs for a particular API key, you can enable a special flag in the API key specification in the [DIAL Core dynamic settings](https://github.com/epam/ai-dial-core/?tab=readme-ov-file#dynamic-settings): 
+
+```json
+//Example of DIAL Core dynamic settings configuration
+"keys": 
+{
+        "yourApiKey": {
+            "secured": true
+        }        
+}
+```
+
+**Important!**: Custom applications can upload resources, such as conversations or files, to DIAL. In this case, the logic within the custom application that performs the upload is responsible for handling these resources. To manage them, you can additionally implement Time To Live (TTL) or other policies in your file storage. For more details, refer to the [File Storage Policies](#file-storage-policies) section.
+
+## BLOB Storage
+
+When a user interacts with DIAL Chat using a JSON Web Token (JWT), DIAL captures and records all conversation data in a designated audit log and **also** stores it as JSON files in the user's BLOB storage.
+
+**Important!**: Even if a user deletes all conversation data from the BLOB storage, the data is still retained in the audit logs. Because DIAL Chat uses JWT for user authentication rather than API keys, conversation data is always saved in BLOB storage.
+
+## File Storage Policies
+
+To manage resources uploaded to the BLOB storage, you can configure storage policies, which operate independently of DIAL. Such policies can use functionality such as cloud lambdas to set TTLs for specific file types.
+
+To facilitate the enforcement of such policies, DIAL can add metadata to files stored in the BLOB storage. This metadata assists in the application of storage policies, ensuring effective management of your resources.
diff --git a/docs/supported-models.md b/docs/supported-models.md
@@ -7,7 +7,7 @@ You can use [DIAL SDK](https://github.com/epam/ai-dial-sdk) to create custom mod
 | Vendor | Models |
 | :-- | :-- |
 | AI21| ai21.j2-grande-instruct, ai21.j2-jumbo-instruct |
-| Amazon| amazon.titan-tg1-large, amazon.titan-embed-text-v1, amazon.titan-embed-text-v2:0, amazon.titan-embed-image-v1 |
+| Amazon| amazon.titan-tg1-large, amazon.titan-embed-text-v1, amazon.titan-embed-text-v2:0, amazon.titan-embed-image-v1, amazon.nova-pro-v1, amazon.nova-lite-v1, amazon.nova-micro-v1 |
 | Anthropic| anthropic.claude-instant-v1, anthropic.claude-v2, anthropic.claude-v2-1, anthropic.claude-v3-opus, anthropic.claude-v3-haiku, anthropic.claude-3-5-haiku-20241022-v1, anthropic.claude-v3-sonnet, anthropic.claude-v3-5-sonnet, anthropic.claude-3-5-sonnet-20241022-v2, anthropic.claude |
 | Cohere| cohere.command-text-v14 |
 | Databricks| databricks-bge-large-en, databricks-llama-2-70b-chat, databricks-mixtral-8x7b-instruct, databricks-dbrx-instruct |
diff --git a/docs/tutorials/rate-limits-users.md b/docs/tutorials/rate-limits-users.md
@@ -33,7 +33,7 @@ For example purposes, lets configure rate limits for AI DIAL Chat users with use
           }
       }
       ```
-5. In AI DIAL Core configuration file, in the `roles` section, add your role ("azure-group-name") and limits for it. Refer to [configuration example](https://github.com/epam/ai-dial-core/blob/9d7e3ba8380ffea3b9b6a7ccd65a96f024e842e3/sample/aidial.config.json#L191) for example purposes. For `roles.<role_name>.limits` you can configure `minute` (total tokens per minute limit sent to the model, managed via floating window approach for well-distributed rate limiting. If it's not set the default value is unlimited) or `day` (total tokens per day limit sent to the model, managed via floating window approach for balanced rate limiting. If it's not set the default value is unlimited).
+5. In the AI DIAL Core configuration file, in the `roles` section, add your role ("azure-group-name") and its limits. Refer to the [configuration example](https://github.com/epam/ai-dial-core/blob/9d7e3ba8380ffea3b9b6a7ccd65a96f024e842e3/sample/aidial.config.json#L191) for reference. For `roles.<role_name>.limits`, you can configure `minute` (limit on the total tokens sent to the model per minute, managed via a floating window approach for well-distributed rate limiting; if not set, the default is unlimited), `day` (limit on the total tokens sent to the model per day, managed via a floating window approach for balanced rate limiting; if not set, the default is unlimited), and, analogously, `week` and `month`.
 
       > The `default` role applies in case other roles are not configured.
       > In case the same user has different roles with different limits, the role with the higher limit is an effective role.
diff --git a/docs/video demos/demos/dial-chathub.md b/docs/video demos/demos/dial-chathub.md
@@ -3,3 +3,14 @@
 https://youtu.be/IG0HawQuU-w
 
 DIAL ChatHub is an example of a flow orchestrator that combines several applications and models into one unified access point. ChatHub can automatically route prompts to one of several agents: text-to-text applications, text-to-image applications, vision-to-text applications, DIAL RAG, or DIAL Web RAG. It is just an example of how you can provide a unified access point to several different applications or models within your ecosystem.
+
+https://youtu.be/8Npbd0rESPI
+
+In this video, we explore the innovative prompt routing engine, ChatHub 2.0, that revolutionizes how users interact with AI tools. Learn how this intelligent system:
+
+* Provides a single entry point for all your GenAI chat needs 
+* Uses a GPT-4o based orchestrator to route prompts to the best tool 
+* Seamlessly integrates existing DIAL applications like RAG 
+* Leverages advanced models such as Gemini 1.5 Pro with Search 
+* Parallelizes searches and chains agents for efficient responses 
+* Combines text, data, and image generation capabilities
diff --git a/sidebars.js b/sidebars.js
@@ -7,6 +7,7 @@ const sidebars = {
       label: 'Home', // sidebar label
     },
     'quick-start',
+    'privacy',
     'architecture',
     'supported-models',
     {
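
As an illustration of how the `limits` entries touched by this change could be read, here is a minimal Python sketch. It is not DIAL Core code: the helper names `parse_limits` and `effective_limits` are hypothetical. It assumes that a missing window means "unlimited" (the tutorial states this for `minute` and `day`; extending it to `week` and `month` is an assumption) and that "the role with the higher limit" is resolved separately for each window, which the docs do not spell out.

```python
import math

# The four windows that appear in the documented "limits" objects.
WINDOWS = ("minute", "day", "week", "month")


def parse_limits(limits):
    """Convert string token limits to ints.

    Assumption: a window that is not configured is treated as unlimited.
    """
    return {w: int(limits[w]) if w in limits else math.inf for w in WINDOWS}


def effective_limits(role_limits):
    """Combine the limits of several roles held by the same user.

    Assumption: the higher limit wins independently for each window.
    An empty list of roles yields unlimited for every window.
    """
    parsed = [parse_limits(limits) for limits in role_limits]
    return {w: max((p[w] for p in parsed), default=math.inf) for w in WINDOWS}


# Limits for one model as they appear in the docs (values are strings).
role_a = {"minute": "100000", "day": "10000000"}
role_b = {"minute": "200000", "day": "10000000",
          "week": "10000000", "month": "10000000"}

effective = effective_limits([role_a, role_b])
```

With these inputs, `effective["minute"]` is the larger of the two minute limits, and `effective["week"]` is unlimited because `role_a` does not configure a weekly window.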
