# From Fine-Tuning to Serving LLMs with OCI and dstack

dstack is an open-source tool that simplifies AI container orchestration and makes distributed training and deployment of LLMs more accessible. Combining dstack and OCI unlocks a streamlined process for setting up cloud infrastructure for distributed training and scalable model deployment.

This article walks you through fine-tuning a model with dstack on OCI, incorporating best practices from the Hugging Face Alignment Handbook, and then deploying the model with Hugging Face's Text Generation Inference (TGI).

**NOTE**: The experiment described in this article used an OCI cluster of three nodes, each with 2 x A10 GPUs, to fine-tune the Gemma 7B model.

## How dstack works

dstack offers a unified interface for developing, training, and deploying AI models across any cloud or data center. For example, you can specify a configuration for a training task or a model to be deployed, and dstack takes care of provisioning the required infrastructure and orchestrating the containers. Another advantage is that dstack lets you use any hardware, frameworks, and scripts.
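To make this concrete, here is a minimal, purely illustrative task configuration; the script name and GPU requirement are placeholders, and the full, real configurations used in this article appear in the sections below:

```
type: task
python: "3.11"
commands:
  - pip install -r requirements.txt
  - python train.py
resources:
  gpu: 24GB
```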

## Setting up dstack with OCI

Using dstack with OCI takes four simple steps. First, install the dstack Python package. Since dstack supports multiple cloud providers, we can narrow the scope to OCI:

```
pip install dstack[oci]
```

Next, configure OCI-specific credentials inside `~/.dstack/server/config.yml`. The example below assumes you already have credentials for the OCI CLI configured. For other configuration options, follow dstack's official documentation.

```
projects:
- name: main
  backends:
  - type: oci
    creds:
      type: default
```

The third step is to start the dstack server, as shown below.

```
dstack server
INFO Applying ~/.dstack/server/config.yml...
INFO Configured the main project in ~/.dstack/config.yml
INFO The admin token is ab6e8759-9cd9-4e84-8d47-5b94ac877ebf
INFO The dstack server 0.18.4 is running at http://127.0.0.1:3000
```

Finally, switch to the folder with your project scripts and initialize dstack.

```
dstack init
```

## Fine-tuning on OCI with dstack

To fine-tune the Gemma 7B model, we'll use the Hugging Face Alignment Handbook to incorporate established fine-tuning best practices. The source code for this tutorial is available on GitHub. Let's dive into the practical steps for fine-tuning your LLM.

Once you have switched to the project folder, here's the command to initiate the fine-tuning job on OCI with dstack:

```
ACCEL_CONFIG_PATH=fsdp_qlora_full_shard.yaml \
  FT_MODEL_CONFIG_PATH=qlora_finetune_config.yaml \
  HUGGING_FACE_HUB_TOKEN=xxxx \
  WANDB_API_KEY=xxxx \
  dstack run . -f ft.task.dstack.yml
```

The `FT_MODEL_CONFIG_PATH`, `ACCEL_CONFIG_PATH`, `HUGGING_FACE_HUB_TOKEN`, and `WANDB_API_KEY` environment variables are defined inside the `ft.task.dstack.yml` task configuration, and `dstack run` submits the task defined in `ft.task.dstack.yml` on OCI.

**NOTE**: dstack automatically copies the current directory's contents when executing the task.

Let's explore the key parts of each YAML file (for the full contents, check the repository).

The `qlora_finetune_config.yaml` file is the recipe configuration that tells the Alignment Handbook how you want to fine-tune the LLM:

```
# Model arguments
model_name_or_path: google/gemma-7b
tokenizer_name_or_path: philschmid/gemma-tokenizer-chatml
torch_dtype: bfloat16
bnb_4bit_quant_storage: bfloat16

# LoRA arguments
load_in_4bit: true
use_peft: true
lora_r: 8
lora_alpha: 16
lora_dropout: 0.05
lora_target_modules:
  - q_proj
  - k_proj
# ...

# Data training arguments
dataset_mixer:
  chansung/mental_health_counseling_conversations: 1.0
dataset_splits:
  - train
  - test
# ...
```

* **Model arguments**

  * `model_name_or_path`: Google's Gemma 7B is chosen as the base model.
  * `tokenizer_name_or_path`: the Alignment Handbook applies the chosen tokenizer's `apply_chat_template()` method. This tutorial uses the ChatML template instead of Gemma 7B's standard conversation template.
  * `torch_dtype` and `bnb_4bit_quant_storage`: these two values should be set to the same dtype to leverage the FSDP+QLoRA fine-tuning method. Since Gemma 7B does not comfortably fit on a single A10 GPU, this post uses FSDP+QLoRA to shard the model across 2 x A10 GPUs while applying the QLoRA technique.
* **LoRA arguments**: LoRA-specific configuration. Since this post leverages the FSDP+QLoRA technique, `load_in_4bit` is set to `true`. The other values can vary from experiment to experiment.
* **Data training arguments**: we prepared a dataset based on Amod's mental health counseling conversations dataset. Because the Alignment Handbook only understands data in the form `[{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}, …]`, which the tokenizer's `apply_chat_template()` method can interpret, the prepared dataset is essentially the original dataset converted into that `apply_chat_template()`-compatible format (see the sketch after this list).
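The data-conversion point above can be made concrete with a small sketch. The snippet below is illustrative only: it assumes the original dataset exposes `Context` and `Response` columns, and it is not the exact script used to build the prepared dataset.

```
# Illustrative sketch: reshape each counseling example into the "messages"
# format (a list of role/content dicts) that apply_chat_template() understands.
# The "Context"/"Response" column names are assumptions about the source dataset.
from datasets import load_dataset

def to_messages(example):
    return {
        "messages": [
            {"role": "user", "content": example["Context"]},
            {"role": "assistant", "content": example["Response"]},
        ]
    }

raw = load_dataset("Amod/mental_health_counseling_conversations", split="train")
chat_dataset = raw.map(to_messages, remove_columns=raw.column_names)
```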

The `fsdp_qlora_full_shard.yaml` file configures Accelerate on how to use the underlying infrastructure for fine-tuning the LLM:

```
compute_environment: LOCAL_MACHINE
distributed_type: FSDP # Use Fully Sharded Data Parallelism
fsdp_config:
  fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
  fsdp_backward_prefetch: BACKWARD_PRE
  fsdp_cpu_ram_efficient_loading: true
  fsdp_use_orig_params: false
  fsdp_offload_params: true
  fsdp_sharding_strategy: FULL_SHARD
  # ... (other FSDP configurations)
# ... (other configurations)
```

* `distributed_type`: `FSDP` indicates the use of Fully Sharded Data Parallel (FSDP), a technique that enables training large models that would otherwise not fit on a single GPU.
* `fsdp_config`: These settings control how FSDP operates, such as how the model is sharded (`fsdp_sharding_strategy`) and whether parameters are offloaded to the CPU (`fsdp_offload_params`).

With `distributed_type` set to `FSDP` and `fsdp_sharding_strategy` set to `FULL_SHARD`, the model's parameters, gradients, and optimizer states are sharded across all participating GPUs, including GPUs on different nodes, so each rank holds only a slice of the model while processing its own batches of the dataset. If you instead want each node to keep its own copy of the model, sharded only across the GPUs within that node and replicated across nodes, set `fsdp_sharding_strategy` to `HYBRID_SHARD`.
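As a small illustrative sketch of that change (only the relevant key is shown; the remaining options stay as in `fsdp_qlora_full_shard.yaml`):

```
fsdp_config:
  # Shard within each node and replicate the model across nodes.
  fsdp_sharding_strategy: HYBRID_SHARD
  # ... (other FSDP configurations unchanged)
```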

Additional parameters like `machine_rank`, `num_machines`, and `num_processes` are important for coordinating the nodes. However, it's recommended to set these values dynamically at runtime, as this provides flexibility when switching between different infrastructure setups.
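For reference, without an orchestrator you would pass these values by hand on every node. The sketch below is hypothetical and assumes this article's setup of three nodes with two GPUs each; `10.0.0.1` is a placeholder for the rank-0 node's private IP, and `--machine_rank` would be changed to 1 and 2 on the other nodes:

```
# Hypothetical manual multi-node launch (run a variant of this on each node).
ACCELERATE_LOG_LEVEL=info accelerate launch \
  --config_file fsdp_qlora_full_shard.yaml \
  --main_process_ip=10.0.0.1 \
  --main_process_port=8008 \
  --machine_rank=0 \
  --num_machines=3 \
  --num_processes=6 \
  scripts/run_sft.py qlora_finetune_config.yaml
```

As the next section shows, dstack injects these values automatically.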

## The power of dstack: simplified configuration

Finally, let's explore the `ft.task.dstack.yml` configuration that puts everything together and instructs dstack on how to provision the infrastructure and run the task:

```
type: task
nodes: 3

python: "3.11"
env:
  - ACCEL_CONFIG_PATH
  - FT_MODEL_CONFIG_PATH
  - HUGGING_FACE_HUB_TOKEN
  - WANDB_API_KEY
commands:
  # ... (setup steps, cloning repo, installing requirements)
  - ACCELERATE_LOG_LEVEL=info accelerate launch \
    --config_file recipes/custom/accel_config.yaml \
    --main_process_ip=$DSTACK_MASTER_NODE_IP \
    --main_process_port=8008 \
    --machine_rank=$DSTACK_NODE_RANK \
    --num_processes=$DSTACK_GPUS_NUM \
    --num_machines=$DSTACK_NODES_NUM \
    scripts/run_sft.py recipes/custom/config.yaml
ports:
  - 6006
resources:
  gpu: 1..2
  shm_size: 24GB
```

**Key points to highlight**:
* **Seamless Integration**: dstack integrates smoothly with Hugging Face's open-source ecosystem. In particular, you can use the Accelerate library with the configuration we defined in `fsdp_qlora_full_shard.yaml`, just as you would locally.
* **Automatic Configuration**: the `DSTACK_MASTER_NODE_IP`, `DSTACK_NODE_RANK`, `DSTACK_GPUS_NUM`, and `DSTACK_NODES_NUM` variables are managed automatically by dstack, reducing manual setup.
* **Resource Allocation**: dstack makes it easy to specify the number of nodes and GPUs (`gpu: 1..2`) for your fine-tuning job. For this post, the run uses three nodes, each equipped with 2 x A10 (24 GB) GPUs.

## Serving your fine-tuned model with dstack

Once your model is fine-tuned, dstack makes it a breeze to deploy it on OCI using Hugging Face's Text Generation Inference (TGI) framework.

Here's an example of how you can define a service in dstack:

```
type: service
image: ghcr.io/huggingface/text-generation-inference:latest
env:
  - HUGGING_FACE_HUB_TOKEN
  - MODEL_ID=chansung/mental_health_counseling_merged_v0.1
commands:
  - text-generation-launcher \
    --max-input-tokens 512 --max-total-tokens 1024 \
    --max-batch-prefill-tokens 512 --port 8000
port: 8000

resources:
  gpu:
    memory: 48GB

# (Optional) Enable the OpenAI-compatible endpoint
model:
  format: tgi
  type: chat
  name: chansung/mental_health_counseling_merged_v0.1
```
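Assuming the service definition above is saved as `serve.dstack.yml` (a hypothetical filename) and a dstack gateway has been set up, it can be deployed with the same workflow used for the fine-tuning task:

```
dstack run . -f serve.dstack.yml
```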

**Key advantages of this approach**:
* **Secure HTTPS Gateway**: dstack simplifies setting up a secure HTTPS connection through a gateway, a crucial aspect of production-level model serving.
* **Optimized for Inference**: The TGI framework is designed for efficient text-generation inference, ensuring your model delivers responsive and reliable results.
* **Auto-scaling**: dstack lets you specify an auto-scaling policy, including the minimum and maximum number of model replicas.

At this point, you can interact with the service using a standard `curl` command, or from Python with the `requests` library, the OpenAI SDK, or Hugging Face's `InferenceClient`. For instance, the snippet below shows an example using `curl`.

```
curl -X POST https://black-octopus-1.mycustomdomain.com/generate \
  -H "Authorization: Bearer <dstack-token>" \
  -H 'Content-Type: application/json' \
  -d '{"inputs": "I feel bad...", "parameters": {"max_new_tokens": 128}}'
```
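Because the service enables the OpenAI-compatible endpoint, you can also query it with the OpenAI Python SDK. The sketch below is illustrative: the base URL and token are placeholders for your own gateway domain and dstack token, and the exact endpoint path depends on your gateway configuration.

```
from openai import OpenAI

# Placeholders: substitute your own gateway URL and dstack token.
client = OpenAI(
    base_url="https://gateway.mycustomdomain.com",
    api_key="<dstack-token>",
)

response = client.chat.completions.create(
    model="chansung/mental_health_counseling_merged_v0.1",
    messages=[{"role": "user", "content": "I feel bad..."}],
    max_tokens=128,
)
print(response.choices[0].message.content)
```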

Additionally, dstack automatically provides a user interface for interacting directly with the deployed model:

<p align="center">
  <img src="https://github.com/oracle-devrel/technology-engineering/blob/dstack-tutorial/cloud-infrastructure/ai-infra-gpu/ai-infrastructure/dstack/assets/images/image1.png" width="600">
</p>

## Conclusion

By following the steps outlined in this article, you've unlocked a powerful approach to fine-tuning and deploying LLMs using the combined capabilities of dstack, OCI, and the Hugging Face ecosystem. You can now leverage dstack's user-friendly interface to manage your OCI resources effectively, streamlining the process of setting up distributed training environments for your LLM projects.

Furthermore, the integration with Hugging Face's Alignment Handbook and the TGI framework lets you fine-tune and serve your models seamlessly, ensuring they're optimized for performance and ready for real-world applications. We encourage you to explore the possibilities further and experiment with different models and configurations to achieve your desired outcomes.

**About the author**: Chansung Park is a Hugging Face fellow and an AI researcher working on LLMs.