Commit 24f3c31

Merge branch 'main' into hyperion-update-february
2 parents 42839da + fae3b2c commit 24f3c31

File tree

125 files changed

+3542
-165
lines changed

ai-and-app-modernisation/ai-services/generative-ai-service/rag-genai/files/README.md

Lines changed: 1 addition & 1 deletion
@@ -26,7 +26,7 @@ Before getting started, make sure you have the following installed:
 ## Setting up the Environment
 1. Install the required packages:
 ```bash
-pip install oci==2.112.1+preview.1.1649 llama-index qdrant-client sentence-transformers
+pip install oci==2.118.1+preview.1.1697 llama-index qdrant-client sentence-transformers
 ```

 ## Loading data
Lines changed: 35 additions & 0 deletions
@@ -0,0 +1,35 @@
1+
Copyright (c) 2024 Oracle and/or its affiliates.
2+
3+
The Universal Permissive License (UPL), Version 1.0
4+
5+
Subject to the condition set forth below, permission is hereby granted to any
6+
person obtaining a copy of this software, associated documentation and/or data
7+
(collectively the "Software"), free of charge and under any and all copyright
8+
rights in the Software, and any and all patent rights owned or freely
9+
licensable by each licensor hereunder covering either (i) the unmodified
10+
Software as contributed to or provided by such licensor, or (ii) the Larger
11+
Works (as defined below), to deal in both
12+
13+
(a) the Software, and
14+
(b) any piece of software and/or hardware listed in the lrgrwrks.txt file if
15+
one is included with the Software (each a "Larger Work" to which the Software
16+
is contributed by such licensors),
17+
18+
without restriction, including without limitation the rights to copy, create
19+
derivative works of, display, perform, and distribute the Software and make,
20+
use, sell, offer for sale, import, export, have made, and have sold the
21+
Software and the Larger Work(s), and to sublicense the foregoing rights on
22+
either these or other terms.
23+
24+
This license is subject to the following condition:
25+
The above copyright notice and either this complete permission notice or at
26+
a minimum a reference to the UPL must be included in all copies or
27+
substantial portions of the Software.
28+
29+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
30+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
31+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
32+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
33+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
34+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
35+
SOFTWARE.
Lines changed: 85 additions & 0 deletions
@@ -0,0 +1,85 @@
1+
# Integrate OCI AI Speech Service and Generative AI Summarization in Visual Builder
2+
3+
# Introduction
4+
5+
OCI Speech is an AI service that applies automatic speech recognition technology to transform audio-based content into text. OCI Generative AI uses a Large Language Model (LLM) to analyze text input and to generate, summarize, transform, and extract information. Using these AI capabilities, we built a low-code application, "Integrate OCI AI Speech Service and Generative AI Summarization in Visual Builder", that invokes the AI Speech REST API to convert audio files into text and then invokes the Generative AI REST API to summarize the text.
6+
7+
Reviewed: 20.02.2024
8+
9+
<img src="./files/AISpeechGenAISummary.png"></img>
10+
11+
# Prerequisites
12+
13+
Before getting started, make sure you have access to these services:
14+
15+
- Oracle Speech Service
16+
- Oracle Generative AI Service
17+
- Oracle Visual Builder Cloud Service
18+
- Oracle Visual Builder Service Connection
19+
20+
# AI Speech and OCI Generative AI Service Integration Architecture
21+
22+
1. AI Speech App using VBCS
23+
24+
- Oracle Visual Builder Cloud Service (VBCS) is a hosted environment for your application development infrastructure. It provides an open-source standards-based development service to create, collaborate on, and deploy applications within Oracle Cloud. This application is developed in VBCS.
25+
26+
2. Transcriptions with OCI AI Speech Service:
27+
- Speech harnesses the power of spoken language, enabling you to easily convert media files containing human speech into highly accurate text transcriptions.
28+
- Produces accurate and easy-to-use JSON and SubRip Subtitle (SRT) files written directly to the Object Storage bucket you choose.
29+
30+
3. Integration with OCI Generative AI Service:
31+
- The transcriptions (text) are sent to the OCI Generative AI Service for text summarization.
32+
33+
4. Integration with OCI AI Speech and OCI Generative AI Service using Visual Builder Service Endpoint:
34+
- A Visual Builder Service Connection endpoint is used to integrate the VBCS app with OCI Object Storage, OCI AI Speech Service, and OCI Generative AI summarization.
35+
36+
5. Summarization Process:
37+
- OCI Generative AI Service takes the transcription received from OCI Speech Service and generates a concise summary of the audio or video.
38+
39+
40+
<img src="./files/AISpeechSummaryAppArch.svg"></img>
41+
42+
# Application Flow in Detail (VBCS, OCI Speech, OCI Generative AI Service)
43+
44+
In this application, the drag-and-drop component in VBCS allows the user to drop an audio or video file.
45+
- Create a Service Endpoint connection in Visual Builder to handle the communication between Visual Builder and OCI Speech Service.
46+
- Pass the selected audio or video from Visual Builder to OCI Speech Service to convert it into text.
47+
- OCI Speech Service analyzes the media (audio or video) file and converts it into text.
48+
- OCI Speech Service returns the transcription to the AI Speech Service Endpoint, which returns the result to the Visual Builder app.
49+
- The transcription is then passed to the Generative AI Service Endpoint, which returns the summarization result to the Visual Builder app.
50+
51+
User (Visual Builder) --> (Drag and Drop File) --> |Media File (audio or video)| --> (Service Endpoint) --> |OCI Speech Service| --> |Speech to Text| --> (Service Endpoint) --> |Result| --> (Visual Builder) --> (Gen AI Service Endpoint) --> |Result| --> (Visual Builder)
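
The flow above can be sketched as a small orchestration function. This is an illustrative outline only, not the Visual Builder implementation: the five callables stand in for the Service Endpoint calls and are hypothetical, and the job state names (`SUCCEEDED`, `FAILED`, `CANCELED`) are assumptions that should be checked against the OCI Speech API reference.

```python
import time
from typing import Callable


def transcribe_and_summarize(
    media_name: str,
    upload: Callable[[str], None],
    create_job: Callable[[str], str],
    get_job_state: Callable[[str], str],
    fetch_transcript: Callable[[str], str],
    summarize: Callable[[str], str],
    poll_seconds: float = 5.0,
    max_polls: int = 60,
) -> dict:
    """Run upload -> transcription job -> transcript -> summary in order."""
    upload(media_name)                 # put the media file in Object Storage
    job_id = create_job(media_name)    # start the transcription job

    # Poll until the job finishes; state names are assumed, not confirmed.
    for _ in range(max_polls):
        state = get_job_state(job_id)
        if state == "SUCCEEDED":
            break
        if state in ("FAILED", "CANCELED"):
            raise RuntimeError(f"transcription job {job_id} ended in state {state}")
        time.sleep(poll_seconds)
    else:
        raise TimeoutError(f"transcription job {job_id} did not finish in time")

    transcript = fetch_transcript(job_id)  # read the transcription result
    return {"transcript": transcript, "summary": summarize(transcript)}
```

Passing the service calls in as parameters keeps the sequencing logic testable without any cloud access; in the real application each callable would map to one of the Service Endpoint calls listed in the following sections.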
52+
53+
<img src="./files/AISpeechEngine.png"></img>
54+
55+
# Service Endpoint call - Invoke OCI Object Storage
56+
57+
uploadfile - /n/{namespaceName}/b/{bucketName}/o/{objectName}
58+
getObject - /n/{namespaceName}/b/{bucketName}/o/{outputFolderName}/{outputObjectName}
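
As a rough illustration of how the two path templates above are filled in, the following sketch builds each path from its placeholder values. The function names are hypothetical, and percent-encoding every placeholder value is a choice made for this sketch rather than something the README specifies.

```python
from urllib.parse import quote


def _segment(value: str) -> str:
    # Encode one path segment; safe="" means "/" inside a value is encoded too.
    return quote(value, safe="")


def upload_path(namespace: str, bucket: str, object_name: str) -> str:
    """Path for the uploadfile call."""
    return f"/n/{_segment(namespace)}/b/{_segment(bucket)}/o/{_segment(object_name)}"


def get_object_path(namespace: str, bucket: str, output_folder: str, output_object: str) -> str:
    """Path for the getObject call that reads the transcription output."""
    return (
        f"/n/{_segment(namespace)}/b/{_segment(bucket)}"
        f"/o/{_segment(output_folder)}/{_segment(output_object)}"
    )


print(upload_path("mynamespace", "media", "team call.mp3"))
```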
59+
60+
61+
# Service Endpoint call - Invoke AI Speech Service
62+
63+
create transcription - /transcriptionJobs
64+
get transcription - /transcriptionJobs/{transcriptionJobId}
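
A minimal sketch of the request body that the create transcription call might send, and of the path used to read the job back. The field names (`inputLocation`, `objectLocations`, `outputLocation`, and the `OBJECT_LIST_INLINE_INPUT_LOCATION` type) reflect our understanding of the OCI Speech create-job request and are assumptions here; verify them against the API reference before use. The function names are hypothetical.

```python
import json


def build_transcription_job(
    compartment_id: str,
    namespace: str,
    bucket: str,
    object_name: str,
    output_prefix: str,
) -> dict:
    """Build a JSON-serializable body for the create transcription call."""
    return {
        "compartmentId": compartment_id,
        # Where the uploaded media file lives (assumed field names).
        "inputLocation": {
            "locationType": "OBJECT_LIST_INLINE_INPUT_LOCATION",
            "objectLocations": [
                {
                    "namespaceName": namespace,
                    "bucketName": bucket,
                    "objectNames": [object_name],
                }
            ],
        },
        # Where the service should write the transcription output.
        "outputLocation": {
            "namespaceName": namespace,
            "bucketName": bucket,
            "prefix": output_prefix,
        },
    }


def transcription_job_path(job_id: str) -> str:
    """Path for the get transcription call."""
    return f"/transcriptionJobs/{job_id}"


body = build_transcription_job("ocid1.compartment.example", "ns", "media", "call.mp3", "out")
print(json.dumps(body, indent=2))
```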
65+
66+
# Service Endpoint call - Invoke Generative AI Service
67+
68+
create summary - /20231130/actions/summarizeText
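
The summarization call takes the transcription text as its input. The sketch below shows one plausible shape of that request body; the field names (`servingMode`, `input`, `length`, `format`) and the `cohere.command` model ID are assumptions based on our reading of the Generative AI API and are not taken from this README.

```python
def build_summarize_request(
    compartment_id: str,
    text: str,
    model_id: str = "cohere.command",  # assumed model ID, adjust as needed
    length: str = "MEDIUM",
) -> dict:
    """Build a JSON-serializable body for the create summary call."""
    if not text.strip():
        raise ValueError("transcription text is empty; nothing to summarize")
    return {
        "compartmentId": compartment_id,
        "servingMode": {"servingType": "ON_DEMAND", "modelId": model_id},
        "input": text,        # the transcription returned by OCI Speech
        "length": length,     # assumed options: SHORT, MEDIUM, LONG, AUTO
        "format": "PARAGRAPH",
    }
```

Rejecting empty input before the call avoids spending a request on a transcription job that produced no text.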
69+
70+
71+
# Conclusion
72+
73+
In this article, we've covered how to use Oracle AI Speech Service to transcribe audio or video and summarize the transcription using the Generative AI service.
74+
75+
Feel free to modify and expand upon this template according to your specific use case and preferences.
76+
77+
78+
# License
79+
80+
Copyright (c) 2024 Oracle and/or its affiliates.
81+
82+
Licensed under the Universal Permissive License (UPL), Version 1.0.
83+
84+
See [LICENSE](https://github.com/oracle-devrel/technology-engineering/blob/main/LICENSE) for more details.
85+

ai-and-app-modernisation/ai-services/generative-ai-service/speech-genai/files/AISpeechSummaryAppArch.svg

Lines changed: 4 additions & 0 deletions
Lines changed: 35 additions & 0 deletions
@@ -0,0 +1,35 @@
1+
Copyright (c) 2024 Oracle and/or its affiliates.
2+
3+
The Universal Permissive License (UPL), Version 1.0
4+
5+
Subject to the condition set forth below, permission is hereby granted to any
6+
person obtaining a copy of this software, associated documentation and/or data
7+
(collectively the "Software"), free of charge and under any and all copyright
8+
rights in the Software, and any and all patent rights owned or freely
9+
licensable by each licensor hereunder covering either (i) the unmodified
10+
Software as contributed to or provided by such licensor, or (ii) the Larger
11+
Works (as defined below), to deal in both
12+
13+
(a) the Software, and
14+
(b) any piece of software and/or hardware listed in the lrgrwrks.txt file if
15+
one is included with the Software (each a "Larger Work" to which the Software
16+
is contributed by such licensors),
17+
18+
without restriction, including without limitation the rights to copy, create
19+
derivative works of, display, perform, and distribute the Software and make,
20+
use, sell, offer for sale, import, export, have made, and have sold the
21+
Software and the Larger Work(s), and to sublicense the foregoing rights on
22+
either these or other terms.
23+
24+
This license is subject to the following condition:
25+
The above copyright notice and either this complete permission notice or at
26+
a minimum a reference to the UPL must be included in all copies or
27+
substantial portions of the Software.
28+
29+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
30+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
31+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
32+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
33+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
34+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
35+
SOFTWARE.
Lines changed: 95 additions & 0 deletions
@@ -0,0 +1,95 @@
1+
# Describe an image using OCI AI Vision Service and OCI Generative AI Service
2+
3+
# Introduction
4+
In this article, we'll explore how to describe an image using OCI AI Vision Service and OCI Generative AI Service.
5+
The application is developed using Oracle VBCS, OIC, OCI AI Vision service, and OCI Generative AI Service.
6+
This integrated approach combines the strength of OCI AI Vision and OCI Generative AI Service, allowing for efficient and insightful summarization of image content.
7+
8+
9+
Reviewed: 21.02.2024
10+
11+
<img src="./files/AIVisionApp.jpg"></img>
12+
13+
14+
# Prerequisites
15+
16+
Before getting started, make sure you have access to these services:
17+
18+
- Oracle Generative AI Service
19+
- Oracle Vision Service
20+
- Oracle Visual Builder Cloud Service
21+
- Oracle Integration Cloud
22+
23+
# AI Vision and OCI Generative AI Service Integration Architecture
24+
25+
1. AI Vision App using VBCS
26+
- Oracle Visual Builder Cloud Service (VBCS) is a hosted environment for your application development infrastructure. It provides an open-source standards-based development service to create, collaborate on, and deploy applications within Oracle Cloud. This application is developed in VBCS.
27+
28+
2. Image Analysis with OCI AI Vision Service:
29+
- The AI Vision service is employed to analyze images.
30+
- It identifies objects within the image by using advanced computer vision algorithms.
31+
32+
3. Integration with OCI Generative AI Service:
33+
- The extracted object keywords are sent to the OCI Generative AI Service.
34+
35+
4. Integration with OCI AI Vision and OCI Generative AI Service using OIC:
36+
- Oracle Integration Cloud integrates the VBCS app and OCI AI Services.
37+
38+
5. Summarization Process:
39+
- OCI Generative AI Service generates text using the keywords received from OCI Vision service, to create a concise summary of the image.
40+
41+
<img src="./files/AIVisionAppArch.svg"></img>
42+
43+
# Application Flow in Detail (VBCS, OIC, OCI Vision, OCI Generative AI Service)
44+
45+
In this application,
46+
- The File Picker action in VBCS allows the user to select the image.
47+
- Create an integration process in Oracle Integration Cloud (OIC) to handle the communication between VBCS and OCI Vision Service.
48+
- Pass the selected image from VBCS to OCI Vision Service to analyze the image.
49+
- OCI Vision Service analyzes the image and identifies objects within it.
50+
- OCI Vision Service returns the detected objects (keywords) to the OIC integration process, which returns the results to VBCS.
51+
52+
User (VBCS) --> (File Picker) --> |Image| --> (OIC) --> |OCI Vision Service| --> |Detected Objects| --> (OIC) --> |Result| --> (VBCS)
53+
54+
<img src="./files/VBCS_Vision.jpg">
55+
</img>
56+
57+
OIC call - Invoke OCI Vision Service
58+
Endpoint - /actions/analyzeImage
59+
60+
<img src="./files/OIC_VisionService.jpg">
61+
</img>
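
To make the Vision step more concrete, here is a hedged sketch of the request body for the image analysis call and of how keywords could be pulled from its response. The field names (`features`, `featureType`, `image`, `imageObjects`, `confidence`) follow our understanding of the OCI Vision API and are assumptions; the function names and the 0.5 confidence threshold are hypothetical.

```python
import base64


def build_analyze_image_body(image_bytes: bytes, compartment_id: str, max_results: int = 10) -> dict:
    """Build a JSON-serializable body that sends the image inline as base64."""
    return {
        "compartmentId": compartment_id,
        "image": {
            "source": "INLINE",
            "data": base64.b64encode(image_bytes).decode("ascii"),
        },
        "features": [{"featureType": "OBJECT_DETECTION", "maxResults": max_results}],
    }


def extract_keywords(response: dict, min_confidence: float = 0.5) -> list:
    """Return unique object names, in first-seen order, above the threshold."""
    keywords = []
    for obj in response.get("imageObjects") or []:
        name = obj.get("name")
        if name and obj.get("confidence", 0.0) >= min_confidence and name not in keywords:
            keywords.append(name)
    return keywords
```

Filtering by confidence and removing duplicates keeps the keyword list short, which matters because the list is later passed to the Generative AI Service as the basis for the summary.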
62+
63+
- User clicks the "Generate" button in the app to initiate the summary generation.
64+
- Configure the OIC integration process to invoke the GenAI service.
65+
- Pass the keywords returned by the OCI Vision Service along with any additional relevant information.
66+
- Generative AI Service processes the received keywords and generates a summary of the image content.
67+
68+
User (VBCS) --> (File Picker) --> |Image| --> (OIC) --> |OCI Vision Service| --> |Detected Keywords| --> (OIC) --> |OCI Generative AI Service| --> |Summary| --> (OIC) --> |Result| --> (VBCS)
69+
70+
<img src="./files/VBCS_GenerateSummary.jpg">
71+
</img>
72+
73+
OIC call - Invoke OCI Generative AI Service
74+
Endpoint - /20231130/actions/generateText
75+
<img src="./files/OIC_GenerateSummary.jpg">
76+
</img>
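
The step of turning Vision keywords into a text generation request could look roughly like the sketch below. The prompt wording, the function names, and the `cohere.command` model ID are hypothetical, and the body fields (`servingMode`, `inferenceRequest`, `runtimeType`, `maxTokens`) are assumptions based on our reading of the Generative AI API rather than the contents of the OIC integration.

```python
def build_prompt(keywords: list) -> str:
    """Turn detected object names into a plain-language instruction."""
    if not keywords:
        raise ValueError("no keywords were detected in the image")
    return "Write a short description of an image that contains: " + ", ".join(keywords) + "."


def build_generate_text_body(
    compartment_id: str,
    prompt: str,
    model_id: str = "cohere.command",  # assumed model ID, adjust as needed
    max_tokens: int = 200,
) -> dict:
    """Build a JSON-serializable body for the text generation call."""
    return {
        "compartmentId": compartment_id,
        "servingMode": {"servingType": "ON_DEMAND", "modelId": model_id},
        "inferenceRequest": {
            "runtimeType": "COHERE",
            "prompt": prompt,
            "maxTokens": max_tokens,
            "temperature": 0.5,
        },
    }


body = build_generate_text_body("ocid1.compartment.example", build_prompt(["Dog", "Ball"]))
print(body["inferenceRequest"]["prompt"])
```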
77+
78+
# Code
79+
VBCS app - ImageClassification-1.0.zip
80+
OIC Vision Integration - RESTVISION_01.00.0000.iar
81+
OIC Generate Summary - IMAGEDESCRIPTIONGENERATION_01.00.0000.iar
82+
83+
# Conclusion
84+
85+
In this article, we've covered how to utilize Oracle AI Vision Service features to provide a summary of an image using Generative AI service.
86+
Feel free to modify and expand upon this template according to your specific use case and preferences.
87+
88+
# License
89+
90+
Copyright (c) 2024 Oracle and/or its affiliates.
91+
92+
Licensed under the Universal Permissive License (UPL), Version 1.0.
93+
94+
See [LICENSE](https://github.com/oracle-devrel/technology-engineering/blob/main/LICENSE) for more details.
95+

ai-and-app-modernisation/ai-services/generative-ai-service/vision-genai/files/AIVisionAppArch.svg

Lines changed: 4 additions & 0 deletions
