
Commit 4c94f0a

Merge pull request #838 from oracle-devrel/prabhakar-aispeech-summary-branch
Prabhakar aispeech summary branch
2 parents 9bfbfe3 + 1d02d0c commit 4c94f0a

4 files changed

+79
-0
lines changed

Lines changed: 75 additions & 0 deletions
@@ -0,0 +1,75 @@
# Integrate OCI AI Speech Service and Generative AI Summarization in Visual Builder

## Introduction
OCI Speech is an AI service that applies automatic speech recognition technology to transform audio-based content into text. OCI Generative AI uses large language models (LLMs) to analyze text input and to generate, summarize, transform, and extract information. Using these AI capabilities, we built a low-code application, "Integrate OCI AI Speech Service and Generative AI Summarization in Visual Builder", that invokes the AI Speech REST API to convert an audio file into text and then invokes the Generative AI REST API to summarize it.

<img src="./AISpeechGenAISummary.jpg">
</img>

## Prerequisites

Before getting started, make sure you have access to these services:

- Oracle Speech Service
- Oracle Generative AI Service
- Oracle Visual Builder Cloud Service
- Oracle Visual Builder Service Connection

## AI Speech and OCI Generative AI Service Integration Architecture
1. AI Speech App using VBCS
    - Oracle Visual Builder Cloud Service (VBCS) is a hosted environment for your application development infrastructure. It provides an open-source, standards-based development service to create, collaborate on, and deploy applications within Oracle Cloud. This application is developed in VBCS.
2. Transcriptions with OCI AI Speech Service
    - Speech harnesses the power of spoken language, enabling you to easily convert media files containing human speech into highly accurate text transcriptions.
    - It produces accurate, easy-to-use JSON and SubRip Subtitle (SRT) files, written directly to the Object Storage bucket you choose.
3. Integration with OCI Generative AI Service
    - The transcriptions (text) are sent to the OCI Generative AI Service for text summarization.
4. Integration with OCI AI Speech and OCI Generative AI Service using Visual Builder Service Endpoint
    - The service connection endpoint option in Visual Builder is used to integrate the VBCS app with OCI Object Storage, OCI AI Speech Service, and Generative AI summarization.
5. Summarization Process
    - OCI Generative AI Service takes the transcription text received from the OCI Speech service and generates a concise summary of the audio or video.

<img src="./files/AISpeechSummaryAppArch.svg">
</img>

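Steps 2 and 3 above imply that the app reads the JSON transcription and forwards its text for summarization. The snippet below is a minimal sketch of that extraction, not the application's code. It assumes the output JSON has a top-level `transcriptions` array whose entries carry a `transcription` string; that shape is an assumption to check against a real output file, and `extract_transcript` is a hypothetical helper name.

```python
import json


def extract_transcript(result_json: str) -> str:
    """Join the text of every entry in the (assumed) `transcriptions` array."""
    data = json.loads(result_json)
    # Assumed shape: {"transcriptions": [{"transcription": "..."}, ...]}
    parts = [item.get("transcription", "") for item in data.get("transcriptions", [])]
    # Drop empty entries and normalize surrounding whitespace
    return " ".join(part.strip() for part in parts if part.strip())


if __name__ == "__main__":
    sample = json.dumps({"transcriptions": [{"transcription": " Hello team. "}]})
    print(extract_transcript(sample))
```

Returning an empty string when the array is missing lets the caller decide whether to skip the summarization call.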
## Application Flow in Detail (VBCS, OCI Speech, OCI Generative AI Service)
In this application:

- The Drag and Drop component in VBCS lets the user drop an audio or video file.
- A service endpoint connection in Visual Builder handles the communication between Visual Builder and the OCI Speech Service.
- The selected audio or video file is passed from Visual Builder to the OCI Speech Service for conversion into text.
- The OCI Speech Service analyzes the media (audio or video) file and converts it into text.
- The OCI Speech Service returns the transcription to the AI Speech service endpoint, which returns the result to the Visual Builder app.
- The transcription is then passed to the Generative AI service endpoint, which returns the summarization result to the Visual Builder app.

User (Visual Builder) --> (Drag and Drop File) --> |Media File (audio or video)| --> (Service Endpoint) --> |OCI Speech Service| --> |Speech to Text| --> (Service Endpoint) --> |Result| --> (Visual Builder) --> (Gen AI Service Endpoint) --> |Result| --> (Visual Builder)

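The flow above can be sketched as a small orchestration function. This is an illustration only: the application implements these steps with Visual Builder service connections, not Python. The `upload`, `transcribe`, and `summarize` callables are hypothetical stand-ins for the Object Storage, AI Speech, and Generative AI endpoint calls, and `run_pipeline` and `PipelineResult` are names invented for this example.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PipelineResult:
    object_name: str
    transcription: str
    summary: str


def run_pipeline(
    file_name: str,
    media: bytes,
    upload: Callable[[str, bytes], str],
    transcribe: Callable[[str], str],
    summarize: Callable[[str], str],
) -> PipelineResult:
    """Chain the service calls in the order the flow describes."""
    # 1. Store the dropped media file and get back its object name
    object_name = upload(file_name, media)
    # 2. Convert the stored media file to text
    transcription = transcribe(object_name)
    # 3. Summarize the transcription
    summary = summarize(transcription)
    return PipelineResult(object_name, transcription, summary)


if __name__ == "__main__":
    # Stub callables so the sketch runs without any cloud access
    result = run_pipeline(
        "meeting.wav",
        b"fake-audio-bytes",
        upload=lambda name, data: name,
        transcribe=lambda obj: f"transcript of {obj}",
        summarize=lambda text: f"summary of: {text}",
    )
    print(result)
```

Passing the three steps in as callables keeps the ordering logic separate from the HTTP details, so each step can be stubbed when testing.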
<img src="./AISpeechEngine.png">
</img>

## Service Endpoint call - Invoke OCI Object Storage
- uploadfile - /n/{namespaceName}/b/{bucketName}/o/{objectName}
- getObject - /n/{namespaceName}/b/{bucketName}/o/{outputFolderName}/{outputObjectName}

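As a rough illustration of how the two Object Storage paths above are filled in, the sketch below builds them from their parts. The helper names `object_path` and `output_object_path` are hypothetical, and the choice to percent-encode each segment while leaving `/` inside object names untouched is an assumption based on the getObject path shown above.

```python
from urllib.parse import quote


def object_path(namespace: str, bucket: str, object_name: str) -> str:
    """Build /n/{namespaceName}/b/{bucketName}/o/{objectName}."""
    return "/n/{}/b/{}/o/{}".format(
        quote(namespace, safe=""),
        quote(bucket, safe=""),
        # Keep "/" so folder-style object names stay readable in the path
        quote(object_name, safe="/"),
    )


def output_object_path(namespace: str, bucket: str, folder: str, name: str) -> str:
    """Build the getObject path for a transcription output file."""
    return object_path(namespace, bucket, f"{folder}/{name}")


if __name__ == "__main__":
    print(object_path("mynamespace", "speech-input", "team meeting.wav"))
    print(output_object_path("mynamespace", "speech-input", "output", "result.json"))
```

Encoding matters mostly for uploaded file names, which often contain spaces or other characters that are not valid in a URL path.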
## Service Endpoint call - Invoke AI Speech Service
- create transcription - /transcriptionJobs
- get transcription - /transcriptionJobs/{transcriptionJobId}

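Having separate create and get endpoints suggests that transcription runs as an asynchronous job: the app creates the job, then polls it until it finishes. The sketch below shows one way to do that under stated assumptions. The request body fields (`compartmentId`, `inputLocation`, `outputLocation`, and the `OBJECT_LIST_INLINE_INPUT_LOCATION` location type) and the `lifecycleState` values are recalled from the OCI Speech API and should be verified against the API reference. `build_transcription_job`, `wait_for_job`, and the `get_job` callable are hypothetical names.

```python
import time
from typing import Callable

# Assumed terminal values of the job's lifecycleState field
TERMINAL_STATES = {"SUCCEEDED", "FAILED", "CANCELED"}


def build_transcription_job(
    compartment_id: str, namespace: str, bucket: str, object_name: str, output_prefix: str
) -> dict:
    """Assemble an (assumed) request body for POST /transcriptionJobs."""
    return {
        "compartmentId": compartment_id,
        "inputLocation": {
            "locationType": "OBJECT_LIST_INLINE_INPUT_LOCATION",
            "objectLocations": [
                {"namespaceName": namespace, "bucketName": bucket, "objectNames": [object_name]}
            ],
        },
        "outputLocation": {
            "namespaceName": namespace,
            "bucketName": bucket,
            "prefix": output_prefix,
        },
    }


def wait_for_job(
    get_job: Callable[[], dict],
    interval_s: float = 5.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call get_job until the job reaches a terminal state or polls run out."""
    for _ in range(max_polls):
        job = get_job()
        if job.get("lifecycleState") in TERMINAL_STATES:
            return job
        sleep(interval_s)
    raise TimeoutError(f"job still running after {max_polls} polls")
```

Here `get_job` would wrap the call to /transcriptionJobs/{transcriptionJobId}. A bounded poll count keeps the UI from waiting indefinitely on a stuck job.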
## Service Endpoint call - Invoke Generative AI Service
- create summary - /20231130/actions/summarizeText

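To make the summarization call more concrete, the sketch below assembles a request body for the summarizeText endpoint and reads the summary from a response. The field names (`compartmentId`, `input`, `servingMode`, `length`), the `ON_DEMAND` serving type, the `cohere.command` model ID, and the `summary` response field are assumptions recalled from the OCI Generative AI API and should be checked against the API reference. `build_summarize_request` and `extract_summary` are hypothetical helper names.

```python
# Assumed allowed values for the summary length option
ALLOWED_LENGTHS = {"SHORT", "MEDIUM", "LONG", "AUTO"}


def build_summarize_request(
    compartment_id: str,
    text: str,
    model_id: str = "cohere.command",  # assumed model ID; use one enabled in your tenancy
    length: str = "AUTO",
) -> dict:
    """Assemble an (assumed) request body for POST /20231130/actions/summarizeText."""
    if not text.strip():
        raise ValueError("nothing to summarize: transcription text is empty")
    if length not in ALLOWED_LENGTHS:
        raise ValueError(f"length must be one of {sorted(ALLOWED_LENGTHS)}")
    return {
        "compartmentId": compartment_id,
        "input": text,
        "servingMode": {"servingType": "ON_DEMAND", "modelId": model_id},
        "length": length,
    }


def extract_summary(response: dict) -> str:
    """Return the (assumed) `summary` field, or an empty string if absent."""
    return response.get("summary", "")
```

Validating the transcription before the call avoids sending an empty request when the speech job produced no text.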
## Conclusion

In this article, we covered how to use the Oracle AI Speech Service to produce a transcription and then summarize it using the Generative AI service.
Feel free to modify and expand upon this template according to your specific use case and preferences.

# License

Copyright (c) 2024 Oracle and/or its affiliates.

Licensed under the Universal Permissive License (UPL), Version 1.0.

See [LICENSE](https://github.com/oracle-devrel/technology-engineering/blob/main/LICENSE) for more details.

ai-and-application-integration/ai-services/generative-ai-service/speech-genai/files/AISpeechSummaryAppArch.svg

Lines changed: 4 additions & 0 deletions