Commit c8d40e1

Merge branch 'main' into GitHubReview
2 parents a93fe7b + 6e15c4c commit c8d40e1

268 files changed

+3107
-784
lines changed

ai/ai-document-understanding/README.md

Lines changed: 1 addition & 1 deletion
@@ -23,7 +23,7 @@ Reviewed: 17.10.2024
2323
- [Cloud Coaching - Boost Your Oracle AI Services](https://youtu.be/VVWTqqlIEhg)
2424
- Learn how to develop a Multi-Chain Document Evaluation App with Oracle Generative AI, Document Understanding, and Integration Cloud.
2525

26-
- [Blog: Document Evaluation Tool using OCI Generative AI, Document Understanding & Integration Cloud](https://github.com/oracle-devrel/technology-engineering/tree/main/ai-and-app-modernisation/ai-services/generative-ai-service/doc-evaluation-genai)
26+
- [Blog: Document Evaluation Tool using OCI Generative AI, Document Understanding & Integration Cloud](https://github.com/oracle-devrel/technology-engineering/tree/main/ai/generative-ai-service/doc-evaluation-genai)
2727
- In this article, we'll explore how to make a handy tool that helps to evaluate documents using Oracle Generative AI, OCI Document Understanding, and Oracle Integration Cloud (OIC). This application combines a low-code approach to orchestrate LLM AI services and applications using Oracle Integration Cloud and Generative AI prompting techniques for tasks like document key criteria extraction, summarization, and evaluation.
2828

2929
- [Cloud Coaching - How to code and develop a Web (or Mobile) Application with Visual Builder that uses and leverages OCI Document Understanding Service](https://youtu.be/0oHixpA9JDc?si=3CWh0d2RpuEzzLKU)

ai/ai-speech/README.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22

33
OCI Speech is an AI service that applies automatic speech recognition technology to transform audio-based content into text. Developers can easily make API calls to integrate OCI Speech’s pre-trained models into their applications. OCI Speech can be used for accurate, text-normalized, time-stamped transcription via the console and REST APIs as well as command-line interfaces or SDKs. You can also use OCI Speech in an OCI Data Science notebook session. With OCI Speech, you can filter profanities, get confidence scores for both single words and complete transcriptions, and more.
44

5-
Reviewed: 17.10.2024
5+
Reviewed: 13.11.2024
66

77
# Table of Contents
88

@@ -22,8 +22,6 @@ Reviewed: 17.10.2024
2222

2323
- [Cloud Coaching - Boost Your Oracle AI Services](https://youtu.be/VVWTqqlIEhg)
2424
- Integrate OCI AI Speech Service and Generative AI Summarization with Oracle Integration Cloud and Visual Builder
25-
- [Blog: Integrate OCI AI Speech Service and Generative AI Summarization with Oracle Integration Cloud & Visual Builder](https://github.com/oracle-devrel/technology-engineering/tree/main/ai-and-app-modernisation/ai-services/generative-ai-service/speech-genai)
26-
- OCI Speech is an AI service that applies automatic speech recognition technology to transform audio-based content into text. Generative AI, The Large Language Model (LLM) analyzes the text input and can generate, summarize, transform, and extract information. Using these AI capabilities, we built a low code application- “Integrate OCI AI Speech Service and Generative AI Service for Summarization in Visual Builder " to invoke AI Speech REST API to convert audio files into text and then further invoke the Generative AI REST API to Summarize it.
2725
- [Demos built using OCI Python SDK](https://github.com/luigisaetta/oci-speech-demos)
2826
- [AI Speech console demo](https://youtu.be/EWBSoSLNph8)
2927
- [Cloud Coaching - Unlock the potential of enterprise Oracle GenAI](https://www.youtube.com/watch?v=dtvP0DU7Mdg)
@@ -34,6 +32,7 @@ Reviewed: 17.10.2024
3432
- Integrate Python and Oracle Functions for backend processing.
3533
- Build a user-friendly interface with Oracle Visual Builder Cloud Service.
3634
- Download and preview transcribed text.
35+
- [Podcast Generator](https://github.com/oracle-devrel/technology-engineering/tree/main/ai/ai-speech/podcast-generator)
3736

3837
## Architecture Center
3938

Lines changed: 35 additions & 0 deletions
@@ -0,0 +1,35 @@
1+
Copyright (c) 2024 Oracle and/or its affiliates.
2+
3+
The Universal Permissive License (UPL), Version 1.0
4+
5+
Subject to the condition set forth below, permission is hereby granted to any
6+
person obtaining a copy of this software, associated documentation and/or data
7+
(collectively the "Software"), free of charge and under any and all copyright
8+
rights in the Software, and any and all patent rights owned or freely
9+
licensable by each licensor hereunder covering either (i) the unmodified
10+
Software as contributed to or provided by such licensor, or (ii) the Larger
11+
Works (as defined below), to deal in both
12+
13+
(a) the Software, and
14+
(b) any piece of software and/or hardware listed in the lrgrwrks.txt file if
15+
one is included with the Software (each a "Larger Work" to which the Software
16+
is contributed by such licensors),
17+
18+
without restriction, including without limitation the rights to copy, create
19+
derivative works of, display, perform, and distribute the Software and make,
20+
use, sell, offer for sale, import, export, have made, and have sold the
21+
Software and the Larger Work(s), and to sublicense the foregoing rights on
22+
either these or other terms.
23+
24+
This license is subject to the following condition:
25+
The above copyright notice and either this complete permission notice or at
26+
a minimum a reference to the UPL must be included in all copies or
27+
substantial portions of the Software.
28+
29+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
30+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
31+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
32+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
33+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
34+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
35+
SOFTWARE.
Lines changed: 67 additions & 0 deletions
@@ -0,0 +1,67 @@
1+
# Podcast Generator with Oracle AI and Low-Code
2+
Welcome to this guide on building a groundbreaking application that uses Oracle's AI-powered tools to generate high-quality podcasts effortlessly. Leveraging the Oracle Cloud Infrastructure (OCI) GenAI and AI Speech (Text-to-Speech) services, this solution transforms written content into engaging, natural-sounding audio.
3+
The application is designed to streamline podcast production through advanced AI capabilities. Starting from a simple text input, the app uses GenAI to generate a structured podcast script. It then enhances the speech output with SSML (Speech Synthesis Markup Language), which gives the audio a natural flow, making it sound like a professional recording. The result is a high-quality audio experience that can cater to a wide range of content needs and audiences.
4+
This application is built using Oracle Visual Builder Cloud Service (VBCS), a powerful low-code platform that simplifies development and accelerates the creation of robust applications without extensive coding. With this low-code approach, even complex workflows are straightforward to set up, allowing developers to focus on leveraging AI's potential for high-quality audio synthesis.
5+
This AI-powered solution not only automates and optimizes the podcast creation process but also allows content creators to deliver professional audio content at scale efficiently.
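
To make the SSML step more concrete, here is a minimal Python sketch of how plain script sentences could be wrapped in SSML before being sent to a Text-to-Speech service. It is not taken from the Visual Builder project: the function name `build_ssml`, the pause length, and the speaking rate are hypothetical, and it uses only core W3C SSML elements (`<speak>`, `<prosody>`, `<s>`, `<break>`). Which SSML elements the OCI service accepts should be checked against its documentation.

```python
import xml.etree.ElementTree as ET


def build_ssml(sentences, pause_ms=400, rate="medium"):
    """Wrap plain sentences in a minimal SSML document (illustrative only).

    Each sentence becomes an <s> element, with a <break> between sentences
    so the synthesized speech pauses naturally.
    """
    speak = ET.Element("speak")
    prosody = ET.SubElement(speak, "prosody", {"rate": rate})
    for index, sentence in enumerate(sentences):
        s = ET.SubElement(prosody, "s")
        s.text = sentence.strip()
        # Add a pause after every sentence except the last one.
        if index < len(sentences) - 1:
            ET.SubElement(prosody, "break", {"time": f"{pause_ms}ms"})
    # ElementTree escapes characters such as "&" for us.
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml(["Welcome to the show.", "Today we talk about AI & audio."]))
```

Building the markup with an XML library rather than string concatenation keeps special characters in the script from producing invalid SSML.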
6+
# **1. Prepare your user**
7+
8+
In Oracle Cloud Infrastructure (OCI), API keys are used for secure authentication when accessing OCI resources through REST APIs. OCI API keys consist of two parts: a Public key and a Private key. You use the OCI console to generate the Private/Public key pair.
9+
## Generate API keys using the OCI Console
10+
To generate the API keys using the OCI Console:
11+
12+
- Log in to your OCI account.
13+
![alt text](ak1.png)
14+
- Click on the Profile icon at the top-right corner and select your Profile hyperlink.
15+
![alt text](ak2.png)
16+
- Under the Resources section at the bottom-left, select API Keys and then click Add API Key.
17+
![alt text](ak3.png)
18+
- The Add API Key dialog is displayed. Select Generate API Key Pair to create a new key pair.
19+
![alt text](ak4.png)
20+
- Click Download Private Key. A .pem file is saved to your local device. You do not need to download the public key. Then click the Add button.
21+
![alt text](ak5.png)
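
Once the key is added, the console shows its fingerprint. OCI request signing identifies a key by a key ID of the form `<tenancy OCID>/<user OCID>/<fingerprint>`. The following Python sketch shows how those three values fit together and checks the fingerprint format first. The function name `build_key_id` and the sample values are hypothetical placeholders, and the exact fields that Visual Builder asks for may be laid out differently.

```python
import re

# An API key fingerprint is 16 colon-separated pairs of hex digits.
FINGERPRINT_RE = re.compile(r"^[0-9a-f]{2}(:[0-9a-f]{2}){15}$")


def build_key_id(tenancy_ocid, user_ocid, fingerprint):
    """Return the key ID string used by OCI request signing.

    Raises ValueError if the fingerprint is not in the expected format.
    """
    fingerprint = fingerprint.strip().lower()
    if not FINGERPRINT_RE.match(fingerprint):
        raise ValueError(f"unexpected fingerprint format: {fingerprint!r}")
    return f"{tenancy_ocid.strip()}/{user_ocid.strip()}/{fingerprint}"


if __name__ == "__main__":
    # Placeholder values, not real identifiers.
    print(build_key_id(
        "ocid1.tenancy.oc1..exampletenancy",
        "ocid1.user.oc1..exampleuser",
        "12:34:56:78:9a:bc:de:f0:12:34:56:78:9a:bc:de:f0",
    ))
```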
22+
23+
24+
25+
# **2. Pick your compartment**
26+
Identify the compartment you are working in. Navigate to 'Identity' -> 'Compartments', locate your compartment, and make a note of its OCID (Oracle Cloud Identifier).
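
Copy-and-paste mistakes with OCIDs are easy to make, so a quick format check can help before the value is stored in the project. The sketch below is a hypothetical helper, not part of the project. It assumes the general OCID layout `ocid1.<resource type>.<realm>.[region].<unique ID>` and accepts a tenancy OCID as well, since the root compartment uses the tenancy OCID. It only checks the shape of the string, not whether the compartment exists.

```python
def looks_like_compartment_ocid(value):
    """Return True if value has the general shape of a compartment OCID."""
    parts = value.strip().split(".")
    # Expected: version, resource type, realm, (possibly empty) region, unique ID.
    if len(parts) < 5:
        return False
    version, resource_type, realm = parts[:3]
    unique_id = parts[-1]
    return (
        version == "ocid1"
        and resource_type in {"compartment", "tenancy"}
        and bool(realm)
        and bool(unique_id)
    )


if __name__ == "__main__":
    # Placeholder OCID, not a real identifier.
    print(looks_like_compartment_ocid("ocid1.compartment.oc1..examplecompartment"))
```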
27+
28+
# **3. Open Visual Builder**
29+
## Import Visual Builder project
30+
* Open Visual Builder and click on the "Import" button. Choose "Application from file".
31+
* Drop the project zip file.
32+
* Provide a name and an ID, for example "Podcast_Generator". Click the Import button.
33+
![alt text](import_project.jpg)
34+
35+
## Configure REST APIs authentication
36+
* Open the recently created project.
37+
38+
* Click the Services button (left side) and then click "Backends".
39+
![alt text](services.jpg)
40+
41+
* Now click TTS, then Servers, to edit the server authentication.
42+
![alt text](edit_tts.jpg)
43+
44+
* Click the pencil icon to provide the OCI credentials.
45+
![alt text](edit_tts_2.jpg)
46+
47+
* Provide the credentials you obtained in step 1.
48+
![alt text](signature.jpg)
49+
50+
* Repeat the same process with the GenAI backend.
51+
52+
## Provide your compartmentId
53+
* Set the default value of the project variable named "comparmentid" to the compartment OCID you noted in step 2.
54+
![alt text](compartmentid.jpg)
55+
56+
# **4. Preview the application**
57+
* You can now provide a topic in the text area and click the "generate" button.
58+
![alt text](preview.jpg)
59+
60+
* A podcast script is created automatically, and the audio podcast is then generated using the Cindy and Bob voices.
61+
![alt text](generated.jpg)
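
To illustrate how a two-voice script can be prepared for synthesis, here is a minimal Python sketch that splits a script into speaker turns. It assumes, purely for illustration, that each turn starts with the speaker name followed by a colon (for example `Cindy: ...`). The actual format of the generated script and the way the project assigns voices are not shown in this guide, so the function `split_turns` is hypothetical.

```python
def split_turns(script, speakers=("Cindy", "Bob")):
    """Split a script into (speaker, text) turns.

    Lines that do not start with a known speaker name are treated as a
    continuation of the previous turn. Lines before the first turn are dropped.
    """
    turns = []
    for raw_line in script.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        # Split at the first colon only, so colons inside the text survive.
        name, sep, text = line.partition(":")
        if sep and name.strip() in speakers:
            turns.append((name.strip(), text.strip()))
        elif turns:
            speaker, previous = turns[-1]
            turns[-1] = (speaker, f"{previous} {line}".strip())
    return turns


if __name__ == "__main__":
    sample = "Cindy: Welcome back!\nBob: Thanks, Cindy.\nToday we look at low-code."
    for speaker, text in split_turns(sample):
        print(f"{speaker} -> {text}")
```

Each resulting turn can then be synthesized separately with the voice that matches its speaker.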
62+
63+
## Notes
64+
* At the time of writing this tutorial, the Text-to-Speech service is available exclusively in the Phoenix region.
65+
66+
## I hope you liked it.
67+
Author: Jesús Brasero