
Commit acc850f

fix whisper transcription readme and remove unneeded file
1 parent 68f6ce3 commit acc850f

File tree

2 files changed: +50, -39 lines changed

consolidated_bluperints_2.json

Lines changed: 0 additions & 1 deletion
This file was deleted.

docs/sample_blueprints/other/whisper_transcription/README.md

Lines changed: 50 additions & 38 deletions
@@ -1,66 +1,75 @@
# Whisper Transcription API

#### Transcription + Summarization + Diarization Pipeline (FastAPI-powered)

This blueprint provides a complete solution for running **audio/video transcription**, **speaker diarization**, and **summarization** via a RESTful API. It integrates [Faster-Whisper](https://github.com/guillaumekln/faster-whisper) for efficient transcription, [pyannote.audio](https://github.com/pyannote/pyannote-audio) for diarization, and Hugging Face instruction-tuned LLMs (e.g., Mistral-7B) for summarization. It supports multi-GPU acceleration, real-time streaming logs, and JSON/text output formats.

---

## Pre-Filled Samples

Below are pre-configured blueprints for deploying Whisper transcription using different GPU configurations on Oracle Cloud Infrastructure.

| Feature Showcase                                                     | Title              | Description                                                 | Blueprint File                                                     |
| -------------------------------------------------------------------- | ------------------ | ----------------------------------------------------------- | ------------------------------------------------------------------ |
| Deploy Whisper transcription on A10 GPU for real-time speech-to-text | A10 Transcription  | Real-time audio transcription with Whisper on BM.GPU.A10.8  | [whisper-transcription-A10.json](whisper-transcription-A10.json)   |
| Deploy Whisper transcription on A100 GPU for high-speed processing   | A100 Transcription | High-performance Whisper transcription using BM.GPU.A100.8  | [whisper-transcription-A100.json](whisper-transcription-A100.json) |
| Deploy Whisper transcription on H100 GPU for next-gen AI workloads   | H100 Transcription | Ultra-fast Whisper transcription on BM.GPU.H100.8           | [whisper-transcription-H100.json](whisper-transcription-H100.json) |

---

## In-Depth Feature Overview

| Capability           | Description                                                                                     |
| -------------------- | ----------------------------------------------------------------------------------------------- |
| Transcription        | Fast, multi-GPU inference with Faster-Whisper                                                   |
| Summarization        | Uses Mistral-7B (or other HF models) to create summaries of long transcripts                    |
| Speaker Diarization  | Global speaker labeling via pyannote.audio                                                      |
| Denoising            | Hybrid removal of background noise using Demucs and noisereduce                                 |
| Real-Time Streaming  | Logs stream live via HTTP if enabled                                                            |
| Format Compatibility | Supports `.mp3`, `.wav`, `.flac`, `.aac`, `.m4a`, `.mp4`, `.webm`, `.mov`, `.mkv`, `.avi`, etc. |

---

## Deployment on OCI Blueprint

### Sample Recipe (Service Mode)
See [whisper-transcription-A10.json](whisper-transcription-A10.json) for an example recipe.

### Endpoint

```
POST https://<YOUR_DEPLOYMENT>.nip.io/transcribe
```

**Example:**
`https://whisper-transcription-a10-6666.130-162-199-33.nip.io/transcribe`

---

## API Parameters

| Name               | Type   | Description                                                                                      |
| ------------------ | ------ | ------------------------------------------------------------------------------------------------ |
| `audio_url`        | string | URL to the audio file in OCI Object Storage (requires a PAR)                                      |
| `model`            | string | Whisper model to use: `base`, `medium`, `large`, `turbo`, etc.                                    |
| `summary`          | bool   | Whether to generate a summary (default: false). Requires `hf_token` if no model path is provided  |
| `speaker`          | bool   | Whether to run diarization (default: false). Requires `hf_token`                                  |
| `max_speakers`     | int    | (Optional) Maximum number of speakers expected for diarization                                    |
| `denoise`          | bool   | Whether to apply noise reduction                                                                  |
| `streaming`        | bool   | Enables real-time logs via the `/stream_log` endpoint                                             |
| `hf_token`         | string | Hugging Face access token (required for diarization or HF-hosted summarizers)                     |
| `prop_decrease`    | float  | (Optional) Level of noise suppression. Range: 0.0–1.0 (default: 0.7)                              |
| `summarized_model` | string | (Optional) Path or HF model ID for the summarizer. Default: `mistralai/Mistral-7B-Instruct-v0.1`  |
| `ground_truth`     | string | (Optional) Path to a reference transcript file used to compute WER                                |

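For programmatic use, the multipart form fields above can be assembled in Python. This is a minimal sketch, not part of the blueprint: `build_transcribe_form` is a hypothetical helper, and the lowercase boolean-string convention simply mirrors the `-F` flags in the cURL example below.

```python
def build_transcribe_form(audio_url, model="base", summary=False,
                          speaker=False, denoise=False, streaming=False,
                          hf_token=None):
    """Assemble multipart form fields for POST /transcribe.

    Booleans are sent as lowercase strings, mirroring the -F flags in
    the cURL example. (Hypothetical helper, for illustration only.)
    """
    fields = {
        "audio_url": audio_url,
        "model": model,
        "summary": str(summary).lower(),
        "speaker": str(speaker).lower(),
        "denoise": str(denoise).lower(),
        "streaming": str(streaming).lower(),
    }
    if hf_token:
        fields["hf_token"] = hf_token
    # (None, value) tells `requests` to send a plain form field, not a file upload
    return {k: (None, v) for k, v in fields.items()}

# Usage with `requests` (needs a live deployment, so commented out):
# import requests
# resp = requests.post(
#     "https://<YOUR_DEPLOYMENT>.nip.io/transcribe",
#     files=build_transcribe_form("<YOUR_PAR_URL>", model="large"),
#     verify=False,  # mirrors curl -k
# )
# print(resp.text)
```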
---

## Example cURL Command

```bash
curl -k -N -L -X POST https://<YOUR_DEPLOYMENT>.nip.io/transcribe \
  -F "audio_url=<YOUR_PAR_URL>" \
```

@@ -88,13 +97,16 @@ Each processed audio generates the following:

## Streaming Logs

If `streaming=true`, the response will contain a log filename:

```json
{
  "meta": "logfile_name",
  "logfile": "transcription_log_remote_audio_<timestamp>.log"
}
```

To stream logs in real-time:

```bash
curl -N https://<YOUR_DEPLOYMENT>.nip.io/stream_log/<log_filename>
```
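The two-step flow above (read `logfile` from the JSON response, then request `/stream_log/<log_filename>`) can be sketched in Python. `stream_log_url` is a hypothetical helper for illustration, not part of the blueprint:

```python
import json

def stream_log_url(base_url, transcribe_response_text):
    """Extract the log filename from the /transcribe JSON response
    and build the matching /stream_log URL."""
    payload = json.loads(transcribe_response_text)
    return f"{base_url}/stream_log/{payload['logfile']}"

# With the response shape shown above:
response_text = '{"meta": "logfile_name", "logfile": "transcription_log_remote_audio_123.log"}'
url = stream_log_url("https://<YOUR_DEPLOYMENT>.nip.io", response_text)
print(url)  # https://<YOUR_DEPLOYMENT>.nip.io/stream_log/transcription_log_remote_audio_123.log
```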
@@ -113,15 +125,15 @@ https://huggingface.co/settings/tokens

## Dependencies

| Package              | Purpose                        |
| -------------------- | ------------------------------ |
| `faster-whisper`     | Core transcription engine      |
| `transformers`       | Summarization via Hugging Face |
| `pyannote.audio`     | Speaker diarization            |
| `pydub`, `librosa`   | Audio chunking and processing  |
| `demucs`             | Vocal separation / denoising   |
| `fastapi`, `uvicorn` | REST API server                |
| `jiwer`              | WER evaluation                 |

---

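The `ground_truth` parameter reports WER through `jiwer`. Conceptually, WER is a word-level Levenshtein distance (substitutions + insertions + deletions) divided by the reference word count; the following self-contained sketch of that computation is illustrative only, not the blueprint's actual code:

```python
def word_error_rate(reference, hypothesis):
    """WER = (substitutions + deletions + insertions) / reference length,
    computed via word-level Levenshtein distance (what `jiwer` reports)."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,          # deletion
                           dp[i][j - 1] + 1,          # insertion
                           dp[i - 1][j - 1] + cost)   # substitution / match
    return dp[len(ref)][len(hyp)] / max(len(ref), 1)

print(word_error_rate("the cat sat", "the cat sat"))  # 0.0
print(word_error_rate("the cat sat", "the bat sat"))  # ≈ 0.333 (1 substitution / 3 words)
```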