Skip to content

Commit 8410392

Browse files
committed
More readme updates
1 parent 8c32347 commit 8410392

File tree

3 files changed

+38
-20
lines changed
  • functional-samples

3 files changed

+38
-20
lines changed
Lines changed: 14 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,22 @@
1-
# Alt-texter: On-device multimodal AI with Gemini Nano - image understanding
1+
# Alt Texter: Generate accessible image descriptions with Chrome's multimodal Prompt AI
22

3-
This sample demonstrates how to use the image understanding capabilities of the multi-modal Gemini Nano API preview together with [Chrome's translation API](https://developer.chrome.com/docs/ai/translator-api). To learn more about the API and how to sign-up for the origin trial, head over to [Built-in AI on developer.chrome.com](https://developer.chrome.com/docs/extensions/ai/prompt-api).
3+
This sample demonstrates how to use Chrome's built-in AI APIs to generate alt text for images, making web content more accessible. It combines two on-device AI capabilities:
4+
5+
- **[Prompt API](https://developer.chrome.com/docs/extensions/ai/prompt-api)** with multimodal input (Gemini Nano) for image understanding
6+
- **[Translator API](https://developer.chrome.com/docs/ai/translator-api)** for translating descriptions into multiple languages
47

58
## Overview
69

7-
This extension adds a context menu entry for images on the web to generate an alt text description that is displayed in a popup window.
10+
Alt Texter adds a context menu entry for images on the web. When activated, it:
11+
12+
1. Analyzes the image using Gemini Nano's multimodal capabilities
13+
2. Generates a concise, functional description following accessibility best practices (object-action-context framework)
14+
3. Displays the description in a popup where you can optionally translate it
15+
4. Lets you copy the alt text to your clipboard for use elsewhere
816

917
## Running this extension
1018

1119
1. Clone this repository.
12-
1. Load this directory in Chrome as an [unpacked extension](https://developer.chrome.com/docs/extensions/get-started/tutorial/hello-world#load-unpacked).
13-
1. Right click an image on a webpage and select "Generate alt text"
20+
2. Load this directory in Chrome as an [unpacked extension](https://developer.chrome.com/docs/extensions/get-started/tutorial/hello-world#load-unpacked).
21+
3. Right-click an image on a webpage and select "Generate alt text".
22+
4. Wait for the description to be generated, then optionally translate it or copy it to your clipboard.
Lines changed: 14 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,20 +1,27 @@
1-
# Audio-Scribe: On-device multimodal AI with Gemini Nano - audio transcription
1+
# Audio-Scribe: Transcribe audio messages with Chrome's multimodal Prompt API
22

3-
This sample demonstrates how to use the audio transcription capabilities of the multi-modal Gemini Nano API preview. To learn more about the API and how to sign-up for the origin trial, head over to [Built-in AI on developer.chrome.com](https://developer.chrome.com/docs/extensions/ai/prompt-api).
3+
This sample demonstrates how to use Chrome's built-in AI APIs to transcribe audio messages directly in the browser. It uses:
4+
5+
- **[Prompt API](https://developer.chrome.com/docs/extensions/ai/prompt-api)** with multimodal audio input (Gemini Nano) for on-device speech-to-text transcription
46

57
## Overview
68

7-
This extension adds a sidepanel that will, when opened, display a transcription of all audio files on a web page (currently it looks only for audio files created using `URL.createObjectUrl`).
9+
Audio-Scribe adds a side panel that automatically transcribes audio messages from chat applications. When activated, it:
10+
11+
1. Monitors the page for audio blobs created via `URL.createObjectURL`
12+
2. Detects audio content and sends it to Gemini Nano for transcription
13+
3. Streams the transcribed text in real-time to the side panel
14+
4. Works with messaging apps like WhatsApp Web that use blob URLs for audio messages
815

916
## Running this extension
1017

1118
1. Clone this repository.
12-
1. Load this directory in Chrome as an [unpacked extension](https://developer.chrome.com/docs/extensions/get-started/tutorial/hello-world#load-unpacked).
13-
1. Open the audio-scribe sidepanel by clicking the audio-scribe action or by pressing the `ALT + A` keyboard shortcut.
14-
1. Open a chat app in the browser, for example https://web.whatsapp.com/. You can also run the demo chat app via:
19+
2. Load this directory in Chrome as an [unpacked extension](https://developer.chrome.com/docs/extensions/get-started/tutorial/hello-world#load-unpacked).
20+
3. Open a chat app in the browser, for example https://web.whatsapp.com/. You can also run the included demo chat app:
1521
```
1622
npx serve demo-chat-app
1723
```
18-
1. All audio messages in the current chat will be transcribed in the side panel.
24+
4. Open the Audio-Scribe side panel by clicking the extension icon or pressing `Alt+A`.
25+
5. Play or load audio messages in the chat - they will be automatically transcribed in the side panel.
1926

2027
![Screenshot displaying a demo chat app with a few audio messages. On the right, there is the audio-scribe extension's sidepanel which displayes the transcribed text messages](assets/screenshot.png)

functional-samples/ai.gemini-on-device-summarization/README.md

Lines changed: 10 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,19 +1,21 @@
11
# On-device Summarization with Gemini Nano
22

3-
This sample demonstrates how to use the experimental Summarization API in Chrome. To learn more about the API and how to sign-up for the preview, head over to the [summarizer guide on developer.chrome.com](https://developer.chrome.com/docs/ai/summarizer-api).
3+
This sample demonstrates how to use Chrome's built-in Summarizer API to generate AI-powered summaries of web pages directly on the user's device. The summarization runs entirely locally using Gemini Nano, ensuring privacy and fast performance without requiring an internet connection or API keys.
4+
5+
To learn more about the Summarizer API, head over to the [Summarizer API guide on developer.chrome.com](https://developer.chrome.com/docs/ai/summarizer-api).
46

57
## Overview
68

7-
The extension summarizes the content of the currently open tab. It uses Mozilla's [readability](https://github.com/mozilla/readability) library to extract the content of the currently active tab and displays a summary of the page generated by [Chrome's experimental summarization API](https://developer.chrome.com/blog/august2024-summarization-ai) in a side panel.
9+
This extension adds a side panel that automatically displays AI-generated summaries of any web page you visit. It uses Mozilla's [Readability](https://github.com/mozilla/readability) library to extract the main content from web pages (stripping away navigation, ads, and other clutter), then passes that content to Chrome's built-in Summarizer API.
810

911
## Running this extension
1012

11-
1. Clone this repository
12-
1. Run `npm install` in this folder to install all dependencies.
13-
1. Run `npm run build` to build the extension.
14-
1. Load the newly created `dist` directory in Chrome as an [unpacked extension](https://developer.chrome.com/docs/extensions/get-started/tutorial/hello-world#load-unpacked).
15-
1. Click the extension icon to open the summary side panel.
16-
1. Open any web page, the page's content summary will automatically be displayed in the side panel.
13+
1. Clone this repository.
14+
2. Run `npm install` in this folder to install all dependencies.
15+
3. Run `npm run build` to build the extension.
16+
4. Load the newly created `dist` directory in Chrome as an [unpacked extension](https://developer.chrome.com/docs/extensions/get-started/tutorial/hello-world#load-unpacked).
17+
5. Click the extension icon to open the summary side panel.
18+
6. Open any web page. The page's content summary will automatically be displayed in the side panel.
1719

1820
## Creating your own extension
1921

0 commit comments

Comments
 (0)