Replies: 3 comments 1 reply
-
|
I think it's an interesting idea, especially since they also have a way to for metadata, but I recommend against this for three reasons:
|
Beta Was this translation helpful? Give feedback.
-
|
Reading the transcript text - https://github.com/Podcastindex-org/podcast-namespace/blob/main/docs/examples/transcripts/transcripts.md "Want to support only one format? WebVTT is used by Apple Podcasts for ingest, and also natively supported by web browsers. Because the WebVTT format is the most flexible, it's an ideal choice if you can only support one format." The JSON representation is a flexible format that accomodates various degrees of fidelity in a concise way. At the most precise, it enables word-by-word highlighting. This format for podcast transcripts should adhere to the following specifications. Apple uses the VTT format with accurate word highlighting. |
Beta Was this translation helpful? Give feedback.
-
|
I think there are three proposals here from @dascritch - let's see if I can help unpack them: 1. "Use WebVTT standard" I agree with this part of the proposal. I'd like to propose that we retire SRT/TXT/HTML format to simplify the specification. WebVTT is a standard for browsers, which supports VTT files out of the box for video, and supports VTT files for audio quite simply as well. Here's use of an AUDIO player with VTT support. SRT files are much less well supported by the web. If you have to have an SRT file for your application, then it's an easy transform from a VTT file. They're almost identical in nature. Similarly, a TXT or HTML transcription can be built from a VTT file as well. Removing complexity from the 2. "Use WebVTT chapters" WebVTT has chapter support. However, podcasting uses four chapter formats currently - "chapters in descriptions" (Apple, YouTube, Spotify); "chapters in podlove format" (Spotify?); "JSON chapters" (Apple); ID3 tags (Apple). I would be keen to avoid adding a fifth chapter format without a clear understanding of the benefits. I don't believe that chapters are in-use by browser implementations. 3. "Improve the "cooked-up" JSON file" I do see the benefit of offering a word-by-word format (which VTT isn't). Is there any prior art in word-by-word format? Should we be aligning with a standard? Next stepsDo we split out the three parts of this proposal? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
WebVTT is a standard well implemented in browsers for chapters, annotations and captions because it is in WhatWG and W3C recommended implemntations for the Media objects .
In https://github.com/Podcastindex-org/podcast-namespace/blob/main/docs/1.0.md#user-content-chapters , for the tag
<podcast:chapters>, it states that browsers aren't supporting ID3 chapters tags. This is true. But instead the proposal use a less standard solution, a brand new json file.Support for WebVTT files is nearly complete on web browsers and are a W3C standard.](https://www.w3.org/TR/webvtt1/), works smoothly in 99% market-share browsers, are exposed and used in accessibility tools. So why preconize a new file format without native implementation instead to use one perfectly used now for 10 years (in subtitling, but it works perfectly too for chaptering, i'm using it) ?
I suggest to change this to recommend WebVTT as a preferred solution, mime type
text/vtt, documentation https://www.w3.org/TR/webvtt1/, and alternatively to suggestapplication/json+chapters.cf #315
Beta Was this translation helpful? Give feedback.
All reactions