-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Complexity and bandwidth are significant challenges to accessing metadata embedded within a video file. Detailed knowledge of the video encoding format is required to successfully access the metadata packets within its structure. For example, OGC Testbed-16: Full Motion Video to Moving Features Engineering Report outlines the five steps required to access MISB metadata packets from a MPEG-2 Transport Stream file.
A useful first step for analysis is to export metadata verbatim from a video file to a VMT file. Decoupling metadata packets from a video file facilitates access - as described in the Testbed-16 video - and requires no knowledge of the metadata content itself.
Binary metadata can be exported as text to a VMT file in several ways, including:
- Hexadecimal: Each data byte is encoded as two text characters that represent the hexadecimal value of the 8-bit number, such as '3F' to represent a value of 63. This format is verbose, though easy to edit.
- Base64: Each 8-bit data byte is encoded into a 6-bit range using only printable ASCII characters. This format is cheap to compute and more compact than hexadecimal, but less easy to edit.
Integration with HTML allows WebVMT to synchronise the exported metadata with a video element in a web browser. Those metadata can then be surfaced with either DataCue or a custom cue based on TextTrackCue for rapid analysis of metadata content using reuseable tools and integrated with web services.
Timed metadata can be exposed in its original binary form in HTML so the data consumer may be agnostic of the text encoding. Once initial analysis has interpreted the binary metadata content, this step can be modified to expose the decoded metadata content in a more accessible form.
graph TD;
A(Video file) --> |Export| B;
B(Metadata packets) --> |Hex| C;
B --> |Base64| C;
C(VMT file) --> |DataCue| D;
C --> |Custom cue| D;
D(HTML) --> |Timed metadata| E;
E(Data consumer);