The spec says:
The font shorthand property on the (root) list of WebVTT Node Objects must be set to 5vh sans-serif.
However browsers don't do that:
- Gecko uses 5% of the video element height (code).
- Blink uses 5% of the smaller of video element width or height (code).
- WebKit seems to just use a fixed value based off preferences? (code).
It seems per #489 the idea is that these are relative to the video size?