So if VP9 and AV1 already store HDR data, How is Dolby Vision used? Its HDR metadata shipped alongside video formats that don't already encode this data?
Like if I stream netflix to a TV supporting Dolby Vision what format is the video being streamed in and is the TV manufacturer just paying Dolby for the right to correctly decode this HDR info then?
Dolby Vision supplements the HDR data already present in a video. For example, when you buy a Blu-ray disc that supports Dolby Vision, the disc contains several m2ts files containing HEVC encoded videos. Present within the HEVC stream is also the metadata, which supplies metadata for each frame. To see what this data is, I used[1], and then got the info for frame 1000 and saved it as a json in this [2] pastebin. As you can see, it contains info regarding the minimum and maximum PQ encoded values, the coefficients of the ycc to rgb matrix, among other things. This allows TVs to better display the HDR data, as currently, video data encoded using rec. 2020 color primaries with max light level of 10000 nits is far outside what current TVs are capable of displaying, so metadata showing max pq of a frame or scene allows these devices to make better decisions.
Dolby Vision is a format of that metadata - it's hiding either at the beginning or as part of a frames in proprietary metadata extensions. When the video is being played, this metadata is extracted by video decoder (or demuxer) and then sent together with video frames to the display where the display then applies the
(color, etc.) metadata to correctly show the frames based on its capabilities.
Since the format of this metadata is proprietary, the demuxers, decoders and displays need to understand it and properly apply it when rendering. That's the part that needs to be implemented and paid for.
But that's really not all of the technology - Dolby Vision isn't just the metadata format, it's also definition of how the videos are mastered and under which limitations (e.g. DV allows video to be stored in 12-bit per pixel format, allows mastering with up to 10.000 nits of brightness for white pixels and defines wider color range so better, brighter colors can be displayed by a panel capable of doing that).
https://www.elecard.com/page/article_hdr is actually a pretty good article that overviews this topic (although you do need a basic understading how digital video encoding works).
Like if I stream netflix to a TV supporting Dolby Vision what format is the video being streamed in and is the TV manufacturer just paying Dolby for the right to correctly decode this HDR info then?