a-dub 53 minutes ago
this is interesting. would be cool to explore something like integrating a vlm to add a "semantic" term to the loss function. looking through the comparisons, some of the baseline codecs create meaningfully different details (as could be described by text) in the images.
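a rough sketch of what a "semantic" term like that could look like, purely as an illustration. it assumes you already have embedding vectors for the original and the decoded image from some vlm image encoder (not shown here), and that the codec is trained on a rate-distortion style objective. the names `cosine_distance`, `total_loss`, `lam` and `mu` are made up for this example and are not from the project being discussed.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return 1.0 - dot / (norm_a * norm_b)


def total_loss(distortion, rate, emb_orig, emb_recon, lam=0.01, mu=0.1):
    """Hypothetical objective: distortion + lam * rate + mu * semantic term.

    emb_orig / emb_recon are assumed to be vlm embeddings of the original
    and reconstructed image; lam and mu are illustrative weights only.
    """
    semantic = cosine_distance(emb_orig, emb_recon)
    return distortion + lam * rate + mu * semantic
```

the idea is that two reconstructions with the same pixel distortion would be ranked differently if one of them changes what the image "says" (i.e. its embedding drifts further from the original's). in a real training setup the embeddings would have to come from a differentiable encoder so gradients can flow back into the codec.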