Learnings from 4 months of Image-Video VAE experiments

36 points by schopra9091 day ago

4 comments

DonThomasitos7 minutes ago
Nice summary! I missed the mention of EQ-VAE when it comes to generation quality. Tiny trick, huge impact! Have you tried it?
lastdong41 minutes ago
This seems like a great model to experiment fine tuning with original art, given it’s relatively small and with open license. Is that a fair assessment?<p>Thanks for the great write up and making it available to us all.
- schopra90932 minutes ago
  yep, Apache 2.0! so anyone's welcome to download and hack away
schopra9091 day ago
Hi HN, I’m one of the two authors of the post and the Linum v2 text-to-video model (<a href="https://news.ycombinator.com/item?id=46721488">https://news.ycombinator.com/item?id=46721488</a>). We're releasing our Image-Video VAE (open weights) and a deep dive on how we built it. Happy to answer questions about the work!
fjejfhdh51 minutes ago
I take my children to school to learn them how to use English grammar.