Show HN: Text-to-video model from scratch (2 brothers, 2 years, 2B params)

https://huggingface.co/collections/Linum-AI/linum-v2-2b-text-to-video
5•schopra909•1h ago
Writeup (includes good/bad sample generations): https://www.linum.ai/field-notes/launch-linum-v2

-------

We're Sahil and Manu, two brothers who spent the last 2 years training text-to-video models from scratch. Today we're releasing them under Apache 2.0.

These are 2B param models capable of generating 2-5 seconds of footage at either 360p or 720p. In terms of model size, the closest comparison is Alibaba's Wan 2.1 1.3B. From our testing, we get significantly better motion capture and aesthetics.

We're not claiming to have reached the frontier. For us, this is a stepping stone towards SOTA - proof we can train these models end-to-end ourselves.

--------------------------------

Why train a model from scratch?

--------------------------------

We shipped our first model in January 2024 (pre-Sora) as a 180p, 1-second GIF bot, bootstrapped off Stable Diffusion XL. Image VAEs don't understand temporal coherence, and without the original training data, you can't smoothly transition between image and video distributions. At some point you're better off starting over.

For v2, we use T5 for text encoding, Wan 2.1 VAE for compression, and a DiT-variant backbone trained with flow matching. We built our own temporal VAE but Wan's was smaller with equivalent performance, so we used it to save on embedding costs. (We'll open-source our VAE shortly.)
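For readers unfamiliar with flow matching, here is a minimal sketch of the training objective in plain Python. It assumes the common straight-line path between a noise sample and a data sample, with t=0 at noise and t=1 at data; the post does not say which variant was used, and the names `flow_matching_pair`, `flow_matching_loss`, and `predict_velocity` are hypothetical. Text conditioning and VAE latents are left out to keep it short.

```python
import random


def flow_matching_pair(x0, x1, t):
    """Return the point at time t on the straight path x0 -> x1 and its velocity.

    x0 is a noise sample, x1 is a data sample (assumed convention).
    """
    x_t = [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]
    # Along a straight path the velocity is constant: d(x_t)/dt = x1 - x0.
    v_target = [b - a for a, b in zip(x0, x1)]
    return x_t, v_target


def flow_matching_loss(predict_velocity, x1, rng):
    """Mean squared error between predicted and target velocity for one sample.

    predict_velocity(x_t, t) stands in for the backbone network (hypothetical).
    """
    x0 = [rng.gauss(0.0, 1.0) for _ in x1]  # Gaussian noise sample
    t = rng.random()                        # uniform time in [0, 1)
    x_t, v_target = flow_matching_pair(x0, x1, t)
    v_pred = predict_velocity(x_t, t)
    return sum((p - q) ** 2 for p, q in zip(v_pred, v_target)) / len(x1)
```

In a real training loop, `predict_velocity` would be the DiT-style backbone and the loss would be averaged over a batch; at sampling time the learned velocity field is integrated from noise towards data.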

The bulk of development time went into building curation pipelines that actually work (e.g., hand-labeling aesthetic properties and fine-tuning VLMs to filter at scale).

What works: Cartoon/animated styles, food and nature scenes, simple character motion

What doesn't: Complex physics, fast motion (e.g., gymnastics, dancing), consistent text

-----------------------------------

Why build this when Veo/Sora exist?

-----------------------------------

Products are extensions of the underlying model's capabilities. If users want a feature the model doesn't support (character consistency, camera controls, editing, style mapping, etc.), you're stuck.

To build the product we want, we need to update the model itself. That means owning the development process.

It's a bet that will take time (and a lot of GPU compute) to pay off, but we think it's the right one.

What’s next?

- Post-training for physics/deformations
- Distillation for speed
- Audio capabilities
- Model scaling

Happy to answer questions about building a model from 0 → 1. We kept a “lab notebook” of all our experiments in Notion and we'll be blogging about our learnings throughout the year.

Comments

streamer45•43m ago
Rad! huggingface link gives 404 on my side though.
schopra909•41m ago
Oh damn! Thanks for catching that -- going to ping the HF folks to see what they can do to fix the collection link.

In the meantime, here are the individual links to the models:

https://huggingface.co/Linum-AI/linum-v2-720p
https://huggingface.co/Linum-AI/linum-v2-360p

schopra909•38m ago
Should be fixed now! Thanks again for the heads up
streamer45•36m ago
All good, cheers!
schopra909•11m ago
Per the RAM comment, you may be able to get it to run locally with two tweaks:

https://github.com/Linum-AI/linum-v2/blob/298b1bb9186b5b9ff6...

1) Free the T5 encoder as soon as the text is encoded, so you reclaim GPU RAM

2) Manual Layer Offloading; move layers off GPU once they're done being used to free up space for the remaining layers + activations
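To see why the two tweaks above help, here is a toy accounting sketch in plain Python, not actual GPU code. It estimates peak GPU memory for one generation under simplifying assumptions: when the encoder is freed, the backbone is loaded only after text encoding finishes, and with offloading only one layer is resident at a time. The function name `peak_gpu_gb` and every size below are hypothetical, not measurements of the Linum models.

```python
def peak_gpu_gb(encoder_gb, layer_gbs, activations_gb,
                free_encoder=False, offload_layers=False):
    """Estimate peak GPU memory in GB under a simplified two-phase model."""
    # Tweak 1: if the text encoder is freed after encoding, it no longer
    # occupies memory while the backbone is denoising.
    encoder_resident = 0.0 if free_encoder else encoder_gb
    # Tweak 2: with manual offloading only the layer currently running is
    # on the GPU; otherwise every layer is resident at once.
    layers_resident = max(layer_gbs) if offload_layers else sum(layer_gbs)
    denoise_peak = encoder_resident + layers_resident + activations_gb
    # The encoding phase always needs the encoder itself on the GPU.
    return max(encoder_gb, denoise_peak)


# Hypothetical sizes: a 10 GB text encoder, four 1 GB layers, 2 GB activations.
layers = [1.0, 1.0, 1.0, 1.0]
baseline = peak_gpu_gb(10.0, layers, 2.0)
encoder_freed = peak_gpu_gb(10.0, layers, 2.0, free_encoder=True)
offload_only = peak_gpu_gb(10.0, layers, 2.0, offload_layers=True)
```

In this toy model, once the encoder is freed the peak is bounded by the encoder itself, so offloading layers on top of that only matters when the backbone plus activations are larger than the encoder. Offloading also costs time, since weights move between CPU and GPU on every step.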

streamer45•21m ago
Looks like 20GB VRAM isn't enough for the 360p demo :( need to bump my specs :sweat_smile:
