
Show HN: Text-to-video model from scratch (2 brothers, 2 years, 2B params)

https://huggingface.co/collections/Linum-AI/linum-v2-2b-text-to-video
22•schopra909•5h ago
Writeup (includes good/bad sample generations): https://www.linum.ai/field-notes/launch-linum-v2

We're Sahil and Manu, two brothers who spent the last 2 years training text-to-video models from scratch. Today we're releasing them under Apache 2.0.

These are 2B param models capable of generating 2-5 seconds of footage at either 360p or 720p. In terms of model size, the closest comparison is Alibaba's Wan 2.1 1.3B. From our testing, we get significantly better motion capture and aesthetics.

We're not claiming to have reached the frontier. For us, this is a stepping stone towards SOTA - proof we can train these models end-to-end ourselves.

Why train a model from scratch?

We shipped our first model in January 2024 (pre-Sora) as a 180p, 1-second GIF bot, bootstrapped off Stable Diffusion XL. Image VAEs don't understand temporal coherence, and without the original training data, you can't smoothly transition between image and video distributions. At some point you're better off starting over.

For v2, we use T5 for text encoding, Wan 2.1 VAE for compression, and a DiT-variant backbone trained with flow matching. We built our own temporal VAE but Wan's was smaller with equivalent performance, so we used it to save on embedding costs. (We'll open-source our VAE shortly.)
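For readers unfamiliar with flow matching, here is a minimal sketch of the training objective in plain Python. It assumes the common linear (rectified-flow style) path from noise to data; the post does not say which formulation or time convention Linum uses, and `flow_matching_example`, `flow_matching_loss`, and `predict_velocity` are hypothetical names for illustration. In the real model the predictor would be the DiT backbone operating on VAE latents, conditioned on T5 text embeddings.

```python
import random


def flow_matching_example(x0, x1, t):
    """Build one training example on the straight path from x0 (noise) to x1 (data).

    Returns the interpolated point x_t and the target velocity x1 - x0.
    Assumption: linear path with t=0 at noise and t=1 at data.
    """
    x_t = [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]
    velocity = [b - a for a, b in zip(x0, x1)]
    return x_t, velocity


def flow_matching_loss(predict_velocity, data_sample, rng):
    """Mean squared error between the predicted and target velocity.

    predict_velocity(x_t, t) is a stand-in for the network (hypothetical).
    """
    noise = [rng.gauss(0.0, 1.0) for _ in data_sample]  # x0 ~ N(0, I)
    t = rng.random()                                    # t ~ U[0, 1)
    x_t, target = flow_matching_example(noise, data_sample, t)
    pred = predict_velocity(x_t, t)
    return sum((p - v) ** 2 for p, v in zip(pred, target)) / len(target)


if __name__ == "__main__":
    rng = random.Random(0)
    # A trivial "model" that always predicts zero velocity.
    zero_model = lambda x_t, t: [0.0] * len(x_t)
    print(flow_matching_loss(zero_model, [1.0, -1.0, 0.5], rng))
```

At sampling time, a model trained this way is typically integrated from noise towards data with an ODE solver, using the predicted velocity at each step.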

The bulk of development time went into building curation pipelines that actually work (e.g., hand-labeling aesthetic properties and fine-tuning VLMs to filter at scale).

What works: Cartoon/animated styles, food and nature scenes, simple character motion. What doesn't: Complex physics, fast motion (e.g., gymnastics, dancing), consistent text.

Why build this when Veo/Sora exist? Products are extensions of the underlying model's capabilities. If users want a feature the model doesn't support (character consistency, camera controls, editing, style mapping, etc.), you're stuck. To build the product we want, we need to update the model itself. That means owning the development process. It's a bet that will take time (and a lot of GPU compute) to pay off, but we think it's the right one.

What’s next?
- Post-training for physics/deformations
- Distillation for speed
- Audio capabilities
- Model scaling

We kept a “lab notebook” of all our experiments in Notion. Happy to answer questions about building a model from 0 → 1. Comments and feedback welcome!

Comments

streamer45•4h ago
Rad! huggingface link gives 404 on my side though.
schopra909•4h ago
Oh damn! Thanks for catching that -- going to ping the HF folks to see what they can do to fix the collection link.

In the meantime, here are the individual links to the models:

https://huggingface.co/Linum-AI/linum-v2-720p
https://huggingface.co/Linum-AI/linum-v2-360p

schopra909•4h ago
Should be fixed now! Thanks again for the heads up
streamer45•4h ago
All good, cheers!
schopra909•4h ago
Per the RAM comment, you may be able to get it to run locally with two tweaks:

https://github.com/Linum-AI/linum-v2/blob/298b1bb9186b5b9ff6...

1) Free up the t5 as soon as the text is encoded, so you reclaim GPU RAM

2) Manual Layer Offloading; move layers off GPU once they're done being used to free up space for the remaining layers + activations
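The two tweaks above can be sketched roughly as follows. This is not the code in the linked repo; it is a framework-agnostic illustration that only assumes each module has a `.to(device)` method and is callable, as `torch.nn.Module` objects are. The names `encode_then_release`, `run_with_layer_offloading`, `load_encoder`, and `release_cache` are hypothetical.

```python
import gc


def encode_then_release(load_encoder, prompt, device="cuda", release_cache=None):
    """Tweak 1: encode the text, then drop the encoder to reclaim GPU RAM.

    load_encoder is a hypothetical factory returning the text encoder (e.g. T5).
    release_cache is an optional callback; with PyTorch one could pass
    torch.cuda.empty_cache to hand cached blocks back to the driver.
    """
    encoder = load_encoder()
    encoder.to(device)
    embedding = encoder(prompt)
    encoder.to("cpu")   # move the weights off the GPU
    del encoder         # drop the last local reference
    gc.collect()
    if release_cache is not None:
        release_cache()
    return embedding


def run_with_layer_offloading(layers, x, compute_device="cuda", offload_device="cpu"):
    """Tweak 2: keep only the layer currently in use on the GPU.

    Each layer is moved to the GPU, applied, then moved back, so peak memory
    is roughly one layer plus activations instead of the whole stack.
    Assumes x already lives on compute_device.
    """
    for layer in layers:
        layer.to(compute_device)
        x = layer(x)
        layer.to(offload_device)
    return x
```

The trade-off is speed: every layer is copied over PCIe on every forward pass, so generation gets slower in exchange for a smaller peak footprint.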

streamer45•4h ago
Looks like 20GB VRAM isn't enough for the 360p demo :( need to bump my specs :sweat_smile:
E-Reverance•56m ago
Post it on r/StableDiffusion