frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

So whats the next word, then? Almost-no-math intro to transformer models

https://matthias-kainer.de/blog/posts/so-whats-the-next-word-then-/
1•oesimania•1m ago•0 comments

Ed Zitron: The Hater's Guide to Microsoft

https://bsky.app/profile/edzitron.com/post/3me7ibeym2c2n
2•vintagedave•4m ago•1 comments

UK infants ill after drinking contaminated baby formula of Nestle and Danone

https://www.bbc.com/news/articles/c931rxnwn3lo
1•__natty__•5m ago•0 comments

Show HN: Android-based audio player for seniors – Homer Audio Player

https://homeraudioplayer.app
1•cinusek•5m ago•0 comments

Starter Template for Ory Kratos

https://github.com/Samuelk0nrad/docker-ory
1•samuel_0xK•7m ago•0 comments

LLMs are powerful, but enterprises are deterministic by nature

1•prateekdalal•10m ago•0 comments

Make your iPad 3 a touchscreen for your computer

https://github.com/lemonjesus/ipad-touch-screen
2•0y•15m ago•1 comments

Internationalization and Localization in the Age of Agents

https://myblog.ru/internationalization-and-localization-in-the-age-of-agents
1•xenator•16m ago•0 comments

Building a Custom Clawdbot Workflow to Automate Website Creation

https://seedance2api.org/
1•pekingzcc•18m ago•1 comments

Why the "Taiwan Dome" won't survive a Chinese attack

https://www.lowyinstitute.org/the-interpreter/why-taiwan-dome-won-t-survive-chinese-attack
1•ryan_j_naughton•19m ago•0 comments

Xkcd: Game AIs

https://xkcd.com/1002/
1•ravenical•20m ago•0 comments

Windows 11 is finally killing off legacy printer drivers in 2026

https://www.windowscentral.com/microsoft/windows-11/windows-11-finally-pulls-the-plug-on-legacy-p...
1•ValdikSS•21m ago•0 comments

From Offloading to Engagement (Study on Generative AI)

https://www.mdpi.com/2306-5729/10/11/172
1•boshomi•23m ago•1 comments

AI for People

https://justsitandgrin.im/posts/ai-for-people/
1•dive•24m ago•0 comments

Rome is studded with cannon balls (2022)

https://essenceofrome.com/rome-is-studded-with-cannon-balls
1•thomassmith65•29m ago•0 comments

8-piece tablebase development on Lichess (op1 partial)

https://lichess.org/@/Lichess/blog/op1-partial-8-piece-tablebase-available/1ptPBDpC
2•somethingp•30m ago•0 comments

US to bankroll far-right think tanks in Europe against digital laws

https://www.brusselstimes.com/1957195/us-to-fund-far-right-forces-in-europe-tbtb
3•saubeidl•31m ago•0 comments

Ask HN: Have AI companies replaced their own SaaS usage with agents?

1•tuxpenguine•34m ago•0 comments

pi-nes

https://twitter.com/thomasmustier/status/2018362041506132205
1•tosh•36m ago•0 comments

Show HN: Crew – Multi-agent orchestration tool for AI-assisted development

https://github.com/garnetliu/crew
1•gl2334•37m ago•0 comments

New hire fixed a problem so fast, their boss left to become a yoga instructor

https://www.theregister.com/2026/02/06/on_call/
1•Brajeshwar•38m ago•0 comments

Four horsemen of the AI-pocalypse line up capex bigger than Israel's GDP

https://www.theregister.com/2026/02/06/ai_capex_plans/
1•Brajeshwar•39m ago•0 comments

A free Dynamic QR Code generator (no expiring links)

https://free-dynamic-qr-generator.com/
1•nookeshkarri7•39m ago•1 comments

nextTick but for React.js

https://suhaotian.github.io/use-next-tick/
1•jeremy_su•41m ago•0 comments

Show HN: I Built an AI-Powered Pull Request Review Tool

https://github.com/HighGarden-Studio/HighReview
1•highgarden•41m ago•0 comments

Git-am applies commit message diffs

https://lore.kernel.org/git/bcqvh7ahjjgzpgxwnr4kh3hfkksfruf54refyry3ha7qk7dldf@fij5calmscvm/
1•rkta•44m ago•0 comments

ClawEmail: 1min setup for OpenClaw agents with Gmail, Docs

https://clawemail.com
1•aleks5678•51m ago•1 comments

UnAutomating the Economy: More Labor but at What Cost?

https://www.greshm.org/blog/unautomating-the-economy/
1•Suncho•57m ago•1 comments

Show HN: Gettorr – Stream magnet links in the browser via WebRTC (no install)

https://gettorr.com/
1•BenaouidateMed•58m ago•0 comments

Statin drugs safer than previously thought

https://www.semafor.com/article/02/06/2026/statin-drugs-safer-than-previously-thought
1•stareatgoats•1h ago•0 comments
Open in hackernews

Wan 2.6 – Open-source AI video generator with native audio sync

https://wan26.io
1•xbaicai•2mo ago

Comments

xbaicai•2mo ago
The Problem Current AI video generators struggle with audio integration. Most tools generate silent videos, forcing creators to manually add and sync audio in post-production. This breaks the creative flow and adds hours of work. Even when audio is supported, lip-sync is often off, creating an uncanny valley effect.

What We Built Wan 2.6 is a multimodal AI platform that generates videos at 1080p resolution (24fps) with audio baked in from the start. Key features:

Text-to-video: Describe what you want, get a video with synchronized audio Image-to-video: Animate static images with motion and sound Native audio sync: Audio isn't added afterwards—it's generated as part of the video creation process Precise lip-sync: Character mouth movements match the audio naturally AI image generation: Create images when you need them for video inputs How It Works We're using a breakthrough approach where audio and visual generation happen in a unified pipeline rather than as separate steps. This allows the model to understand the relationship between sound and motion from the ground up, resulting in more natural synchronization.

The system runs at 1080p (full HD) at 24fps, which hits the sweet spot between quality and generation speed. We've optimized for realistic motion and coherent multi-shot storytelling—something earlier models struggled with.

Why Open Source? We believe generative AI works best when the community can inspect, improve, and build upon it. Making Wan 2.6 open-source means:

Transparency in how the model works Community contributions to improve quality Easier integration into creative workflows No vendor lock-in for creators Use Cases We've Seen Early users are creating:

Marketing videos from product descriptions Animated social media content Concept visualizations for pitches Educational content with narration Character animations with dialogue Technical Details The model is built on a multimodal architecture that processes text, image, and audio signals simultaneously. We're generating at 1080p resolution with 24fps frame rate, which provides cinematic quality while keeping generation times reasonable.

Try It Out The platform is live at https://wan26.io. We're offering free access to let people experiment and give us feedback. We'd love to hear what you think—especially if you run into edge cases or have ideas for improvements.

What's Next We're working on:

Longer video generation (currently optimized for shorter clips) More control over camera angles and scene composition Better handling of complex multi-character scenes API access for developers We'd love your feedback! What would you use this for? What features are missing? Any creative use cases we haven't thought of?

Questions we expect:

How does this compare to Runway/Pika/Sora? Our main differentiator is native audio sync. Most competitors generate silent videos or add audio as a post-process. We also prioritize being open-source.

What are the limitations? Like all AI video generators, we occasionally produce artifacts or inconsistent motion. Longer videos are harder to keep coherent. We're actively working on these.

Can I use this commercially? Yes, the open-source nature means you can use generated content in commercial projects (check our license for specifics).