frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: What are you optimistic about in the age of AI?

1•annodomini2019•28s ago•0 comments

Show HN: DocEndorse – An AI assistant that runs your e-sign workflow in chat

1•kariopaul•1m ago•0 comments

Show HN: A 4.8MB native iOS voice transcriber built with SwiftUI

https://apps.apple.com/us/app/convoxa-ai-meeting-minutes/id6755150446
1•karamalaskar•1m ago•0 comments

Chrome will make popular scripts load faster (by picking winners)

https://danfabulich.medium.com/chrome-will-make-popular-scripts-load-faster-by-picking-winners-bc...
1•dfabulich•1m ago•0 comments

Show HN: We ship Flutter app updates without resubmitting on AppStore

https://github.com/StacDev/stac
1•divyanshub024•2m ago•0 comments

Think Smart About Sparse Compute: LatentMoE for Higher Accuracy per Flop, Param

https://research.nvidia.com/labs/nemotron/LatentMoE/
1•buildbot•2m ago•0 comments

Crabtime, a novel way to write Rust macros

https://ferrisoft.com/blog/crate_crabtime
1•adamnemecek•2m ago•0 comments

Miicrobiome influence on Black Ivory Coffee fermentation in Asian elephants

https://www.nature.com/articles/s41598-025-24196-0
1•PaulHoule•4m ago•0 comments

World War II in Europe with Flags: Every Day

https://www.youtube.com/watch?v=pQrgObFz6uo
1•mahirsaid•4m ago•1 comments

'Ralph Wiggum' loop prompts Claude to vibe-clone commercial software for $10 HR

https://www.theregister.com/2026/01/27/ralph_wiggum_claude_loops/
1•rmason•4m ago•0 comments

How to Nail Big Tech Behavioral Interviews as a Senior Software Engineer

https://newsletter.eng-leadership.com/p/how-to-nail-big-tech-behavioral-interviews
1•rbanffy•7m ago•0 comments

Zuckerberg blocked curbs on sex-talking chatbots for minors court filing alleges

https://www.reuters.com/legal/government/meta-ceo-zuckerberg-blocked-curbs-sex-talking-chatbots-m...
3•jethronethro•9m ago•0 comments

The evolution of my todo list system over 5 years

https://www.njbrown.com/blog/77/
3•ntnbr•9m ago•0 comments

How Many Chess Games Are Possible?

https://win-vector.com/2026/01/27/how-many-chess-games-are-possible/
1•jmount•11m ago•0 comments

US consumer confidence plunges to 12-year low

https://www.msn.com/en-us/money/markets/consumer-confidence-plunges-to-12-year-low/ar-AA1V6kow
3•akyuu•11m ago•0 comments

Chuck Klosterman on why we've never actually seen a real football game

https://www.latimes.com/entertainment-arts/books/story/2026-01-22/chuck-klosterman-new-book-football
5•proposal•13m ago•2 comments

Convolutional Neural Network Visualizations

https://github.com/utkuozbulak/pytorch-cnn-visualizations
1•auraham•13m ago•0 comments

Gov. Abbott orders Texas universities, agencies to halt H-1B visa petitions

https://www.texastribune.org/2026/01/26/texas-greg-abbott-h1b-visa-schools-universities/
1•malshe•14m ago•0 comments

The Census Bureau was undercounting business AI adoption

https://econlab.substack.com/p/the-census-bureau-was-undercounting
1•gmays•14m ago•0 comments

EU now has its own 'secure and encrypted' satellite communication system

https://www.euronews.com/my-europe/2026/01/27/eu-now-has-its-own-secure-and-encrypted-satellite-c...
3•akyuu•15m ago•0 comments

Betting on War: Prediction Markets and the Corruption of National Security

https://warontherocks.com/2026/01/betting-on-war-prediction-markets-and-the-corruption-of-nationa...
1•coloneltcb•15m ago•1 comments

African nations now send more money to China than they receive in new loans

https://www.reuters.com/business/finance/african-nations-now-send-more-money-china-than-they-rece...
1•DustinEchoes•16m ago•0 comments

You Can't Handle the Buddhabrot

https://lcamtuf.substack.com/p/you-cant-handle-the-buddhabrot
1•weinzierl•17m ago•0 comments

Codeless: From Idea to Software

https://www.anildash.com/2026/01/22/codeless/
3•janpio•17m ago•0 comments

Words with Spaces

https://www.linguabase.org/words-with-spaces.html
2•michaeld123•17m ago•1 comments

Show HN: 50+ open source AI-built SaaS apps

1•bhackett•18m ago•0 comments

The Rubin Observatory Will Rapidly Detect More Supernovae

https://www.universetoday.com/articles/the-rubin-observatory-will-rapidly-detect-more-supernovae
1•rbanffy•18m ago•0 comments

Show HN: Sciro – SDK to detect learner confusion without cameras or mics

https://www.sciro.site/
1•absmugz•18m ago•0 comments

CSS in 2026: The new features reshaping front end development

https://blog.logrocket.com/css-in-2026/
2•ulrischa•19m ago•0 comments

Show HN: pcpb – preview effects of `curl – bash` scripts

https://github.com/federicotdn/pcpb
1•federicotdn•19m ago•0 comments
Open in hackernews

Show HN: LemonSlice – Upgrade your voice agents to real-time video

26•lcolucci•2h ago
Hey HN, we're the co-founders of LemonSlice (try our HN playground here: https://lemonslice.com/hn). We train interactive avatar video models. Our API lets you upload a photo and immediately jump into a FaceTime-style call with that character. Here's a demo: https://www.loom.com/share/941577113141418e80d2834c83a5a0a9

Chatbots are everywhere and voice AI has taken off, but we believe video avatars will be the most common form factor for conversational AI. Most people would rather watch something than read it. The problem is that generating video in real-time is hard, and overcoming the uncanny valley is even harder.

We haven’t broken the uncanny valley yet. Nobody has. But we’re getting close and our photorealistic avatars are currently best-in-class (judge for yourself: https://lemonslice.com/try/taylor). Plus, we're the only avatar model that can do animals and heavily stylized cartoons. Try it: https://lemonslice.com/try/alien. Warning! Talking to this little guy may improve your mood.

Today we're releasing our new model* - Lemon Slice 2, a 20B-parameter diffusion transformer that generates infinite-length video at 20fps on a single GPU - and opening up our API.

How did we get a video diffusion model to run in real-time? There was no single trick, just a lot of them stacked together. The first big change was making our model causal. Standard video diffusion models are bidirectional (they look at frames both before and after the current one), which means you can't stream.

From there it was about fitting everything on one GPU. We switched from full to sliding window attention, which killed our memory bottleneck. We distilled from 40 denoising steps down to just a few - quality degraded less than we feared, especially after using GAN-based distillation (though tuning that adversarial loss to avoid mode collapse was its own adventure).

And the rest was inference work: modifying RoPE from complex to real (this one was cool!), precision tuning, fusing kernels, a special rolling KV cache, lots of other caching, and more. We kept shaving off milliseconds wherever we could and eventually got to real-time.

We set up a guest playground for HN so you can create and talk to characters without logging in: https://lemonslice.com/hn. For those who want to build with our API (we have a new LiveKit integration that we’re pumped about!), grab a coupon code in the HN playground for your first Pro month free ($100 value). See the docs: https://lemonslice.com/docs. Pricing is usage-based at $0.12-0.20/min for video generation.

Looking forward to your feedback!

EDIT: Tell us what characters you want to see in the comments and we can make them for you to talk to (e.g. Max Headroom)

*We did a Show HN last year for our V1 model: https://news.ycombinator.com/item?id=43785044. It was technically impressive but so bad compared to what we have today.

Comments

zvonimirs•2h ago
We're launching a new AI assistant and I wanted to make it alive so I started to play around with LemonSlice and I loved it!! I wanted to make our assistant be like a coworker that can give it an ability to create Loom style videos. Here's what I created - https://drive.google.com/file/d/1nIpEvNkuXA0jeZVjHC8OjuJlT-3...

Anyway, big thumbs up for the LemonSlice team, I'm excited to see it progress. I can definitely see products start coming alive with tools like this.

sid-the-kid•1h ago
Very cool! Thanks for sharing. I love your use-case of turning an AI coding agent into more of an AI employee. Will be interesting to see if users can connect better with the product this way.
sid-the-kid•1h ago
hey HN! one of the founders here. as of today, we are seeing informational avatars + roleplaying for training as the most common use cases. The roleplaying use-case was surprising to us. Think a nurse training to triage with AI patients. Or, SDRs practicing lead qualification with different kinds of clients.
buddycorp•1h ago
I'm curious if I can plug in my own OpenAI realtime voice agents into this.
lcolucci•1h ago
Good question! Yes and to do this you'd want to use our "Self-Managed Pipeline": https://lemonslice.com/docs/self-managed/overview. You can combine any TTS, LLM and STT combination with LemonSlice as the avatar layer.
jfaat•1h ago
I'm using an openAI realtime voice with livekit, and they said they have a livekit integration so it would probably be doable that way. I haven't used video in livekit though and I don't know how the plugins are setup for it
lcolucci•1h ago
Yes this is exactly right. Using the LiveKit integration you can add LemonSlice as an avatar layer on top of any voice provider
sid-the-kid•1h ago
Good question. When using the API, you can bring any voice agent (or LLM). Our API takes in what the agent will say, and then streams back the video of the agent saying it.

For the fully hosted version, we are currently partnered with ElevenLabs.

ed_mercer•1h ago
This looks super awesome!
sid-the-kid•1h ago
thank you! it's by far the thing I have worked on that I am most proud of.
dreamdeadline•1h ago
Cool! Do you plan to expose controls over the avatar’s movement, facial expressions, or emotional reactions so users can fine-tune interactions?
lcolucci•1h ago
Yes we do! Within the web app, there's a "action text prompt" section that allows you to control the overall actions of the character (e.g. "a fox talking with lots of arm motions"). We'll soon expose this in the API so you can control the characters movements dynamically (e.g. "now wave your hand")
sid-the-kid•1h ago
Our text control is good, especially for emotions. For example, you can add the text prompt: "a person talking. they are angry", and agent will have an angry expression.

You can also control background motions (like ocean waves, or a waterfall or car driving).

We are actively training a model that has better text control over hand motions.

marieschneegans•1h ago
This is next-level!
lcolucci•1h ago
Thanks so much! We're super proud of it
bennyp101•1h ago
Heads up, your privacy policy[0] does not work in dark mode - I was going to comment saying it made no sense, then I highlighted the page and more text appeared :)

[0] https://lemonslice.com/privacy

sid-the-kid•58m ago
Good catch! Working on a fix now.
sid-the-kid•44m ago
Fix deployed! This is why it's good to launch on hacker news. thanks for the tip.
bennyp101•32m ago
Nice one - thanks :)
benswerd•1h ago
The last year vs this year is crazy
sid-the-kid•50m ago
thanks! it just barley worked last year, but not much else. this year it's actually good. we got lucky: it's both new tech and turned out to be good quality.
lcolucci•48m ago
Agreed. We were so excited about the results last year and they are SO BAD now by comparison. Hopefully we'll say the same thing again in the couple months
r0fl•59m ago
Wow this is the most impressive thing I’ve seen on hacker news in years!!!!!

Take my money!!!!!!

lcolucci•53m ago
Wow thank you so much :) We're so proud of it!!
skandan•59m ago
Wow this team is non-stop!!! Wild that this small crew is dropping hit after hit. Is there an open polymarket on who acquires them?
lcolucci•47m ago
haha thank you so much! The team is incredible - small but mighty
r0fl•58m ago
Where’s the hn playground to grab a free month?

I have so many websites that would do well with this!

lcolucci•52m ago
https://lemonslice.com/hn - There's a button for "Get 1st month free" in the Developer Quickstart
r0fl•55m ago
Pricing is confusing

Video Agents Unlimited agents Up to 3 concurrent calls Creative Studio 1min long videos Up to 3 concurrent generations

Does that mean I can have a total of 1 minute of video calls? Or video calls can only be 1 minute long? Or does it mean I can have unlimited calls, 3 calls at a time all month long?

Can I have different avatars or only the same avatar x 3?

Can I record the avatar and make videos and post on social media?

lcolucci•50m ago
Sorry about the confusion. Video Agents and Creative Studio are two entirely different products. Video Agents = interactive video. Creative Studio = make a video and download it. If you're interested in real-time video calls, then Video Agents is the only pricing and feature set you should look at.
r0fl•51m ago
Wow I can’t get enough of this site! This is literally all I’ve been playing with for like half an hour. Even moved a meeting!

My mind is blown! It feels like the first time I used my microphone to chat with ai

sid-the-kid•47m ago
glad we found somebody who likes it as much as us! BTW, biggest thing we are working to improve is speed of the response. I think we can make that much faster.
lcolucci•42m ago
This comment made my day! So happy you're liking it
koakuma-chan•51m ago
> You're probably thinking, how is this useful

I was thinking why the quality is so poor.

sid-the-kid•49m ago
curious what avatar you think is poor quality? Or, what you think is poor quality. i want to know :)
koakuma-chan•46m ago
Low res and low fps. Not sure if lipsync is poor, or if low fps makes it look poor. Voice sounds low quality, as if recorded on a bad mic, and doesn't feel like it matches the avatar.
sid-the-kid•26m ago
thanks for the feedback. that's helpful. Ya, some avatars have worse lip synch than others. It depends a little on how zoomed in you are.

I am double checking now to make 100% sure we return the original audio (and not the encoded/decoded audio).

We are working on high-res.

wumms•46m ago
You could add a Max Headroom to the hn link. You might reach real time by interspersing freeze frames, duplicates, or static.
sid-the-kid•42m ago
1) yes on Max Headroom. we are on it. 2) it already is real time...?
wumms•19m ago
Whoops! Mistook the "You're about to speak with an AI."-progress bar for processing delay.
sid-the-kid•36m ago
And, just like that, Max Headroom is back: https://lemonslice.com/try/agent_ccb102bdfc1fcb30
shj2105•29m ago
Not working on mobile iOS
lcolucci•23m ago
what's not working for you?
convivialdingo•11m ago
That's super impressive! Definitely one of the best quality conversational agents I've tried syncing A/V and response times.

The text processing is running Qwen / Alibaba?

sid-the-kid•8m ago
Thank you! Yes, right now we are using Qwen for the LLM. They also released a super fast TTS model that we have not tried yet, which is supposed to be very fast.
lcolucci•5m ago
Qwen is the default but you can pick any LLM in the web app (though not the HN playground)