frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

I deleted ChatGPT (and most of my other apps)

https://www.dtlarson.com/feed-the-beast
1•ajflores1604•1m ago•1 comments

Show HN: Fivefold – a logic puzzle where the rules change every day

https://fivefold.ca
1•MattRix•1m ago•0 comments

Elevator Saga – the elevator programming game

https://play.elevatorsaga.com/
1•pavel_lishin•2m ago•0 comments

Need free help on an Industry Project?

1•sfromana•2m ago•0 comments

"Slippery Zips and Sticky Tar-Pits" from Python's Security Dev Seth Larson [pdf]

https://alpha-omega.dev/wp-content/uploads/sites/22/2025/10/ao_wp_102725a.pdf
1•AlSweigart•2m ago•1 comments

Show HN: Opensource.Builders 2.0 – find and build open source alternatives

https://opensource.builders
1•theturtletalks•3m ago•0 comments

Steam Frame

https://store.steampowered.com/sale/steamframe
5•Philpax•4m ago•1 comments

A tale of three customer service chatbots

https://pluralistic.net/2025/11/11/sorry-to-bother-you/
1•hn_acker•6m ago•0 comments

The FusionAuth MCP Server

https://fusionauth.io/blog/fusionauth-mcp-server
1•mooreds•7m ago•0 comments

Ancient enzyme structure reveals new path to sustainable ethylene production

https://phys.org/news/2025-10-ancient-enzyme-reveals-path-sustainable.html
1•PaulHoule•7m ago•0 comments

Applying to Zapier? Here's why you might meet an AI recruiter

https://zapier.com/blog/zapier-ai-recruiters/
1•mooreds•7m ago•1 comments

Show HN: Comprehensive Chicago area golf resource

https://www.golfscout.net
1•golfer•7m ago•0 comments

How to Identify a Prime Number Without a Computer

https://www.scientificamerican.com/article/how-to-identify-a-prime-number-without-a-computer/
2•beardyw•8m ago•0 comments

Directory of Turing Tests

https://turingtest.tech/
1•smooke•8m ago•0 comments

What's Next in Agentic Coding

https://seconds0.substack.com/p/heres-whats-next-in-agentic-coding
1•gmays•8m ago•0 comments

Bottlenecks in AI-Assisted Product Development

https://nahurst.substack.com/p/bottlenecks-in-ai-assisted-product
1•nathanh•9m ago•0 comments

Court: Colorado's Mandatory Social Media "Warning Labels" Are Unconstitutional

https://blog.ericgoldman.org/archives/2025/11/colorados-mandatory-social-media-warning-labels-are...
1•hn_acker•9m ago•1 comments

Ask HN: Did you notice any change in DDG's search quality over the last years?

1•basilikum•11m ago•0 comments

OLAP migration complexity is the cost of fast reads

https://www.fiveonefour.com/blog/olap-migration-complexity
1•oatsandsugar•13m ago•0 comments

'Initial Interest Confusion' Is More of a Vibe Than a Credible Legal Doctrine

https://blog.ericgoldman.org/archives/2025/11/initial-interest-confusion-is-more-of-a-vibe-than-a...
1•hn_acker•13m ago•1 comments

What's New for C++ Developers in Visual Studio 2026

https://devblogs.microsoft.com/cppblog/whats-new-for-cpp-developers-in-visual-studio-2026-version...
1•mariuz•14m ago•0 comments

Launch HN: JSX Tool (YC F25) – A Browser Dev-Panel IDE for React

4•jsunderland323•15m ago•1 comments

Redditor Convicted of Sharing Nude Shots in Danish 'Moral Rights' Copyright Case

https://torrentfreak.com/redditor-convicted-for-sharing-nude-scenes-in-landmark-moral-rights-copy...
1•embedding-shape•16m ago•0 comments

Beads: A coding agent memory system

https://steve-yegge.medium.com/introducing-beads-a-coding-agent-memory-system-637d7d92514a
1•mooreds•16m ago•0 comments

TuneIn sold to Canadian media group Stingray

https://radiotoday.co.uk/2025/11/tunein-sold-to-canadian-media-group-stingray/
2•vool•17m ago•1 comments

Were URLs a Bad Idea?

https://neilmadden.blog/2025/11/12/were-urls-a-bad-idea/
1•speckx•18m ago•0 comments

Show HN: FPGA Based IBM-PC-XT

https://bit-hack.net/2025/11/10/fpga-based-ibm-pc-xt/
1•bit-hack•21m ago•0 comments

Show HN: Replace Your macOS Dock

https://www.flowylabs.ai
2•talksik•22m ago•0 comments

Discovering orphaned binaries in /usr/sbin on Fedora 42

https://utcc.utoronto.ca/~cks/space/blog/linux/Fedora42OrphanUsrSbinBinaries
1•speckx•24m ago•0 comments

U of T hires three top U.S. scholars, announces $24M recruitment plan

https://www.theglobeandmail.com/canada/education/article-u-of-t-hires-three-top-us-scholars-plans...
1•Teever•24m ago•0 comments
Open in hackernews

Baidu releases open-source multimodal AI that it claims beats GPT-5 and Gemini

https://venturebeat.com/ai/baidu-just-dropped-an-open-source-multimodal-ai-that-it-claims-beats-gpt-5
5•teleforce•1h ago

Comments

bn-l•1h ago
> The model, dubbed ERNIE-4.5-VL-28B-A3B-Thinking

No way at so few parameters

verdverm•1h ago
Recent research results from many groups suggest otherwise. The lag between private models to competitive open models has been shrinking, same for the resources required to train and run them

The people who are spending billions on ai infra build outs want you to believe it's necessary, because frontier mega models are supposedly so much better. China has been showing us otherwise, especially being handicapped by export controls and showing how you can do more with less

NitpickLawyer•37m ago
> The lag between private models to competitive open models has been shrinking

It really hasn't. It's the opposite, actually. The latest breakthroughs in RL by the big4 labs haven't been replicated yet in any open model (including the latest k2-thinking). Even gemini-2.5 still delivers on generalisation in a way that no open models do, today (almost a year later). The general consensus was that "open" models were 6-8 months behind SotA, but with the RL stuff we can see they've moved further away.

I don't know what exactly it is, if it's simply RL scale, or data + scale, or better secret sauce (rewards, masking, something else) but the way these new models generalise is leagues ahead of open models, sadly.

Don't be fooled by benchmarks alone. You have to test them on problems that you own and you can be fairly sure no one is targeting for benchmark scores. Recently there was a python golfing competition on kaggle, and I tested some models on that task. While the top4 models were chugging along, in both agentic and 0shot regimes, the open models (coding specific or, older "thinking" models) were really bad at the task. 480b models, coding specific, would go in circles, get lost on one example, and so on. Night and day between the open models and gpt5/claude/gemini2.5. Even grok fast solved a lot of tasks in agentic mode.

verdverm•29m ago
While I agree with your comments here, I will note that the big 4 models were released this year (summer-ish) so we are still not at a point you can claim the open models are more than a year behind something that is not a year old yet
verdverm•1h ago
HF link: https://huggingface.co/baidu/ERNIE-4.5-VL-28B-A3B-Thinking
JSR_FDED•55m ago
I know it’s popular to hate on China right now, but can we acknowledge that Chinese companies and research groups have done more for us hackers in terms of making amazing models available with open weights for free, than US companies and research groups?