frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Laguna XS.2 and M.1

https://poolside.ai/blog/laguna-a-deeper-dive
46•tosh•1h ago

Comments

rohitpaulk•1h ago
Been testing these via their "pool" agent. It's fast, and the agent adheres to the ACP spec pretty well (better than codex, opencode etc.) so it's a good experience in Zed.
throwaw12•1h ago
Has anyone tried these models?

I like their honesty in benchmarks, looks like Qwen3.6 35B is outperforming their Laguna M.1 225B model

kingjimmy•1h ago
the color-codes make those benchmarks charts impossible to understand. very pretty though.
data-ottawa•1h ago
For what it's worth, the bars correspond in order with the legend. Plus there’s hover text.
franksiem•1h ago
Felt like they would never come out of stealth mode but very nice to see it materialized into something competitive.
refulgentis•39m ago
What makes them distinctive?
throwaw12•39m ago
Not sure if this is competitive, look at the numbers for Qwen3.6
jaen•1h ago
For similarly sized models, not looking very good on the slightly-less-benchmaxxed Terminal-Bench 2.0:

  Laguna XS.2  33B-A3B params: 30.6
  Qwen 3.6     35B-A3B       : 51.5
  Devstral 2   123B          : 31.2
Quite a huge lead for Qwen... well, at least it's catching up to other smaller Western labs.
megavon•55m ago
Need to look at SWEBench-Pro, it's super competitive. Suspect they'll catch up given the longer-tail on TB scores.
jaen•39m ago
Just by the (lack of) inter-model variance, I don't think SWEBench-Pro does a very good job of representing model capability. Terminal-Bench seems more challenging and separates the wheat from the chaff.

Also, *ops work, which in my experience can actually be more complicated than SWE is underrepresented there obviously.

speedgoose•1h ago
Please update the charts. Consider using textures or filling patterns.

I usually score pretty well in colour perception tests but distinguishing between those two purples made me doubt myself.

matthewfcarlson•43m ago
My phone is in grayscale to make it less interesting (I still watch way too many videos in grayscale but it helps) so I’m right with you
esafak•33m ago
They're not winning any popular benchmark. Is there some niche where it excels?
vmarkovtsev2•3m ago
Well there are benchmarks, and there is real experience, right? They are not the same.
gslepak•24m ago
Very cool to see more small open models being worked on!

One nit: I've seen on this homepage, and many others, this notion that the people behind the models are "working towards AGI".

I get that this is marketing speak, but transformers are not AGI, and they will never be AGI, so it'd be great if people stopped saying that as it sort of wears out the meaning of "working towards AGI".

altruios•16m ago
What does AGI mean to you?

Transformers have approximate knowledge of many things. Is this not 'general'? Where is the goalpost here?

gslepak•9m ago
> Transformers have approximate knowledge of many things. Is this not 'general'?

Of course not. That's like saying the Encyclopedia Britannica is AGI.

> What does AGI mean to you?

I would define AGI as human-like machine intelligence (or superior).

This is difficult for some people to understand because they don't understand what "human-like" means in the first place. Neuroscientists would be able to set some of these wayward computer scientists straight on this question.

liuliu•16m ago
> but transformers are not AGI, and they will never be AGI

Like the claim "transformers are AGI", this needs proof, otherwise should be prefixed "I think". And honestly, positive proof is easier than negative proof (you just need to make one transformer model that is a AGI, whereas the never claim requires you to enumerated all possibilities).

gslepak•12m ago
That's like saying we should wait for positive proof of AGI from combustion engines. That'll never happen, no matter how much you tweak the engine. It's just not possible.

The negative proof is there in the definition itself. Transformers are not AGI, they're frozen human intelligence of the autocomplete variety. That can never be AGI and anyone who says otherwise doesn't understand transformers or AGI.

AISLE Discovers 38 CVEs in OpenEMR Healthcare Software

https://aisle.com/blog/aisle-discovers-38-critical-security-vulnerabilities-in-healthcare-softwar...
125•mmsc•2h ago•72 comments

Localsend: An open-source cross-platform alternative to AirDrop

https://github.com/localsend/localsend
555•bilsbie•6h ago•189 comments

BookStack Moves from GitHub to Codeberg

https://github.com/BookStackApp/BookStack/issues/4551
41•RadiozRadioz•43m ago•1 comments

Microsoft VibeVoice: Open-Source Frontier Voice AI

https://github.com/microsoft/VibeVoice
236•tosh•6h ago•145 comments

Laguna XS.2 and M.1

https://poolside.ai/blog/laguna-a-deeper-dive
46•tosh•1h ago•19 comments

Show HN: Live Sun and Moon Dashboard with NASA Footage

https://www.lumara-space.app/
103•beeswaxpat•4h ago•27 comments

Google and Pentagon reportedly agree on deal for 'any lawful' use of AI

https://www.theverge.com/ai-artificial-intelligence/919494/google-pentagon-classified-ai-deal
172•granzymes•2h ago•155 comments

Infisical (YC W23) Is Hiring Full Stack Software Engineers (Remote)

https://jobs.ashbyhq.com/infisical/782b9da8-20e1-48b2-919e-6c5430c58628
1•vmatsiiako•1h ago

I have officially retired from Emacs

https://nullprogram.com/blog/2026/04/26/
65•Fudgel•2d ago•36 comments

Who owns the code Claude Code wrote?

https://legallayer.substack.com/p/who-owns-the-claude-code-wrote
94•senaevren•6h ago•112 comments

GitHub Copilot code review will start consuming GitHub Actions minutes

https://github.blog/changelog/2026-04-27-github-copilot-code-review-will-start-consuming-github-a...
164•whtsky•9h ago•116 comments

FCC Funding Application Notes Paramount Will Be 49.5% Foreign-Owned Post-Merger

https://deadline.com/2026/04/paramount-fcc-request-wbd-merger-middle-east-1236873732/
106•throw0101c•2h ago•50 comments

Things C++26 define_static_array can't do

https://quuxplusone.github.io/blog/2026/04/24/define-static-array/
12•jandeboevrie•2d ago•1 comments

Deep under Antarctic ice, a long-predicted cosmic whisper breaks through

https://phys.org/news/2026-04-deep-antarctic-ice-cosmic-strange.html
83•rbanffy•1d ago•35 comments

GitHub Actions is the weakest link

https://nesbitt.io/2026/04/28/github-actions-is-the-weakest-link.html
115•dochtman•6h ago•22 comments

Talkie: a 13B vintage language model from 1930

https://talkie-lm.com/introducing-talkie
570•jekude•20h ago•234 comments

GitHub RCE Vulnerability: CVE-2026-3854 Breakdown

https://www.wiz.io/blog/github-rce-vulnerability-cve-2026-3854
23•bo0tzz•1h ago•11 comments

ASML became the chokepoint for cutting-edge chips

https://worksinprogress.co/issue/the-worlds-most-complex-machine/
255•mellosouls•3d ago•151 comments

AI's Economics Don't Make Sense

https://www.wheresyoured.at/ais-economics-dont-make-sense/
89•spking•1h ago•48 comments

Anthropic Joins the Blender Development Fund as Corporate Patron

https://www.blender.org/press/anthropic-joins-the-blender-development-fund-as-corporate-patron/
180•Philpax•2h ago•150 comments

PyWry: Cross-Platform Rendering Engine in Python

https://deeleeramone.github.io/PyWry/
21•filipovic•1d ago•5 comments

UAE Leaves OPEC and OPEC+

https://www.reuters.com/markets/commodities/uae-says-it-quits-opec-opec-statement-2026-04-28/
268•TechTechTech•4h ago•134 comments

Can You Find the Comet?

https://apod.nasa.gov/apod/ap260427.html
119•ColinWright•1d ago•74 comments

I Spent My Sabbatical Building a Power Meter for Sledgehammers

https://leblancfg.com/intensity-pad-founder-story.html
67•alin23•1d ago•48 comments

After Spain's blackout, its shift to renewables and grid evolution power on

https://www.theguardian.com/world/2026/apr/28/blackout-spain-renewable-energy-grid-solar-wind
42•lentil_soup•2h ago•6 comments

Physicists Discover the Most Complex Forms of Ice Yet

https://www.quantamagazine.org/physicists-discover-the-most-complex-forms-of-ice-yet-20260427/
8•ibobev•2h ago•2 comments

Voice Modems

https://computer.rip/2026-04-26-voice-modems.html
56•K7PJP•1d ago•7 comments

Cybersec is a thankless job: expanding workload and shrinking pay packet

https://www.theregister.com/2026/04/27/from_a_massive_skills_gap/
38•rustoo•2h ago•18 comments

WASM is not quite a stack machine

https://purplesyringa.moe/blog/wasm-is-not-quite-a-stack-machine/
139•signa11•13h ago•42 comments

The predictable failure of the QDay Prize

https://algassert.com/post/2601
49•firefly284•2d ago•4 comments