frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Tar files made in macOS generate "xattr" errors when expanded in Linux

https://aruljohn.com/blog/macos-created-tar-files-linux-errors/
1•heresie-dabord•1m ago•0 comments

The quiet layoffs China's tech giants

https://restofworld.org/2026/china-tech-layoffs-alibaba-baidu-ai-pivot/
1•giuliomagnifico•1m ago•0 comments

Show HN: Run Claude Code sessions on Linear issues via two MCP servers

https://lanes.sh/blog/linear-to-lanes
2•s-xyz•1m ago•0 comments

Americans lost $2.1B to social media scams in 2025, 8x more than in 2020

https://fortune.com/2026/04/28/ftc-social-media-scams-americans-lose-billions-meta-facebook-whats...
1•1vuio0pswjnm7•3m ago•0 comments

Tap Jewels is a classic match-3 gem game for the Amiga computer

https://www.amiga-shop.net/en/Amiga-Software/Amiga-Games/Tap-Jewels-classic-Amiga-download-versio...
1•doener•5m ago•0 comments

Do the geniuses in the datacenter get lunch breaks?

https://blog.dnmfarrell.com/post/do-the-geniuses-in-the-datacenter-get-lunch-breaks/
1•davidfarrell•5m ago•0 comments

Thales – TypeScript compiler and JavaScript engine in Lean

https://github.com/jessealama/thales
1•michaelkrem•6m ago•0 comments

Content Engineering with Claude Code

https://ahrefs.com/blog/how-i-do-content-engineering-with-claude-code/
1•eigenBasis•8m ago•0 comments

Get Your Website/API Ready for Agentic Commerce in 1 Minute

https://www.startuphub.ai/agent-readiness
1•compulsivebuild•11m ago•1 comments

Bitmap and tilemap generation from a single example

https://github.com/mxgmn/WaveFunctionCollapse
1•futurecat•14m ago•0 comments

Redshift – Rehearsing for humanity's future on Mars

https://harpers.org/archive/2026/05/redshift-elena-saavedra-buckley-mars/
1•pseudolus•16m ago•0 comments

CPanel and WHM Authentication Bypass

https://labs.watchtowr.com/the-internet-is-falling-down-falling-down-falling-down-cpanel-whm-auth...
1•teapowered•16m ago•0 comments

TIL – OS Login Guest Environment for Google Compute Engine

https://github.com/GoogleCloudPlatform/guest-oslogin
1•ankitg12•18m ago•0 comments

Oracle plans to power its New Mexico mega datacenter with 2.45GW fuel cell farm

https://www.theregister.com/2026/04/28/oracle_new_mexico_power_fuel_cell_farm/
2•pseudolus•18m ago•1 comments

Oops, I invented REST APIs

https://clemorl.fr/Articles/Oops,-I-invented-REST-APIs
1•Topy•21m ago•0 comments

Crow-CLI/crow-CLI: Minimal MCP based ACP agent

https://github.com/crow-cli/crow-cli
1•themaxdavitt•21m ago•0 comments

LocalPilot with Ollama as a Replacement for CoPilot in VS2026

https://github.com/FutureStackSolution/LocalPilot
1•voidmain0001•22m ago•0 comments

The Hanging God (Odin) [video]

https://www.youtube.com/watch?v=yizsAZ4z8xA
1•Imustaskforhelp•23m ago•0 comments

GCC 16 has been released

https://gcc.gnu.org/gcc-16/changes.html
10•HeliumHydride•24m ago•0 comments

Donating to Open Source

https://entropicthoughts.com/open-source-donation
2•birdculture•27m ago•1 comments

The Bombadil Terminal Experiment

https://wickstrom.tech/2026-04-30-bombadil-terminal-experiment.html
2•simonklee•27m ago•0 comments

Harness engineering: leveraging Codex in an agent-first world

https://openai.com/index/harness-engineering/
1•kumulo•33m ago•0 comments

For the first time in history, more Americans are moving to EU than vice versa

https://twitter.com/benbawan/status/2049303326999609846
3•akyuu•33m ago•0 comments

Show HN: Automation Resilience

https://automationresilience.com
1•dewilliamsco•33m ago•0 comments

Qwen-Scope: Official Sparse Autoencoders (SAEs) for Qwen 3.5 Models

https://qwen.ai/blog?id=qwen-scope
2•embedding-shape•33m ago•1 comments

The Grid Doesn't Need More Power Plants (It Needs This) [video]

https://www.youtube.com/watch?v=pLIatO-RA1c
1•zeristor•33m ago•3 comments

Show HN: A cozy reverse city-builder with custom shaders. Free demo on Steam

https://store.steampowered.com/app/4623880/Open_Land_Demo/
1•ralph_sc•34m ago•0 comments

Show HN: Can a person's chess mind outlive them? An attempt with 41 masters

https://playchessgate.com/
1•Tdxt•38m ago•0 comments

Pangram Is Good

https://andrewpwheeler.com/2026/04/30/pangram-is-good/
2•apwheele•38m ago•0 comments

Will Sodium-Ion Batteries Revolutionize Electric Ships?

https://cleantechnica.com/2026/04/27/will-sodium-ion-batteries-revolutionize-electric-ships/
1•xbmcuser•39m ago•0 comments
Open in hackernews

Granite 4.1: IBM's 8B Model Matching 32B MoE

https://firethering.com/granite-4-1-ibm-open-source-model-family/
95•steveharing1•1h ago

Comments

mdp2021•1h ago
Wish they also released an embedding model, in the line of their previous: compact (while good)...
RugnirViking•1h ago
sounds interesting. Here's hoping they release a 32B model, thats a pretty good sweet spot for feasibility of home setups.

edit: I just realised they do actually have a 30b release alongside this. Haven't tried it yet.

2ndorderthought•1h ago
I test drove it yesterday. It's pretty impressive at 8b. Runs on commodity hardware quickly.

Qwen3.6 35b a3b is still my local champion but I may use this for auto complete and small tasks. It has recent training data which is nice. If the other small models got fine tuned on recent data I don't know if I would use this at all, but that alone makes it pretty decent.

The 4b they released was not good for my needs but could probably handle tool calls or something

steveharing1•1h ago
Yea, No doubt Qwen 3.6 open weights are far more strong
rnadomvirlabe•53m ago
Why no doubt?
steveharing1•34m ago
Because Qwen 3.6 pushes way above its weight. Granite 8B is impressive, but Qwen still wins on raw capability, especially for coding.
actionfromafar•24m ago
Way above its weights.
drittich•14m ago
Nanobanana for scale.
rnadomvirlabe•21m ago
You just asserted the same thing again. Why do you say this is the case?
noodletheworld•4m ago
Having tried it.

Qwen is really good.

Also, generally, it makes sense. 8B models are shit.

That this 8B model is decent is impressive, but that it could perform on par with a good model 4 times as large is a daydream.

captainbland•17m ago
No comparison with competitor models other than the previous granite version strongly implies that it does not compete well with other comparable models. At least this is the most reasonable assumption until data comes out to the contrary
vessenes•12m ago
Have you tried the Gemma 4 series, out of curiosity? I haven’t run a local model in a while, but the benchmarks look good. I’d take a free local tool-use model if it was relatively consistent.
Havoc•1h ago
Interesting to see a pivot away from MoE by both IBM and mistral while the larger classes of SOTA of models all seem to be sticking to it.

Quick vibe check of it- 8B @ Q6 - seems promising. Bit of a clinical tone, but can see that being useful for data processing and similar. You don't really want a LLM that spams you with emojis sometimes...

embedding-shape•13m ago
Makes sense, dense for small models, dense or MoE for larger ones, end up fitting various hardware setups pretty neatly, no need for MoE at smaller scale and dense too heavy at large scale.
100ms•51m ago
> Full stop.

Why people don't edit out obvious sloppification and expect to still have readers left

cbg0•49m ago
So are we saying it's fine that the article is written by an LLM as long as it doesn't have the tell-tale signs of LLMs?
ramon156•38m ago
It's more about curating the things you're publishing. Why would I bother reading what you couldn't bother to read?
100ms•35m ago
I don't really see reason to complain about tool use, so long as the result is cohesive, accurate and that ultimately means a human has at least read their own output before publishing. It's a bit like receiving a supposedly personal letter that starts "Dear [INSERT_FIRST_NAME_FIELD]," are you really going to read such a thing?
HighGoldstein•16m ago
An article without telltale signs of an LLM is indistinguishable from an article written by a human, so yes.
spicyusername•8m ago
My opinion is that literature and art will continue pushing the envelope in the places they always pushed the envelope. LLMs will not change this, humans love making art, and they love doing it in new ways.

Corporate announcements were never the places that literature and art were pushing the envelope. They were slop before, and they're slop now.

wewewedxfgdf•34m ago
Third line in to the article: "But there’s one result in the benchmarks I keep coming back to."

I hear this sort of thing all the time now on YouTube from media/news personalities:

“And that’s the part nobody seems to be talking about.”

"And here's what keeps me up at night."

“This is where the story gets complicated.”

“Here’s the piece that doesn’t quite fit.”

“And this is where the usual explanation starts to break down.”

“Here’s what I can’t stop thinking about.”

“The part that should worry us is not the obvious one.”

“And that’s where the real problem begins.”

“But the more interesting question is the one no one is asking.”

“And this is where things stop being simple.”

It doesn't really worry me but I think its interesting that LLM speak sounds so distinctive, and how willing these media personalities are to be so obvious in reading out on TV what the LLM spat out.

I've never studied what LLMs say in depth is it is interesting that my brain recognises the speech pattern so easily.

bambax•17m ago
I notice this very often in LinkedIn posts, and it's annoying, but I had not realized it was LLM-speak? Isn't it possible that people write like this naturally?
trvz•12m ago
Yes. Some people are very trigger happy in attributing human slop to LLMs.
spicyusername•11m ago
Arguably it's exactly because it was used naturally so often that the LLMs parrot it so frequently.
wewewedxfgdf•10m ago
I think LLM's have that sort of "summarise, wrap it in a bow tie, give a little dramatic punch as a preview to the next few points".
frereubu•17m ago
I think this kind of language predates widespread LLM use, and has been picked up from that kind of writing. It's a "and here's where it gets interesting" pattern that people like Malcolm Gladwell and Freakonomics have used, even if the same thing could be said in a way that makes it sound much less intriguing.
cwillu•14m ago
There's even a word for it: “cliché”
jmbwell•14m ago
The language of drama and import without meaningful substance. Words statistically likely to be used in a segue, regardless of the preceding or subsequent point. Particularly effective when it seems like you’re getting let in on a secret. Really fatiguing to read

A writing teacher once excoriated me for saying that something was important. “Don’t tell me it’s important, show me, and let me decide, and if you do your job I’ll agree”

I don’t know how a completion can tell when it needs to do this. Mostly so far it doesn’t seem capable

crunis•34m ago
Are you referring to the literal use of the expression "full stop"? I don't see it anymore in the article, maybe they edited it out?
cbg0•43m ago
The real "sleeper" might be https://huggingface.co/ibm-granite/granite-vision-4.1-4b if the benchmarks hold up for such a small model against frontier models for table & semantic k:v extraction.
tosh•30m ago
IBM announcement: https://research.ibm.com/blog/granite-4-1-ai-foundation-mode...
agunapal•30m ago
If you really think about why MoE came into existence, its to save significant cost during training, I don't think there was any concrete evidence of performance gains for comparable MoE vs dense models. Over the years, I believe all the new techniques being employed in post training have made the models better.
zozbot234•15m ago
MoE models will have far more world knowledge than dense models with the same amount of active parameters. MoE is a no-brainer if your inference setup is ultimately limited by compute or memory throughput - not total memory footprint - or alternately if it has fast, high-bandwidth access to lower-tier storage to fetch cold model weights from on demand.
vessenes•14m ago
I think you mean inference compute? I believe all expert weights are updated in each backward pass during MoE training. The first benefit was getting a sort of structured pruning of weights through the mechanism of expert selection so that the model didn’t need to go through ‘unnecessary’ parts of the model for a given token. This then let inference use memory more efficiently in memory constrained environments, where non-hot or less common experts could be put into slow RAM, or sometimes even streamed off storage.

But I don’t think it necessarily saved training cost; if it did, I’d be interested to learn how!