'Western Qwen': IBM Wows with Granite 4 LLM Launch and Hybrid Mamba/Transformer

https://venturebeat.com/ai/western-qwen-ibm-wows-with-granite-4-llm-launch-and-hybrid-mamba-transformer
83•2bluesc•4mo ago

Comments

baobun•4mo ago
IBM announcement post is more informative than venturebeat

IBM Granite 4.0: hyper-efficient, high performance hybrid models for enterprise

https://www.ibm.com/new/announcements/ibm-granite-4-0-hyper-...

flowerthoughts•4mo ago
ISO 42001 certified.

> ISO/IEC 42001 is an international standard that specifies requirements for establishing, implementing, maintaining, and continually improving an Artificial Intelligence Management System (AIMS) within organizations. It is designed for entities providing or utilizing AI-based products or services, ensuring responsible development and use of AI systems.

https://www.iso.org/standard/42001

If anyone has access to ISO standards, I'm really curious what the practical effects of that certification are. I.e. what things does Granite have that others don't, because they had to add/do them to fulfill the certification?

The committee was formed in 2017, chaired by an AI expert: https://www.iso.org/committee/6794475.html

PeterStuer•4mo ago
Depends. In my experience, some countries, e.g. Spain, are very into certs while others just ignore them.
magicalhippo•4mo ago
They also have a nice write-up on the Mamba architecture:

https://www.ibm.com/think/topics/mamba-model

EagnaIonat•4mo ago
Tried out the Ollama version and it's insanely fast with really good results for 1.9GB size. Supposed to have a 1M context window, would be interested where the speed goes then.

No Mamba in the Ollama version though.

Flere-Imsaho•4mo ago
(I've only just started running local LLMs so excuse the dumb question).

Would Granite run with llama.cpp and use Mamba?

RossBencina•4mo ago
Last I checked Ollama inference is based on llama.cpp so either Ollama has not caught up yet, or the answer is no.

EDIT: Looks like Granite 4 hybrid architecture support was added to llama.cpp back in May: https://github.com/ggml-org/llama.cpp/pull/13550

magicalhippo•4mo ago
> Last I checked Ollama inference is based on llama.cpp

Yes and no. They've written their own "engine" using GGML libraries directly, but fall back to llama.cpp for models the new engine doesn't yet support.

mehdibl•4mo ago
Ollama usually defaults to Q4 and an 8k/16k context, not the 1M context.
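
For anyone who wants a larger window than the default, here is a minimal sketch of asking for one per request through Ollama's local HTTP API, using the `num_ctx` option. The model tag `granite4` and the 32768 value are placeholders assumed for illustration, not something confirmed in this thread; check `ollama list` for the tag you actually pulled, and expect memory use to grow with the window.

```python
import json
import urllib.request

# Ollama's default local endpoint; adjust if your server listens elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model: str, prompt: str, num_ctx: int) -> dict:
    """Build a non-streaming generate request with an explicit context window."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        # num_ctx overrides the default context length for this request only.
        "options": {"num_ctx": num_ctx},
    }


def generate(payload: dict, url: str = OLLAMA_URL) -> str:
    """Send the request to a running Ollama server and return the generated text."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]


# "granite4" is an assumed placeholder tag, not taken from the thread.
payload = build_request("granite4", "Summarize this log...", num_ctx=32768)
# Call generate(payload) once an Ollama server is running locally.
```

Building the payload separately from sending it makes it easy to check what will be sent before a server is involved.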
serioussecurity•4mo ago
Every technical paper I've read that IBM has published at an ML conference has been p-hacked to hell. Stay away.
soganess•4mo ago
Links? Maybe just paper titles?
danielhanchen•4mo ago
I made some dynamic GGUFs for the 32B MoE model! Try:

./llama.cpp/llama-cli -hf unsloth/granite-4.0-h-small-GGUF:UD-Q4_K_XL

Also a support agent finetuning notebook with granite 4: https://colab.research.google.com/github/unslothai/notebooks...

anshumankmr•4mo ago
You guys are lightning fast. Did you folks have access to the model weights beforehand or something, if you don't mind me asking?
danielhanchen•4mo ago
Oh thanks! Yes sometimes we get early access to some models!
incomingpain•4mo ago
As always, you're awesome. keep up the great work!
danielhanchen•4mo ago
Thanks!
aetherspawn•4mo ago
I really just want to know how it compares to ChatGPT and Claude at various tasks, but there aren’t any graphs for that.
KronisLV•4mo ago
It will probably take a few days or a week for some in-depth benchmarks to start popping up.

The IBM article has this image showing that it's supposed to be a bit ahead of GPT OSS 120B for at least some tasks (horrible URL but oh well): https://www.ibm.com/content/dam/worldwide-content/creative-a...

So in general it's going to be worse than GPT-5 and also Sonnet 4.5, but closer to GPT-5 mini. At least you can run this on-prem, unlike the others. Pretty good, could possibly replace Qwen3 for quite a few use cases!

KronisLV•4mo ago
Edit: or perhaps not, seems like 3rd party benchmarks aren't as positive.
anshumankmr•4mo ago
Also worth checking out is Codestral... I think it had a 256k context and used Mamba, even if it is a slightly older model now... it worked great for a Text2SQL use case we worked on.
incomingpain•4mo ago
Magistral 2509 just came out. It super slows down when you go over 40,000 context. It's quite a fantastic model.
incomingpain•4mo ago
"Small" is 32B total with 9B active (a9b), 19GB @ Q4_K_XL

20GB @ 100,000 context.
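
A rough back-of-the-envelope check of those two figures, as a sketch only. It assumes GB means 10^9 bytes, that the parameter count is exactly 32 billion, and that the whole 1GB difference between the two numbers is context overhead; the inputs are the rounded values above, so treat the outputs as ballpark.

```python
def implied_bits_per_weight(file_size_gb: float, params_billions: float) -> float:
    """Average bits stored per parameter, given file size and parameter count."""
    size_bits = file_size_gb * 1e9 * 8
    return size_bits / (params_billions * 1e9)


def context_bytes_per_token(total_gb: float, weights_gb: float, tokens: int) -> float:
    """Extra memory per token of context, assuming the rest is all weights."""
    return (total_gb - weights_gb) * 1e9 / tokens


bpw = implied_bits_per_weight(19, 32)
per_token = context_bytes_per_token(20, 19, 100_000)

print(f"{bpw:.2f} bits per weight")          # 4.75 bits per weight
print(f"{per_token:.0f} bytes per token")    # 10000 bytes per token
```

About 4.75 bits per weight is in the range you would expect from a 4-bit quant that keeps some tensors at higher precision.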

But for some reason... LM Studio isn't loading it onto the GPU for me?

I just updated to 0.3.28 and it still won't load onto the GPU.

Switching from Vulkan to ROCm. It's now working properly?

https://docs.unsloth.ai/new/ibm-granite-4.0

Fantastic work from unsloth folks as usual.

As it's running in roo code, it's using more like 26GB of vram.

~30TPS

Roo code does not work with it.

Kilo code next. It seems to be about 22GB of vram.

Kilo code works great.

The model however didn't 1 shot my first benchmark. That's pretty bad news for this model given magistral 2509 or apriel 15b are better.

Better on pass 2, still no 100%

3rd pass achieved.

I'm predicting it'll be around 30% on LiveCodeBench. Probably like 15% on Aider polyglot. Very disappointed in its coding capability.

I just found:

https://artificialanalysis.ai/models/granite-4-0-h-small

25.1% on livecodebench. Absolutely deserved.

2% terminal bench.

16% on coding index. Completely deserved.

thawab•4mo ago
After getting burned by Watson, I am not touching any AI from IBM.
stirfish•4mo ago
Tell us more about how you were burned, please!
arthurcolle•4mo ago
It's a file you can run on your computer