Agentic AI Foundation

https://block.xyz/inside/block-anthropic-and-openai-launch-the-agentic-ai-foundation
129•thinkingkong•1mo ago

Comments

ChrisArchitect•1mo ago
More discussion: https://news.ycombinator.com/item?id=46207425
flakiness•1mo ago
So I'll focus on Block's contribution, which is Goose: https://github.com/block/goose

Has it gotten better and good enough? I tried it a few months back and it was pretty crappy. And it's not because of bad models (I used it with the latest Claude at that time) but because of poor harness implementation and UI.

Is it worth trying the latest version to see how it compares with Claude Code? I want an OSS, model-agnostic implementation to win, but I felt the odds were off then. I'd be happy to be proven wrong.

maelito•1mo ago
Tried Goose, couldn't do anything. OpenCode is better. Mistral's Vibe too.
pzo•1mo ago
I tried it as well a few months ago and also uninstalled it. The application didn't, at least back then, have an update feature, so for each new release you had to reinstall it. The UI experience was also very poor compared to many other open source projects; I would expect them to at least hire a designer.
eagleinparadise•1mo ago
Goose was super jank when I last tried it. Not worth a look
rbren•1mo ago
The OpenHands CLI has had some major improvements since v1: https://github.com/OpenHands/OpenHands

MIT license and model agnostic

I’d also keep a close eye on Toad which is launching this month:

https://willmcgugan.github.io/announcing-toad/

ewoodrich•1mo ago
I use OpenCode as my main CLI tool at this point, falling back to Claude Code and Codex as needed. It's really solid these days, highly recommend.

I use Claude Sonnet 4.5, Gemini 3 Pro Preview, and GPT 5/5-mini with great results on OC. I initially tried it so I could decouple from VS Code extensions while still using my Github Copilot plan like I had been with Roo Code/Kilo Code, but have branched out to also using it with the Claude Code backend and their free models as they come and go.

Definitely worth trying if you haven't picked it up recently.

clhodapp•1mo ago
Open source rarely wins at the start.

Instead, its strength tends to be a continued improvement over the long term, in a way that commercial software just can't sustain because it needs to show a return on investment.

csomar•1mo ago
No. It didn't get any better. To be honest, the only working agentic tool out there is Claude Code. All other tools seem to confuse the model and not help much. Also bugs are a real problem (with gemini cli, it's really pathetic). Not that Claude Code doesn't have dozens of issues but the bar in this niche is set really low.
theshrike79•1mo ago
I've had decent experiences with Crush and GLM-4.6, I think the magic sauce is the fact that they have language server support that seems to make it somehow smarter.

TBH I mostly use it for very specific changes and not massive new features, those I do with Claude plan mode.

adrianfcole•1mo ago
I feel you and I also think this is a tough time. I routinely use Claude even though he typically ignores me ;) I'm also a maintainer of Goose and have been engaged since before the rust rewrite. I'll admit I don't get exactly what I want out of any agent. I also don't buy the "it is you" thing that is typical with Agents. We often need to know too many things and are defensive in how we act. I truly hope this is temporary.

OK, back to the point. Block is not trying to sell a frontier model, or Goose at all. As an open source enthusiast, I like this model (no pun intended). Features go where the prominent site or key contributors want, vs a commercial agenda. To get more practical, it was Goose folks themselves who put themselves out there in tbench.ai and remain in the top 10

https://www.tbench.ai/leaderboard/terminal-bench/2.0

Does this invalidate poor experiences on use cases? No way. However, there's a lot of work being done by Block folks to help teach and share practice and get things together. I'm always looking for pure local everything and Mic is also super keen on this. Today? Well, it is like watching someone type each character at a time while your laptop melts. I don't think this invalidates the long term, but it acknowledges the short term.

Next, Goose doesn't care about you in a specific way. Literally there is a Claude agent so you can swap out the goosey parts if you like. It is clunky and I'm personally looking into aligning that interop via Zed's ACP. I think like the combination of openness and not having any angle.. like not anti claude, literally give you a way to use it.. is telling.

This is a ramble and maybe a waste of your context, but I hope it colors some things and will get to see you around.

sheikhlimon•1mo ago
Adding a quick note as someone who contributes to Goose but isn’t a maintainer. I agree with a lot of what you’re saying. The harness and overall UX have changed quite a bit recently, though it’s still very much evolving like everything else in this space. If anyone tried it a while back, the newer versions are worth a look. And any issues people hit in practice are genuinely useful for us to improve things.

Appreciate the thoughtful take here.

matt_daemon•1mo ago
Spot the odd one out
kordlessagain•1mo ago
Such irony given Anthropic is hostile to open sourcing their agent frameworks like Claude Desktop and CLI.
theshrike79•1mo ago
There's a reason why their CLI is the best, there is some magic sauce in there the others haven't copied yet.
N_Lens•1mo ago
Forgive me for underestimating but I'd never heard of 'Block' before, and the title "Block, Anthropic, and OpenAI Launch the Agentic AI Foundation" reads a bit funny to me. Plus the bitcoin blurb on their site is also worth a chuckle -

"Bitcoin is about access, not speculation. It’s designed to be open, fast, low-cost, and free from centralized control. It’s meant to empower people"

bdangubic•1mo ago
> "Bitcoin is about access, not speculation. It’s designed to be open, fast, low-cost, and free from centralized control. It’s meant to empower people"

This is the most amazing paragraph I think I have ever read, pure gold!

lkbm•1mo ago
Block is the new name for Square, also of CashApp.
daveguy•1mo ago
Yeah... Still gonna use my credit card or debit card.
bigmadshoe•1mo ago
Square is primarily a payment platform so you probably have used your credit or debit card thousands of times with them already.
daveguy•1mo ago
Yup. Credit. Or Debit. They can play with crypto behind the scenes all they want. That doesn't make it worth anyone's time.
jazzyjackson•1mo ago
Square is likely the POS terminal you've been using your card on. They pioneered those neat headphone jack adaptors that let small businesses use their iPhone to take payment years before tapping phones together was a thing. Not a bad business, made Jack Dorsey rich, now he gets to play around with crypto junk.
lrkerhn2•1mo ago
Same here. Never heard of them before. But the more you look around on their site, the more it feels like some parody company. They seem to be into everything - bitcoin, blockchain, decentralized this, decentralized that, something called TBD, then Web5 (what even is that?)

https://blog.identity.foundation/block-contributes-to-dif/

https://tbd.website/

This company seems to be mocking itself.

pests•1mo ago
Square renamed themselves to Block. You might have heard of CashApp for example. Their CEO is Jack Dorsey. They have POS terminals and credit card readers.
nticompass•1mo ago
Even the domain https://block.xyz seemed questionable when I first saw this post.
jmathai•1mo ago
That website reminds me of the one…where a guy sold pixels on a website for like a buck and people basically bought ads.

https://en.wikipedia.org/wiki/The_Million_Dollar_Homepage

N_Lens•1mo ago
Yes it does feel a bit grift adjacent.
PierceJoy•1mo ago
The guy who created that page actually went on to found the Calm app, which has a multi billion dollar valuation now.