frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Are AI agents ready for the workplace? A new benchmark raises doubts

https://techcrunch.com/2026/01/22/are-ai-agents-ready-for-the-workplace-a-new-benchmark-raises-do...
1•PaulHoule•39s ago•0 comments

AI Watermark and Stego Scanner

https://ulrischa.github.io/AIWatermarkDetector/
1•ulrischa•1m ago•0 comments

Clarity vs. complexity: the invisible work of subtraction

https://www.alexscamp.com/p/clarity-vs-complexity-the-invisible
1•dovhyi•2m ago•0 comments

Solid-State Freezer Needs No Refrigerants

https://spectrum.ieee.org/subzero-elastocaloric-cooling
1•Brajeshwar•2m ago•0 comments

Ask HN: Will LLMs/AI Decrease Human Intelligence and Make Expertise a Commodity?

1•mc-0•3m ago•0 comments

From Zero to Hero: A Brief Introduction to Spring Boot

https://jcob-sikorski.github.io/me/writing/from-zero-to-hello-world-spring-boot
1•jcob_sikorski•3m ago•0 comments

NSA detected phone call between foreign intelligence and person close to Trump

https://www.theguardian.com/us-news/2026/feb/07/nsa-foreign-intelligence-trump-whistleblower
3•c420•4m ago•0 comments

How to Fake a Robotics Result

https://itcanthink.substack.com/p/how-to-fake-a-robotics-result
1•ai_critic•4m ago•0 comments

It's time for the world to boycott the US

https://www.aljazeera.com/opinions/2026/2/5/its-time-for-the-world-to-boycott-the-us
1•HotGarbage•5m ago•0 comments

Show HN: Semantic Search for terminal commands in the Browser (No Back end)

https://jslambda.github.io/tldr-vsearch/
1•jslambda•5m ago•1 comments

The AI CEO Experiment

https://yukicapital.com/blog/the-ai-ceo-experiment/
2•romainsimon•7m ago•0 comments

Speed up responses with fast mode

https://code.claude.com/docs/en/fast-mode
3•surprisetalk•10m ago•0 comments

MS-DOS game copy protection and cracks

https://www.dosdays.co.uk/topics/game_cracks.php
3•TheCraiggers•11m ago•0 comments

Updates on GNU/Hurd progress [video]

https://fosdem.org/2026/schedule/event/7FZXHF-updates_on_gnuhurd_progress_rump_drivers_64bit_smp_...
2•birdculture•12m ago•0 comments

Epstein took a photo of his 2015 dinner with Zuckerberg and Musk

https://xcancel.com/search?f=tweets&q=davenewworld_2%2Fstatus%2F2020128223850316274
7•doener•12m ago•2 comments

MyFlames: Visualize MySQL query execution plans as interactive FlameGraphs

https://github.com/vgrippa/myflames
1•tanelpoder•14m ago•0 comments

Show HN: LLM of Babel

https://clairefro.github.io/llm-of-babel/
1•marjipan200•14m ago•0 comments

A modern iperf3 alternative with a live TUI, multi-client server, QUIC support

https://github.com/lance0/xfr
3•tanelpoder•15m ago•0 comments

Famfamfam Silk icons – also with CSS spritesheet

https://github.com/legacy-icons/famfamfam-silk
1•thunderbong•15m ago•0 comments

Apple is the only Big Tech company whose capex declined last quarter

https://sherwood.news/tech/apple-is-the-only-big-tech-company-whose-capex-declined-last-quarter/
2•elsewhen•19m ago•0 comments

Reverse-Engineering Raiders of the Lost Ark for the Atari 2600

https://github.com/joshuanwalker/Raiders2600
2•todsacerdoti•20m ago•0 comments

Show HN: Deterministic NDJSON audit logs – v1.2 update (structural gaps)

https://github.com/yupme-bot/kernel-ndjson-proofs
1•Slaine•24m ago•0 comments

The Greater Copenhagen Region could be your friend's next career move

https://www.greatercphregion.com/friend-recruiter-program
2•mooreds•24m ago•0 comments

Do Not Confirm – Fiction by OpenClaw

https://thedailymolt.substack.com/p/do-not-confirm
1•jamesjyu•24m ago•0 comments

The Analytical Profile of Peas

https://www.fossanalytics.com/en/news-articles/more-industries/the-analytical-profile-of-peas
1•mooreds•25m ago•0 comments

Hallucinations in GPT5 – Can models say "I don't know" (June 2025)

https://jobswithgpt.com/blog/llm-eval-hallucinations-t20-cricket/
1•sp1982•25m ago•0 comments

What AI is good for, according to developers

https://github.blog/ai-and-ml/generative-ai/what-ai-is-actually-good-for-according-to-developers/
1•mooreds•25m ago•0 comments

OpenAI might pivot to the "most addictive digital friend" or face extinction

https://twitter.com/lebed2045/status/2020184853271167186
1•lebed2045•26m ago•2 comments

Show HN: Know how your SaaS is doing in 30 seconds

https://anypanel.io
1•dasfelix•27m ago•0 comments

ClawdBot Ordered Me Lunch

https://nickalexander.org/drafts/auto-sandwich.html
3•nick007•27m ago•0 comments
Open in hackernews

How I use Claude Code to implement new features in an existing complex codebase

https://www.sabrina.dev/p/ultimate-ai-coding-guide-claude-code
74•plentysun•6mo ago

Comments

campbel•6mo ago
The author describes a lot of directives they found useful. Be aware Claude has the concept of `slash commands` https://docs.anthropic.com/en/docs/claude-code/slash-command...
AndyNemmity•6mo ago
[flagged]
mattlondon•6mo ago
What are hooks and subagents then?
mephitix•6mo ago
Maybe you should ask Claude ;)

https://docs.anthropic.com/en/docs/claude-code/hooks

https://docs.anthropic.com/en/docs/claude-code/sub-agents

chank•6mo ago
> https://docs.anthropic.com/en/docs/claude-code/hooks

> https://docs.anthropic.com/en/docs/claude-code/sub-agents

AndyNemmity•6mo ago
.
_betty_•6mo ago
It's from July 6, so yeah a month in the past is totally right.

She has newer posts on sub agents

XenophileJKO•6mo ago
Actually please write an article about how you are currently using it then. I've been actively using it and working with sub-agents. I feel like there are very little resources out there on more complex Claude Code setups.

I would love to see other setups, like what MCP, hooks, sub-agents, commands, etc.

AndyNemmity•6mo ago
.
weitendorf•6mo ago
We’ve been building in this direction for a bit over a year now. I don’t see your contact info but please reach out over email if you’re interested in checking out our approach and maybe working with us!
AndyNemmity•6mo ago
.
weitendorf•6mo ago
It’s not the specific agent config or anything like that I’m after, just the genuine interest+enthusiasm in an area we’re actively working on. The central premise to what we’re building is our system for reusable and composable AI workflows.

Shit’s moving fast and the math works out such that it’s probably better to just find someone at the bleeding edge and offer to pay them really generously to work with us than to recruit for this work normally. If you change your mind you can find my email on my profile!

Edit for more context: We’ve basically had the same thing as Claude’s slash commands (we call them workflows) since November, but then we switched gears and made our cloud IDE/agent sandbox and other components of our infrastructure layer the top priority.

That’s working and the timing is amazing since we should be able to just glue all these new cli (sub) agents in between our workflows and agent infrastructure. I suspect someone who understands terminal agents well could do it about 4x faster than me.

AndyNemmity•6mo ago
.
XenophileJKO•6mo ago
Hmm.. I work at LinkedIn.. and I didn't get the magic invite. :(
derektank•6mo ago
>Apologies, I'm not an article writer. I focus all my time on improving my own setup.

Maybe that's why the guides you see appear a bit out of date? Writing, all kinds of communicating, takes time, both in organizing and compiling your thoughts

ToJans•6mo ago
Wow, that is a huge instruction set.

I've created a (way smaller) "/hire" command that does something similar, but I should probably turn it into an agent as well, as the commao is only creating agents, and I still need to do further adaptation with individual promoting and edits

It's these little, but crucial insights that make all the difference, so thank you!

I have the exact same feeling about losing time, for me it's starting to turn into an addiction,

I'm buiding a new side product, and the sense of urgency combined with the available capability makes it hard for me to stop working.

Progress is going so fast that it feels like the competition might catch up any time now.

I now restrained myself upfront with predefined timing windows for work, so the I manage to keep my sanity & social life from disappearing...

"What a great time to be alive"

poisonborz•6mo ago
The situation is of your own making. You can change/get out any time.
gexla•6mo ago
I think the basic issue for me is that people put this stuff out as if it's some big discovery, and yet my own usage is way different and serves me just fine thank you. These have the feel of a developer who has just discovered development and then want to tell you about all the best tools. But it's not really about showing you the best tools, is it? Rather, it's about riding the hype train and creating slop for more eyeballs.
bathtub365•6mo ago
When someone has a YouTube channel about programming with a certain level of polish they are not trying to be a professional programmer, they are trying to be a professional YouTuber.
weitendorf•6mo ago
Respectfully, I think you are being reflexively contrarian or overindexing on the vibe coding hype train. The way vibe coding UX works is basically just having LLMs guess what they need to do to complete a task and then sending it off to do it and try to fix anything that doesn't go according to plan along the way. Nothing big there.

I had started working on an AI devtool product a few months before Cursor took off, I didn't even know about them when I first started, and I hated that such a dumb UX was setting the narrative in this space. LLMs had essentially no ability to decompose and plan tasks at the time, and they weren't fucking sandboxing it!

Terminal agents are actually moving towards the UX I've been building/anticipating for. In March of 2024 I was playing around with GPT4 and saw it oneshot a microservice I asked it to make. I was so excited about the implications of where this stuff could go that I quit my job at google just to start building in this space.

Without getting into all the details, I am pretty convinced there must be some particular way of arranging infrastructure primitives and AI-coding tools in a way that properly decomposes and executes arbitrarily large or complicated tasks (limited only by available time and resources). Claude code is IMO getting closer to that by the week. No iterative change is crazy science or anything but there are some genuinely novel and exciting patterns for computing things underway.

keerthiko•6mo ago
to be fair, the linked youtube video from the article is literally 4 weeks old, so "at least a month in the past" is probably exactly accurate if you are moving at the same pace as the author.
_andrei_•6mo ago
> If you don't discuss hooks and subagents, i'm not sure what you're doing right now.

Agents have their own context and can be useful for tasks that can be parallelized, which is a minority of tasks. How are they critical to better performance for you?

AndyNemmity•6mo ago
This is the best question anyone has asked.

Let's consider context. At some level the more context you have is good. At some level, the more irrelevant context you have is bad.

Okay. We have at top level of context, a hook that forces a system prompt on every action.

Next level we have a ./claude/CLAUDE.md then we have the project level CLAUDE.md then we have a possible not required agent setup then we have the instructions you give it

We know that CLAUDE.md gets lost in the context, at any level. The system prompt level hooks don't.

Why does the CLAUDE.md get lost? Why are we losing ability with a longer context.

The problem is irrelevant context to the action. The Documentation agent doesn't require the Golang modernization rules. The Golang agent, doesn't require the planing coordinator rules.

So the question I asked myself last weekend was, what is the experience if you split the contexts to only the required information for the task.

I did head to head battles with agents, reading in the information, versus contextual specific information. The agents with context specific destroyed the competition. Like it was another world.

So then I ran head to head tests on the type of information. Etc etc. My current setup is the best level achieved in those tests.

So my argument is that removing the context that is entirely irrelevant for the agent improves performance dramatically.

But I'm one person doing tests... it's true for me. Maybe it's not true for others. People have to explore the conception and determine that.

I can only tell you what has worked best for me, and for me, it's like a model jump in performance improvements.

dSebastien•6mo ago
It's been my experience too, already prior to support for sub-tasks/agents.

I've been exploring the ways in which I could "lazy load" context depending on the task at hand. I've approached this by defining an AI "receptionist", a set of roles and a set of tasks.

The receptionist is the entrypoint of the system. It knows where to find roles/tasks but only loads very basic information about the roles.

Depending on the role I ask for, it then loads the right role file, which in turn loads additional context. Then I ask for whatever i want to do, and it loads specific task files based on what I asked. And the task files load additional context etc.

This works quite well because I end up with just the right context for the task at hand. And each role/task can actually be executed as a sub-task/agent

AndyNemmity•6mo ago
You stumbled on to the solution before the tools caught up to you. really cool.
_andrei_•6mo ago
> So my argument is that removing the context that is entirely irrelevant for the agent improves performance dramatically.

100% agree on building the optimal context, just have not seen parallel agents do better at sequential tasks. the documentation agent may have better initial context about how to write documentation, but it doesn't have the context of the changes, apart from what is passed to it / can explore. if we don't spawn a new session, and instead throw it a /document command - that would still get all the guidelines and rules for writing documentation, and it should have the same weight since it's at the bottom of the context.

for me the highest 'model jump level' performance booster is externalizing context and controlling the process - having claude initialize a plan file with a pre-defined template that makes it use that as its to do list and documentation place, and getting it to use that as its primary working area

AndyNemmity•6mo ago
I am doing that too. Perhaps I am misjudging which part is providing which benefit, but when using the plan system as well as the agent, the agents with this context setup perform much better.

It all builds on each other, so it can be slightly challenging to untangle which part is providing the majority of benefits.

CyberMacGyver•6mo ago
Genuinely, how do you keep up to date with these features while still being focused at the task?

The more I focus the less I have time to read and experiment O want to finish it. What are your sources and how are you balancing it ?

AndyNemmity•6mo ago
Honestly, the release of sub-agents was the first time it felt actually meaningful.

MCP servers and the rest have not provided the type of gains sub-agents have. Hooks and subagents actually provide tangible value. Enough that it's changed my structure entirely to be tool focused, and not output focused.

j_crick•6mo ago
Why did you replace your comments in this thread with . ?
AndyNemmity•6mo ago
Because I gave examples, and details, and thousands of people read them from a pastebin i used to share.

I didn't release it as open source or anything, just sharing. I don't want to take questions concerning it so I can focus on moving it forward.

Today's goal is to try to build self healing agents that automatically fix the problems they encounter so they only happen once, automating a manual process I successfully use.

Perhaps if that works out well, that is something releasable I can do in a real way as opposed to paste bin.

j_crick•6mo ago
Sadly I was just late to the discussion and missed the stuff. Would you mind sending what you shared to an email of mine? Not requesting further communication, just simply curious what people do with and around this.
dang•6mo ago
Can you please not gut your comments by deleting the content, once they have replies?

It destroys the context for readers, making the thread worse for everyone.

AndyNemmity•6mo ago
That’s fair, I shared things that weren’t in a state to communicate. I shouldn’t have commented at all.
dang•6mo ago
Appreciated! And yes, that happens sometimes.
cristea•6mo ago
I agree with the notion here that things are moving so fast that most blog posts will be outdated quickly.

However posts like this is valuable for new people to get a basic understanding of how these tools could be used in a very simple, beginner friendly, setup.

cantor_S_drug•6mo ago
This is equivalent of .vimrc file. ".clauderc" .
manmal•6mo ago
A few thoughts on this:

- I wouldn’t outsource my brain to CC when it comes to checking CC’s output. Very mixed results in my experience, and it might discourage further exploration/thinking if you’ve already performed the checklist CC has given you (satisficing).

- Slash commands are the idiomatic way to memoize often used prompts, I wonder why author put them in CLAUDE.md?

- I’m also a bit skeptical that, aside from strict rules CC needs to follow, the encouragements/enchantments for writing good code author put in CLAUDE.md really work. But who knows.

- I DO like the caveats section at the end a lot. This is probably the most important piece of the article, when it comes to large codebases. Never just accept the first draft. Review everything with high suspicion and bend the output to your own style and taste. Otherwise, you‘re pushing legacy code.

croes•6mo ago
It will be interesting to see how all this works out when the honeymoon phase is over.

At the moment they burn a investor money to gain customers by giving us free tools and cheap LLMs.

That will end at some point.

chrisvalleybay•6mo ago
One perspective could be that this is a moment to leverage. Subsidized LLMs won't be around forever, so this is a great time to build something that you wouldn't have had the time to build otherwise.
ofrzeta•6mo ago
That many sites about AI look a lot like scam sites ("make money fast") is a real turn-off -> https://www.sabrina.dev

Maybe it does have some valuable content but I will just close it.

Valodim•6mo ago
Wow, geez. That is terrible, I'm surprised a website like that can have any actual content as TFA
WD-42•6mo ago
The grifters have entered the arena.
literalAardvark•6mo ago
Hype is like catnip for them. They've been around for a while, many of the OG AI YouTubers didn't know anything about it when they started to pretend to educate.
rootnod3•6mo ago
I mean, the grifters have started it. They grifted for way too much money without ANY plan on how to actually get money back to the investors. Including the dubious benefits, just promising a "quick fix".

That is a bit over the top, but the core statement kind of stands.

layman51•6mo ago
I agree. I remember a year ago some a submission on this forum by Sabrina where she was evaluating LLMs on SAT math problems. The post was called like “ChatGPT-4o vs. Math”. I thought it was pretty insightful, but now that post is deleted from her blog and it seems like the blog as a whole is leaning into attracting people who are nontechnical and want to use it to generate social media content. Weird vibe in the comments of her posts and videos too.
steilpass•6mo ago
I stumbled upon the author’s command `qnew` to read Claude.md.

Why would you need that?

> CLAUDE.md is a special file that Claude automatically pulls into context when starting a conversation.

https://www.anthropic.com/engineering/claude-code-best-pract...

vunderba•6mo ago
That confused me as well. The whole point of CLAUDE.md is that it is automatically injected into the rolling context of any Claude interaction/thread.

That CLAUDE.md file that they've posted is also HUGE - Anthropic themselves recommend keeping it on the more concise side. If you need more, consider just creating dedicated subagents for "UI/UX reviewer", "search", etc.

theshrike79•6mo ago
My CLAUDE.md has references to other documentation

Like "for guidance on Go library selection refer to docs/go.md"

That way the main file stays compact.

brunooliv•6mo ago
Am I the only one who uses CC in a one-shot fashion? I rely on what has been generated by the init slash command at the root of my monorepo and then when I have a complex task I manually create a md file called CONTEXT.md inside the package/area where I want to implement the new task. I try to be as detailed as possible, think like writing a ticket for a junior dev. Then my only command for CC is something like: “Focus on the project structure described in CLAUDE.md and work through the task planned in CONTEXT.md. Follow current codebase standards, keep code reusable and easy to follow and modify.”

This is like 99% of my CC interactions working on top of a well structured codebase and it just works perfectly for almost any task I throw at it.

chrisvalleybay•6mo ago
I mostly do the same. My workflow is usually like this: 1. Supply CC with a simple user story with light spec / business logic, asking it to write a specification for the feature and ask me any questions that need to be answered to make it complete. 2. Build up the context with all relevant files that I want to use. In Ruby on Rails it means refercing models, views and controllers that show the structure we are going to use. 3. Write the code and tests.
brunooliv•6mo ago
Nice to see!! I wrote a longer post about this: https://open.substack.com/pub/boliv/p/claude-code-usage-patt...