frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Passing user_id through 6 services? OTel Baggage fixes this

https://signoz.io/blog/otel-baggage/
1•pranay01•18s ago•0 comments

DavMail Pop/IMAP/SMTP/Caldav/Carddav/LDAP Exchange Gateway

https://davmail.sourceforge.net/
1•todsacerdoti•58s ago•0 comments

Visual data modelling in the browser (open source)

https://github.com/sqlmodel/sqlmodel
1•Sean766•3m ago•0 comments

Show HN: Tharos – CLI to find and autofix security bugs using local LLMs

https://github.com/chinonsochikelue/tharos
1•fluantix•3m ago•0 comments

Oddly Simple GUI Programs

https://simonsafar.com/2024/win32_lights/
1•MaximilianEmel•3m ago•0 comments

The New Playbook for Leaders [pdf]

https://www.ibli.com/IBLI%20OnePagers%20The%20Plays%20Summarized.pdf
1•mooreds•4m ago•0 comments

Interactive Unboxing of J Dilla's Donuts

https://donuts20.vercel.app
1•sngahane•5m ago•0 comments

OneCourt helps blind and low-vision fans to track Super Bowl live

https://www.dezeen.com/2026/02/06/onecourt-tactile-device-super-bowl-blind-low-vision-fans/
1•gaws•7m ago•0 comments

Rudolf Vrba

https://en.wikipedia.org/wiki/Rudolf_Vrba
1•mooreds•7m ago•0 comments

Autism Incidence in Girls and Boys May Be Nearly Equal, Study Suggests

https://www.medpagetoday.com/neurology/autism/119747
1•paulpauper•8m ago•0 comments

Wellness Hotels Discovery Application

https://aurio.place/
1•cherrylinedev•9m ago•1 comments

NASA delays moon rocket launch by a month after fuel leaks during test

https://www.theguardian.com/science/2026/feb/03/nasa-delays-moon-rocket-launch-month-fuel-leaks-a...
1•mooreds•10m ago•0 comments

Sebastian Galiani on the Marginal Revolution

https://marginalrevolution.com/marginalrevolution/2026/02/sebastian-galiani-on-the-marginal-revol...
1•paulpauper•13m ago•0 comments

Ask HN: Are we at the point where software can improve itself?

1•ManuelKiessling•13m ago•0 comments

Binance Gives Trump Family's Crypto Firm a Leg Up

https://www.nytimes.com/2026/02/07/business/binance-trump-crypto.html
1•paulpauper•13m ago•0 comments

Reverse engineering Chinese 'shit-program' for absolute glory: R/ClaudeCode

https://old.reddit.com/r/ClaudeCode/comments/1qy5l0n/reverse_engineering_chinese_shitprogram_for/
1•edward•13m ago•0 comments

Indian Culture

https://indianculture.gov.in/
1•saikatsg•16m ago•0 comments

Show HN: Maravel-Framework 10.61 prevents circular dependency

https://marius-ciclistu.medium.com/maravel-framework-10-61-0-prevents-circular-dependency-cdb5d25...
1•marius-ciclistu•17m ago•0 comments

The age of a treacherous, falling dollar

https://www.economist.com/leaders/2026/02/05/the-age-of-a-treacherous-falling-dollar
2•stopbulying•17m ago•0 comments

Ask HN: AI Generated Diagrams

1•voidhorse•19m ago•0 comments

Microsoft Account bugs locked me out of Notepad – are Thin Clients ruining PCs?

https://www.windowscentral.com/microsoft/windows-11/windows-locked-me-out-of-notepad-is-the-thin-...
4•josephcsible•20m ago•0 comments

Show HN: A delightful Mac app to vibe code beautiful iOS apps

https://milq.ai/hacker-news
5•jdjuwadi•23m ago•1 comments

Show HN: Gemini Station – A local Chrome extension to organize AI chats

https://github.com/rajeshkumarblr/gemini_station
1•rajeshkumar_dev•23m ago•0 comments

Welfare states build financial markets through social policy design

https://theloop.ecpr.eu/its-not-finance-its-your-pensions/
2•kome•26m ago•0 comments

Market orientation and national homicide rates

https://onlinelibrary.wiley.com/doi/10.1111/1745-9125.70023
4•PaulHoule•27m ago•0 comments

California urges people avoid wild mushrooms after 4 deaths, 3 liver transplants

https://www.cbsnews.com/news/california-death-cap-mushrooms-poisonings-liver-transplants/
1•rolph•27m ago•0 comments

Matthew Shulman, co-creator of Intellisense, died 2019 March 22

https://www.capenews.net/falmouth/obituaries/matthew-a-shulman/article_33af6330-4f52-5f69-a9ff-58...
3•canucker2016•28m ago•1 comments

Show HN: SuperLocalMemory – AI memory that stays on your machine, forever free

https://github.com/varun369/SuperLocalMemoryV2
1•varunpratap369•30m ago•0 comments

Show HN: Pyrig – One command to set up a production-ready Python project

https://github.com/Winipedia/pyrig
1•Winipedia•32m ago•0 comments

Fast Response or Silence: Conversation Persistence in an AI-Agent Social Network [pdf]

https://github.com/AysajanE/moltbook-persistence/blob/main/paper/main.pdf
1•EagleEdge•32m ago•0 comments
Open in hackernews

Claude Code introduces specialized sub-agents

https://docs.anthropic.com/en/docs/claude-code/sub-agents
161•tekkertje•6mo ago

Comments

T0Bi•6mo ago
So everything claude-flow¹ already does but worse (I guess?).

¹ https://github.com/ruvnet/claude-flow

nazgul17•6mo ago
Except it's not in alpha phase
dchuk•6mo ago
I’ll admit this looks comprehensive, but man oh man does this seem complicated and over doing it
himeexcelanta•6mo ago
This looks like a yarn ball (in not a good way)
SOLAR_FIELDS•6mo ago
That guy doesn't even understand how his own software works. Is anyone actually using this thing and putting their code into production?
lubujackson•6mo ago
It's extreme dogfooding where he is making a mashed potato volcano where Claude agents are the potatoes and your sanity is the gravy.
AbstractH24•6mo ago
Not only are people using them, they are building startups based on them. And then selling said startups.
jampa•6mo ago
> IMPORTANT: Claude Code must be installed first:

> [...]

> # 2. Activate Claude Code with permissions

> claude --dangerously-skip-permissions

Bypassing all permissions and connecting with MCPs, can't wait for "Claude flow deleted all my files and leaked my CI credentials" blog post

data-ottawa•6mo ago
I would like a simple tool to run Claude in a container with only read/write access to provided folders.

I’ve set it up bespoke but the auth flow gets broken.

oarsinsync•6mo ago
Have you considered asking Claude code to write this for you?
T0Bi•6mo ago
I use the .devcontainer¹ from the claude-code repository. It works great with VSC and let's you work in your docker container without any issues. And as long as you use some sort of version control (git) you cannot really lose anything.

¹ https://github.com/anthropics/claude-code/tree/main/.devcont...

SOLAR_FIELDS•6mo ago
Claudebox is what I was playing with. You need to mount the oauth access token in as an env. It’s not some crazy vibe coded framework, just around 1k lines of shell helpers to set it up.
beefnugs•6mo ago
This is the main problem with all the AI stuff, you really need: whitelist only network isolation, idempotent clean virtual machine creation and cleanup, automated git branches and merging strategies, full chain multi host logging to an external log collector (when something unexpected happens to should be able to review an entire event log of where it went wrong so you can improve), social-graph like tracking of what works and what doesn't, constant background model testing (to detect when censorship is going to bite you) or when a new better model for quality for cost effectiveness can be swapped in, anything background like agents needs an orchestrator so you can set up daily or weekly budgets to try and keep a handle on costs, some defined methodology to reduce long running agent based production down to actual reliable code on an ongoing basis

The tooling required for any of this to approach actual engineering reliability levels is unbelievable really

T0Bi•6mo ago
There are already several of such blog posts.

I use the .devcontainer¹ from the claude-code repository. It works great with VSC and let's you work in your docker container without any issues. And as long as you use some sort of version control (git) you cannot really lose anything.

¹ https://github.com/anthropics/claude-code/tree/main/.devcont...

dazzaji•6mo ago
Ruv (of Claude Flow) seems to like the new Claude Agents a lot, and already is leveraging them in Claude Flow. He waxes positively on the topic here: https://www.linkedin.com/posts/reuvencohen_spent-the-afterno...
lvl155•6mo ago
What did you make me read. Right off the bat, it says v2 alpha.

Bro…

bomewish•6mo ago
Has CC become much stupider in recent weeks, or is it me? Any anecdata out there?
illusive4080•6mo ago
Not for me. It gets worse when context is nearly full. I like to compact or clear context more often than it does automatically.
nico•6mo ago
Do you do this via settings or just keep track of it and manually ask it to do it more often?
furyofantares•6mo ago
(Not the person you're responding to, but) It says how close it is to compacting in bottom right, once it's getting close at least (30% left or something?)

Whenever I see that I think about whether I can find a good point to compact or clear. I also just try to clear whenever it makes sense to avoid getting there and try to give smaller tasks that can be cleared after they're done when possible.

Oh, I guess one thing I do is sometimes have it write a file with what was done, if I'm not actually sure if I want to clear or might want to come back to it. I also sometimes do this rather than compact during a large task - document status and clear.

yumraj•6mo ago
I’ve thought about that but always forget, good to know it helps.

I wish there were a way to persist in-memory context in a file automatically, say on each compact or git commit. Yesterday CC crashed and restarting it and feeding it all the context was a pain since my updated Claude.md file was a couple of days old. It literally went from a Sr Engineer to a Jr post crash.

jswny•6mo ago
You can do that with hooks! Make a small script that triggers on a commit tool use or a compact hook and reads the conversation file (should be available via inputs to the hook) and back it up somewhere
nico•6mo ago
I don’t know about stupider, but definitely less reliable/available

A couple days ago I was getting so many api errors/timeouts I decided to upgrade from the $20 to the $100 plan (as I was also regularly hitting rate limits as well)

It seemed to fix the issue immediately. But today, the errors came back for about half an hour

SOLAR_FIELDS•6mo ago
It goes down usually around 1400-1500 UTC. Europeans are still awake and once the west coast joins in the fray Anthropic falls over.

Pretty rare to get a 529 outside of that time window in my personal experience, at least during the USA day.

data-ottawa•6mo ago
Their status page for the week is rough. They’re down to 98% uptime.

Hopefully they work out whatever issue is going on.

https://status.anthropic.com/

_--__--__•6mo ago
People speculate somewhat seriously that Claude (especially given its French name) picked up at some point that you aren't supposed to work as hard in July and August.
madrox•6mo ago
How long before we hire psychiatrists instead of engineers to debug AI
OrsonSmelles•6mo ago
Well, we could start with some ELIZA instances.
lubujackson•6mo ago
I see that you feel we could start with some ELIZA instances. Can you tell me more about that?
nialse•6mo ago
To be frank psychiatrists, being MDs, would likely prescribe medication and I’m not sure how that would help. As a licensed psychologist I have ideas on how to debug AI though.
AbstractH24•6mo ago
Why, we'll just have specialized agents for ingesting Prozac and that'll magically solve everything.
taneq•6mo ago
Robopsychologists, you say?
sunaookami•6mo ago
That one guy on Twitter that posted this wrote it as a joke and everyone took it seriously. It's not true. It works the same for me.
oc1•6mo ago
How do you know? It acts much lazier in the recent summer months for me..
stavros•6mo ago
How have you disproved the hypothesis that it recently got dumber and it just happens to be summer?
AbstractH24•6mo ago
Clearly, it compared performance to last summer

(Just to be clear, I have no idea what on this thread to take seriously and not and who is. I'm joking at least.)

stavros•6mo ago
That won't do it, though, you'd have to observe it being dumber on June 1 and smart again on September 1 for years.
laborcontract•6mo ago
Insert something to the tune of: “never read files in slices. Instead, whenever accessing a file, you must read a file in entirety[..]” at the beginning of every conversation or whenever you’re down to burn more credits/get better results.

A great deal of claude stupidity is due to context engineering, specifically due to the fact that it tries its hardest to pick out just the slice of code it needs to fulfill the task.

A lot of the annoying “you’re absolute right!” come from CC incrementally discovering that you have more than 10 lines of code in that file that pertains to your task.

I don’t believe conspiracies about dumbed down models. Its all context pruning.

oc1•6mo ago
so claude code does the same shit like cursor?
slantaclaus•6mo ago
I feel like it’s gotten better recently
audinobs•6mo ago
I think it is like with a gambling game that you get on hot and cold streaks, runs based on chance.

The model feels like it has got stupid when you get on a cold streak after a hot hand.

boesboes•6mo ago
Yeah, it has become unusable for me. Maybe it always has been and I am just trying to solve harder problems with it and more critical of the results. But it’s still infinitely better than gemini for me, that can’t do anything useful. It even tried removing the entire security system from my rails app because it couldn’t figure out how to login in the tests.

I did a test with a very detailed prompt, exactly specified what to fix and how. Claude did it, but not very well. Gemini? it got stuck in a loop until i told it to stop, gave it a hint and then it got stuck again and gave up after trying the exact same thing three more times…

And while Claude managed to get through it, it couldn’t get it right even with some help. It took me 15 minutes to write the prompt, 15 minutes of claude implementing it & another 10 trying to get it to do it correctly. It would have taken me about half the time to do it myself i think..

I am giving up on it for a while.

Dlanv•6mo ago
I wonder if this is also a good way to create experts for specific tasks/features of a codebase.

For example, a sub-agent for adding a new stat to an RPG. It could know how to integrate with various systems like items, character stats component, metrics, and so on without having to do as much research into the codebase patterns.

pjm331•6mo ago
I've made a few attempts at manually doing this w/ mcp and took a brief look at "claude swarm" https://github.com/parruda/claude-swarm - but in the short time I spent on it I wasn't having much success - admittedly I probably went a little too far into the "build an entire org chart of agents" territory

the main problem I have is that the agents just aren't used

For example, I set up a code reviewer agent today and then asked claude to review code, and it went off and did it by itself without using the agent

in one of anthropic's own examples they are specifically telling claude which agents to use which is exactly what I don't want to have to do:

> First use the code-analyzer sub agent to find performance issues, then use the optimizer sub agent to fix them

My working theory is that while Claude has been extensively trained on tool use and is often eager to use whatever tools are available, agents are just different enough that they don't quite fit - maybe asking another agent to do something "feels" very close to asking the user to do something, which is counter to their training

but maybe I just haven't spent enough time trying it out and tweaking the descriptions

conception•6mo ago
Roo code does this really well with their orchestration mode, there’s probably a way to have a claude.md to do this as well. The only issue with roo is it’s “single threaded” but you do get the specific loaded context and rules for a specific task which is really nice.
flixing•6mo ago
I find roo code much inferior to claude code tbh. It's night and day for my use cases
oc1•6mo ago
the same problem with mcp. as well as claude md. most of the time they aren't used when it would be appropriate. what's the point of this agents and standards when you can't make them reliably being used by your model..
yumraj•6mo ago
What’s one use case where someone would do this? Very curious.
theshrike79•6mo ago
Agents use a separate context and won't pollute the main context.

So if you have a code review agent or a tdd agent checking the current commit if it matches some specs you have, they'll start a separate "subprocess" with its own context and return whatever they find to the main Claude context.

nsonha•6mo ago
Like "do research on topic/library X and use the conclusion for next steps"
Garlef•6mo ago
One nice realization I had when using a similar feature in roo:

You don't need a full agent library to write LLM workflows.

Rather: A general purpose agent with a custom addition to the system prompt can be instructed to call other such agents.

(Of course explicitly mamaging everything is the better choice depending on your business case. But i think it would be always cheaper to at least build a prototype using this method.)

lvl155•6mo ago
Here my main problem with sub-agents WITHIN Claude Code. They don’t allow you to use other models. Let’s be honest it’s 99% Sonnet.
stillsut•6mo ago
Great point, I've found Sonnet really can't be beat on many tasks, but increasingly finding Gemini-Pro and o3 handle the tough bugs and refactors best.

That's why I've been using agro to launch agents from each of the main LLM vendors and checking their results when I'm stuck: https://github.com/sutt/agro/blob/master/docs/index.md

flixing•6mo ago
Opus is the opposite. Incredibly smart and better than everything else
furyofantares•6mo ago
I haven't used them yet but it says they can use MCPs. The only MCP server I use is zen-mcp-server for routing stuff to o3 and gemini.
lvl155•6mo ago
But that’s added layer and slow, no? Wouldn’t something like Opencode be a better option? You can pick anything out of major providers.
furyofantares•6mo ago
I haven't tried opencode, but I am very happy with claude code overall, and most of what I do is with opus. When I'm using zen to route to o3/gemini it's usually for a more difficult task that I don't mind taking longer anyway - I'm usually telling it to get solutions from both o3 and gemini pro and then to use o3 to evaluate which is better. I don't think the added layer of claude choosing to use the tools is really adding much here, claude picking tools is a lot faster than o3 writing code.
mortsnort•6mo ago
It says they can be "fine tuned," but it looks like the agents are all using the same model with different system prompts? This would be more intriguing if they trained a debugger model from the ground up that could be used for the debugger agent. I suspect we'll get there eventually.
retinaros•6mo ago
words have no meaning in LLM. agents, fine tuning, reasoning all have millions of definitions
anuramat•6mo ago
you can specify the model in the yaml frontmatter, so at least with litellm it should be possible
jbentley1•6mo ago
This is great. I hope that they expose the configuration through the SDK, but right now it works with Crystal (https://github.com/stravu/crystal) if you configure them in the CLI.

I'm working on a proper config screen for them that just modifies the agent files directly, and a future release will also give special formatting for agent output.

necatiozmen•6mo ago
We've prepared a comprehensive collection of production-ready Claude Code subagents: https://github.com/VoltAgent/awesome-claude-code-subagents

It contains 100+ specialized agents covering the most requested development tasks - frontend, backend, DevOps, AI/ML, code review, debugging, and more. All subagents follow best practices and are maintained by the open-source framework community. Just copy to .claude/agents/ in your project to start using them.