frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

The Therac-25 Incident

https://thedailywtf.com/articles/the-therac-25-incident
115•lemper•3h ago•72 comments

WebLibre: The Privacy-Focused Browser

https://docs.weblibre.eu/
18•mnmalst•1h ago•8 comments

Claude for Chrome

https://www.anthropic.com/news/claude-for-chrome
655•davidbarker•15h ago•350 comments

Scientist exposes anti-wind groups as oil-funded. Now they want to silence him

https://electrek.co/2025/08/25/scientist-exposes-anti-wind-groups-as-oil-funded-now-they-want-to-...
209•xbmcuser•3h ago•43 comments

Malleable Software Will Eat the SaaS World

https://www.mdubakov.me/malleable-software-will-eat-the-saas-world/
18•tablet•2h ago•23 comments

Gemini 2.5 Flash Image

https://developers.googleblog.com/en/introducing-gemini-2-5-flash-image/
936•meetpateltech•20h ago•428 comments

Dissecting the Apple M1 GPU, the end

https://rosenzweig.io/blog/asahi-gpu-part-n.html
478•alsetmusic•8h ago•105 comments

Light pollution prolongs avian activity

https://gizmodo.com/birds-across-the-world-are-singing-all-day-for-a-disturbing-reason-2000646257
74•gmays•3d ago•14 comments

GNU Artanis – A fast web application framework for Scheme

https://artanis.dev/index.html
223•smartmic•14h ago•45 comments

Chinese astronauts make rocket fuel and oxygen in space

https://www.livescience.com/space/space-exploration/chinese-astronauts-make-rocket-fuel-and-oxyge...
230•Teever•2d ago•96 comments

The man with a Home Computer (1967) [video]

https://www.youtube.com/watch?v=w6Ka42eyudA
38•smarm•5h ago•18 comments

Rv, a new kind of Ruby management tool

https://andre.arko.net/2025/08/25/rv-a-new-kind-of-ruby-management-tool/
254•steveklabnik•1d ago•90 comments

Bypass PostgreSQL catalog overhead with direct partition hash calculations

https://www.shayon.dev/post/2025/221/bypass-postgresql-catalog-overhead-with-direct-partition-has...
21•shayonj•3d ago•5 comments

Reverse Engineered Raspberry Pi Compute Module 5

https://github.com/schlae/cm5-reveng
38•_Microft•2d ago•5 comments

Neuralink 'Participant 1' says his life has changed

https://fortune.com/2025/08/23/neuralink-participant-1-noland-arbaugh-18-months-post-surgery-life...
288•danielmorozoff•3d ago•329 comments

One universal antiviral to rule them all?

https://www.cuimc.columbia.edu/news/one-universal-antiviral-rule-them-all
302•breve•20h ago•135 comments

US Intel

https://stratechery.com/2025/u-s-intel/
459•maguay•23h ago•483 comments

Molluscs of the Multiverse: molluscan diversity in Magic: The Gathering

https://jgeekstudies.org/2025/08/24/molluscs-of-the-multiverse-molluscan-diversity-in-magic-the-g...
6•zdw•2d ago•0 comments

Japan has opened its first osmotic power plant

https://www.theguardian.com/world/2025/aug/25/japan-osmotic-power-plant-fukuoka
259•pseudolus•2d ago•83 comments

Denmark summons top US diplomat over alleged Greenland influence operation

https://www.bbc.com/news/articles/c0j9l08902eo
86•vinni2•1h ago•60 comments

SpaCy: Industrial-Strength Natural Language Processing (NLP) in Python

https://github.com/explosion/spaCy
90•marklit•4d ago•31 comments

Uncomfortable Questions About Android Developer Verification

https://commonsware.com/blog/2025/08/26/uncomfortable-questions-android-developer-verification.html
297•ingve•5h ago•212 comments

A teen was suicidal. ChatGPT was the friend he confided in

https://www.nytimes.com/2025/08/26/technology/chatgpt-openai-suicide.html
298•jaredwiener•20h ago•308 comments

iOS 18.6.1 0-click RCE POC

https://github.com/b1n4r1b01/n-days/blob/main/CVE-2025-43300.md
207•akyuu•1d ago•46 comments

Show HN: Regolith – Regex library that prevents ReDoS CVEs in TypeScript

https://github.com/JakeRoggenbuck/regolith
25•roggenbuck•7h ago•22 comments

Why do people keep writing about the imaginary compound Cr2Gr2Te6?

https://www.righto.com/2025/08/Cr2Ge2Te6-not-Cr2Gr2Te6.html
168•freediver•16h ago•93 comments

The McPhee method for writing deeply reported nonfiction

https://jsomers.net/blog/the-mcphee-method
166•jsomers•1d ago•41 comments

Michigan Supreme Court: Unrestricted phone searches violate Fourth Amendment

https://reclaimthenet.org/michigan-supreme-court-rules-phone-search-warrants-must-be-specific
483•mikece•16h ago•89 comments

Undisclosed financial conflicts of interest in DSM-5 (2024)

https://www.bmj.com/content/384/bmj-2023-076902
225•renameme•17h ago•162 comments

LiteLLM (YC W23) is hiring a back end engineer

https://www.ycombinator.com/companies/litellm/jobs/6uvoBp3-founding-backend-engineer
1•detente18•13h ago
Open in hackernews

How to Fix Your Context

https://www.dbreunig.com/2025/06/26/how-to-fix-your-context.html
91•itzlambda•3d ago

Comments

profsummergig•3d ago
This seems to be an important article.

However it uses various terms that I'm not sure of the definition for.

E.g. the word "Tool".

How can I learn about the definitions of these words? Will reading prior articles in the series help? Please advise.

nativeit•2d ago
tool (n)

One who lacks the mental capacity to know he is being used. A fool. A cretin. Characterized by low intelligence and/or self-esteem.

Disclaimer: I only cite definitions from Urban Dictionary, but I remain firmly convinced they are correct definitions in context.

1123581321•1d ago
You should make a Chrome plugin that fills in Urban Dictionary definitions of first names while you’re on LinkedIn.
jcheng•1d ago
Here's my attempt at explaining tool calling:

https://youtu.be/owDd1CJ17uQ?si=Z2bldI8IssG7rGON&t=1330

It's an _incredibly_ important concept to understand if you have even a passing interest in LLMs. You need to understand it if you want to have any kind of mental model for how LLM-powered agents are even possible.

profsummergig•1d ago
Thank you, I watched it. The key takeaway I got was that the client (browser, I suppose) does the actual usage of such tools. The user hands over control of these tools to the AI (and the tool-use happens in the background so it might look to the user like the AI is the one doing the actual usage of the tools).
tptacek•1d ago
Ordinarily:

   you> what's going on?
   > It's going great --- how can I help you today?
Tool calls:

   you> [json blob of available "tools": "ls", "grep", "cat"]
   you> what's going on?
   > [json blob selecting "ls"]
   (you) presumably run "ls"
   you> [json blob of "ls" output]
   > [json blob selecting "cat foo.c"]
   (you) dump "foo.c"
   you> [json blob of "cat foo.c"]
   > I can see that we're in a C project that does XYZ...
The key thing is just: tools are just a message/conversation abstraction LLMs are trained to adhere to: they know to spit out a standardized "tool call" JSON, and they know to have multi-round conversations with sets of different "tools" (whichever ones are made available to them) to build up context to answer questions with.

That's the whole thing.

trjordan•1d ago
This is all true, and we've prototyped a number of these things at my current startup. You need to be pretty considered about implementing them.

For a counter-example, consider Claude Code:

- 1 long context window, with (at most) 1 sub-agent

- same tools available at all times, and to sub-agent (except: spawning a sub-sub-agent)

- Full context stays in conversation, until you hit the context window limit. Compaction is automatic but extremely expensive. Quality absolutely takes a dive until everything is re-established.

- Deterministic lookup of content. Claude reads files with tools, not includes random chunks from RAG cosine similarity.

I could go on. In my experience, if you're going to use these techniques 1) maybe don't and 2) turn up the determinism to 11. Get really specific about _how_ you're going to use, and why, in a specific case.

For example, we're working on code migrations [0]. We have a tool that reads changelogs, migration guides, and OSS source. Those can be verbose, so they blow the context window on even 200k models. But we're not just randomly deleting things out of the "plan my migration" context, we're exposing a tool that deliberately lets the model pull out the breaking changes. This is "Context Summarization," but before using it, we had to figure out that _those_ bits were breaking the context, _then_ summarizing them. All our attempts at generically pre-summarizing content just resulted in poor performance because we were hiding information from the agent.

[0] https://tern.sh

jasonjmcghee•1d ago
What do you mean re Claude Code, "at most 1 sub-agent"?
trjordan•1d ago
It only spawns a single sub-agent (called Task iirc), which can do everything Claude Code can, except call Task().

This is different from a lot of the context-preserving sub-agents, which have fully different toolsets and prompts. It's much more general.

CuriouslyC•1d ago
This isn't going to be much of a problem for long. I'm wrapping up on an agent context manager that gives effectively infinite context while producing ~77% better results than naive vector+BM25 baseline on my benchmark suite.
rl3•1d ago
May I ask the rationale for writing your own? Were you using an existing tool that didn't quite fit your needs?

This is an itch I've been wanting to scratch myself, but the space has so many entrants that it's hard to justify the time investment.

CuriouslyC•1d ago
Existing off the shelf IR tools are mid, more recent research is often not productionized, and there are a lot of assumptions that hold for agentic context (at least in the coding realm, which is the one that matters) that you can take advantage of to push performance.

That plus babysitting Claude Code's context is annoying as hell.

rl3•1d ago
Thanks.

>That plus babysitting Claude Code's context is annoying as hell.

It's crazy to me that—last I checked—its context strategy was basically tool use of ls and cat. Despite the breathtaking amount of engineering resources major AI companies have, they're eschewing dense RAG setups for dirt simple tool calls.

To their credit it was good enough to fuel Claude Code's spectacular success, and is fine for most use cases, but it really sucks not having proper RAG when you need it.

On the bright side, now that MCP has taken off I imagine one can just provide their preferred RAG setup as a tool call.

CuriouslyC•1d ago
You can, but my tool actually handles the raw chat context. So you can have millions of tokens in context, and actual message that gets produced for the LLM is an optimized distillate, re-ordered to take into account LLM memory patterns. RAG tools are mostly optimized for QA anyhow, which has dubious carryover to coding tasks.
olejorgenb•18h ago
> ... re-ordered to take into account LLM memory patterns.

If I understand you correctly, doesn't this break prefix KV caching?

CuriouslyC•18h ago
It is done at immediately before the LLM call, transforming the message history for the API call.

This does reduce the context cache hit rate a bit, but I'm cache aware so I try to avoid repacking the early parts if I can help it. The tradeoff is 100% worth it though.

barbazoo•1d ago
> When prompting DeepSeek-v3, the team found that selecting the the right tools becomes critical when you have more than 30 tools. Above 30, the descriptions of the tools begin to overlap, creating confusion. Beyond 100 tools, the model was virtually guaranteed to fail their test. Using RAG techniques to select less than 30 tools yielded dramatically shorter prompts and resulted in as much as 3x better tool selection accuracy.

> For smaller models, the problems begin long before we hit 30 tools. One paper we touched on last post, “Less is More,” demonstrated that Llama 3.1 8b fails a benchmark when given 46 tools, but succeeds when given only 19 tools. The issue is context confusion, not context window limitaions.

High number of tools is a bit of a "smell" to me and often makes me wonder if the agent doesn't have too much responsibility. A bit like a method with so many parameters, it can do almost anything.

Have folks had success with agents like that? I found the fewer tools the better, e.g. <10 "ballpark".

knewter•1d ago
we have success with 39 but we're introducing more focused agents and a smart router because we see the writing on the wall among other things (benefits)