frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Keep large tool output out of LLM context: 3x accuracy 95% fewer tokens

https://github.com/lourencomaciel/sift-gateway
4•loumaciel•1h ago
LLM agents often place raw JSON tool outputs directly in the prompt. After a few tool calls, earlier results get compacted or truncated and answers become incorrect or inconsistent.

I built Sift, a drop-in MCP gateway that stores tool outputs as local artifacts (filesystem blobs indexed in SQLite) and returns an `artifact_id` plus compact schema hints when responses are large or paginated.

Instead of reasoning over full JSON in the prompt, the model runs a small Python query:

    def run(data, schema, params):
        return max(data, key=lambda x: x["magnitude"])["place"]
Query code runs in a constrained subprocess (AST/import guards + timeout/memory caps). Only the computed result is returned to the model.

Benchmark (Claude Sonnet 4.6, 103 questions across 12 datasets):

- Baseline (raw JSON in prompt): 34/103 (33%), 10.7M input tokens

- Sift (artifact + code query): 102/103 (99%), 489K input tokens

Open benchmark + MIT code: https://github.com/lourencomaciel/sift-gateway

Install:

    pipx install sift-gateway
    sift-gateway init --from claude
Works with Claude Code, Cursor, Windsurf, Zed, and VS Code. Existing MCP servers and tools require no changes.

Comments

loumaciel•1h ago
Happy to answer questions about the sandboxing, artifact format, or the benchmark setup.

The benchmark harness and datasets are in the repo if anyone wants to reproduce or extend the tests. Curious if others have run into the same context compaction issues with tool-heavy agents.

Flyte 2 In-Browser Demo: Open-Source AI Orchestration Is Now Available Locally

https://flyte.org/platform/flyte-2
1•aitacobell•36s ago•0 comments

"My bros and I are looksmaxers"

https://substack.com/@tomasbjartur/note/c-200613630
1•eatitraw•40s ago•0 comments

Show HN: JobApplicator (tailored job applications in minutes)

https://jobapplicator.win/
1•quinndupont•55s ago•0 comments

What to Put in a Claude Code Skill for Reviewing Your Team's Code

https://everyrow.io/blog/claude-review-skill
1•parad0x0n•2m ago•0 comments

Show HN: Open Right Zoom, Open Source Alternative to Right Zoom for macOS

https://github.com/Michele0303/open-right-zoom
1•michele0303•2m ago•0 comments

Show HN: Form81 – 100% free form builder (free Typeform alternative)

https://form81.com/
1•sh_tomer•4m ago•0 comments

Feature gating patterns in a multi-tenant Next.js SaaS

1•madebyjam•4m ago•0 comments

The Browser Can Speak a Page

https://adrianroselli.com/2026/03/your-browser-can-already-speak-a-page.html
3•speckx•4m ago•0 comments

Show HN: Venus flight simulator to train LLM pilots (~2% vs. 1985 Soviet data)

https://veenie.space/
1•hackiku•5m ago•1 comments

The AI in minutes, solves patient care problem that stumped doctors for months

https://www.fiercehealthcare.com/health-tech/cvs-unveils-health-100-its-new-google-powered-consum...
1•krzyzanowskim•6m ago•0 comments

Tiny, 45 base long RNA can make copies of itself

https://arstechnica.com/science/2026/02/researchers-find-small-rnas-that-can-make-copies-of-thems...
1•PaulHoule•6m ago•0 comments

Middle East war makes ethical debate over AI use in war all too real

https://www.cbc.ca/player/play/video/9.7115523
1•empressplay•6m ago•0 comments

The Illusion of Building

https://uphack.io/blog/post/the-illusion-of-building/
1•birdculture•7m ago•0 comments

Flash Attention 4

https://www.together.ai/blog/flashattention-4
1•zagwdt•8m ago•0 comments

The ML Engineer's Guide to Protein AI

https://huggingface.co/blog/MaziyarPanahi/protein-ai-landscape
1•maziyar•8m ago•1 comments

Show HN: SamarthyaBot – a privacy-first self-hosted AI agent OS

https://github.com/mebishnusahu0595/SamarthyaBot
1•mebishnusahu0•8m ago•1 comments

Chrome is moving to a two-week release cycle starting with Chrome 153

https://developer.chrome.com/blog/chrome-two-week-release
1•maxloh•9m ago•0 comments

Show HN: Argus – VSCode debugger for Claude Code sessions

https://github.com/yessGlory17/argus
1•lydionfinance•10m ago•0 comments

Buhurt board game – Knight fight [video]

https://www.youtube.com/watch?v=DN7NsfMH8g4
1•melor•10m ago•0 comments

AI Agent Authentication and Authorization IETF RFC Draft

https://datatracker.ietf.org/doc/draft-klrc-aiagent-auth/
1•mooreds•10m ago•0 comments

44% on ARC-AGI-1 in 67 cents

https://github.com/mvakde/mdlARC/
1•evilmathkid•12m ago•1 comments

I made a WeTransfer clone with Darth Vader vibes

https://DropVader.com
1•hitsnoozer•12m ago•0 comments

Web of Documents (2019)

https://blog.danieljanus.pl/2019/10/07/web-of-documents/
1•nathell•12m ago•0 comments

In Which We Give Our AI Agent a Map (and It Stops Getting Lost)

https://seylox.github.io/2026/03/05/blog-agents-meta-repo-pattern.html
1•Seylox•13m ago•1 comments

Show HN: Jpa-fetch – JPA fetch control that doesn't suck

https://github.com/alterioncorp/jpa-fetch
1•alexlitovsky•13m ago•0 comments

OpenAI launched symphony, turn project work into isolated, autonomous runs

https://github.com/openai/symphony/blob/main/README.md
1•vampiregrey•13m ago•0 comments

Doing My Taxes with Claude

https://theautomatedoperator.substack.com/p/claude-almost-nails-my-taxes-and
1•idopmstuff•13m ago•0 comments

JJ LSP Follow Up

https://matklad.github.io/2026/03/05/jj-lsp-followup.html
2•emschwartz•13m ago•0 comments

Show HN: In 2026, I'm building a conversational WordPress form plugin

https://wp.forms.md/
1•darkhorse13•13m ago•0 comments

Show HN: Cook – A portable terminal AI agent (OSS, MIT)

https://getcook.dev
1•vadepaysa•14m ago•0 comments