frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: ARISE – Agents that create their own tools at runtime when they fail

https://github.com/abekek/arise
3•abekek•2h ago
I built a framework that lets LLM agents create their own tools at runtime. Most agent frameworks assume you'll hand-craft every tool upfront. That works until your agent hits something you didn't plan for. ARISE (Adaptive Runtime Improvement through Self-Evolution) lets agents synthesize their own tools at runtime when they detect gaps

ARISE sits between your agent and its tool library. When the agent keeps failing at a class of tasks, it analyzes what's missing, uses a cheap LLM to synthesize a new Python function, tests it in a sandbox with adversarial edge cases, and if it passes, promotes it. The agent picks it up on the next run. Over time, the agent accumulates tools shaped by the actual tasks it encounters, not just what you imagined at build time.

There's a bunch of research on this idea — VOYAGER did it in Minecraft, LATM (LLMs as Tool Makers) showed LLMs can write reusable tools, CRAFT and CREATOR explored similar directions. But none of them resulted in something you can actually pip install and use with your own agent. That's what I'm trying to build.

For safety, generated code undergoes sandboxed execution, auto-generated tests, and adversarial validation before entering the active library. Everything is versioned with rollback. I don't fully trust it yet for unsupervised production use, but it's getting there.

By default, everything runs locally with SQLite. For deployment, there's a distributed mode where the agent is stateless — it reads skills from a remote store and reports trajectories to a queue. A separate worker process picks those up and runs evolution independently. So you can scale the agent without worrying about evolution blocking your hot path. I tested this end-to-end with real infra and real LLM calls.

Works with any agent that takes a task and returns a result. Native Strands adapter, raw OpenAI/Anthropic function calling works too.

This is very early — just shipped it. There's a lot to improve. Would really appreciate feedback and contributions if this is interesting to you.

Putting my stamp on a lost art: Why I still send postcards

https://www.csmonitor.com/The-Home-Forum/2026/0227/mail-USPS-art
1•Tomte•41s ago•0 comments

In This Cleveland Newsroom, AI Is Writing (But Not Reporting) the News

https://www.cjr.org/news/cleveland-newsroom-ai-rewrite-desk-chris-quinn-plain-dealer.php
1•Tomte•59s ago•0 comments

Extend or replace – how to evaluate your billing stack at AI scale

https://arnon.dk/extend-or-replace-how-to-evaluate-your-billing-stack-at-ai-scale/
1•arnon•1m ago•0 comments

Ask HN: How to Learn C++ in 2026?

1•creatorcoder•2m ago•0 comments

PulseLog – Python logger that opens a live browser dashboard (263k logs/SEC)

https://pypi.org/project/pulselog/
1•Rankush•2m ago•1 comments

OpenJarvis: Personal AI, on Personal Devices

https://scalingintelligence.stanford.edu/blogs/openjarvis/
2•jostylr•7m ago•0 comments

Show HN: Free OpenAI API Access with ChatGPT Account

https://github.com/EvanZhouDev/openai-oauth
2•EvanZhouDev•10m ago•1 comments

The Pentagon Went to War with Anthropic. What’s Really at Stake?

https://www.newyorker.com/news/annals-of-inquiry/the-pentagon-went-to-war-with-anthropic-whats-re...
1•Anon84•12m ago•0 comments

Show HN: iFrame Tester Gator

https://iframetest.com/
1•tonysurfly•16m ago•0 comments

Show HN: Graft – Your local environment, everywhere

https://graft.run
2•erdaniels•17m ago•1 comments

Canada's Bill C-22 Mandates Mass Metadata Surveillance of Canadians

https://www.parl.ca/DocumentViewer/en/45-1/bill/C-22/first-reading
2•opengrass•18m ago•0 comments

Russia's new elite hit squad was compromised by using Google Translate

https://theins.ru/en/inv/290235
1•amarcheschi•18m ago•0 comments

DriverExplorer – Windows kernel driver loader and viewer in Rust

https://github.com/orinimron123/DriverExplorer
1•orinimron123•19m ago•0 comments

I'm Too Lazy to Check Datadog Every Morning, So I Made AI Do It

https://quickchat.ai/post/automate-bug-triage-with-claude-code-and-datadog
1•piotrgrudzien•21m ago•0 comments

Turing, Gödel, and Church at Princeton in the 1930s (2012) [video]

https://www.youtube.com/watch?v=kO-8RteMwfw
2•gone35•23m ago•0 comments

Wizaskdo

https://github.com/xmonader/wizaskdo
1•aredirect•26m ago•1 comments

Show HN: Lux – Drop-in Redis replacement in Rust. 5.6x faster, ~1MB Docker image

https://github.com/lux-db/lux
3•mattyhogan•27m ago•1 comments

LessWrong Policy on LLM Use

https://www.lesswrong.com/posts/nQWavk9mnwcv6ScMR/new-lesswrong-editor-also-an-update-to-our-llm-...
3•xpe•28m ago•1 comments

It Ought to Be a Pull Door

https://elliotbonneville.com/it-really-ought-to-be-a-pull-door/
2•elliotbnvl•28m ago•0 comments

Show HN: Flutterby, an App for Flutter Developers

https://flutterby.app/
2•DavidCanHelp•29m ago•1 comments

Sewage Dump Is Now One of America's Best Bird Sanctuaries [video]

https://www.youtube.com/watch?v=gt_eVx5AX2s
1•EwanG•31m ago•0 comments

Show HN: PostSupremo – Generate authentically inauthentic LinkedIn content

https://www.postsupremo.com/
1•raphaelsoeiro•33m ago•0 comments

Show HN: HUMANTODO

https://humantodo.dev/
3•bodash•34m ago•1 comments

State Department Cuts Price of Renouncing U.S. Citizenship to $450

https://www.nytimes.com/2026/03/15/us/us-citizenship-renounce-price-cut.html
5•vinni2•37m ago•0 comments

Show HN: What Is Your Face Worth in the Modeling Industry?

https://facemaxxing.vercel.app/
1•roozka10•37m ago•0 comments

Show HN: Whspe – We decomposed TTFB to build a real hosting quality score

1•gezginweb•38m ago•0 comments

Ghost Logits: Simulating missing partition mass in sampled softmax [pdf]

https://github.com/yousef-rafat/MaximusLLM/blob/main/docs/maxis.pdf
1•yousef_g•39m ago•0 comments

The Toyota 4Runner Trailhunter's Snorkel Isn't Even a Snorkel, So Be Careful

https://www.thedrive.com/news/the-toyota-4runner-trailhunters-snorkel-isnt-even-a-snorkel-so-be-c...
4•PaulHoule•39m ago•0 comments

UK Companies House security blunder leaves director data exposed

https://www.accountingweb.co.uk/tech/tech-pulse/companies-house-security-blunder-leaves-director-...
5•mmarian•39m ago•0 comments

Demos of 2025 from the Demoscene

https://laurent.le-brun.eu/blog/the-best-demos-of-2025-from-the-demoscene
2•laurentlb•40m ago•0 comments