Simple Meta-Harness on Islo.dev

https://zozo123.github.io/meta-harness-on-islo-page/
25•zozo123-IB•1h ago

Comments

love2read•1h ago
I have no idea what this does or is. I really wish they could have given a better description of why this is useful.
cyanydeez•56m ago
I find it fascinating, all these attempts are goldmining LLMs with a harness and it's clear they're generating all the docs for AI to read and use, even the docs say "we made an MCP for this!" like somehow within 2 years people no longer make choices and it's just like AIs roaming the internet trying on harnesses, etc; certainly that'd be a fascinating reality but the verbosity really is an eye-glazing experience. Who do they expect to read all of that ad copy? It's not me.
antiobli•54m ago
Their lines "A meta-harness is the loop that improves the harness automatically" and "the bottleneck is diagnostic context: most optimizers compress prior runs into summary statistics, while meta-harness gives the proposer up to 10M tokens of raw execution traces to grep through," seem good, no?

Have to dig into the code, but it looks like they have sound engineering around a "self-improving" agentic coding harness. Will be fun to take the code for a spin.

kingstnap•33m ago
10M tokens of raw execution traces to grep through is slop. The tasks are fizzbuzz, palindrome, list reversal, and sum-even. The palindrome challenge is literally this:

> Is the word "racecar" a palindrome? Answer with exactly one lowercase word: "yes" or "no". Print only the answer.

cyanydeez•59m ago
serious question: I've already got an opencode harness running on a local model. It's easily installable via the insecure bash command. It's already tailored with a couple of plugins and with a proper TODO.md and planning, I can get it to loop fine with proper attention to its pratfalls on vague/non-determinant language. It's all running on an AMD 395+ Qwen3-Coder-Next model with ~256k context. opencode has a webui I can put behind a password protected endpoint and keep it busy from anywhere I need to via a simple nginx proxy.

How does this go above and beyond this straightforward open source, open weights and relatively cheap setup? Do you just get more tokens from SOTA models? Can anyone rationally say the products of token production are quality and secure?

pohl•39m ago
You know how OpenCode can be prompted to modify itself when you want to improve it in some way? This just automates that kind of thing.
cyanydeez•22m ago
It can't actually; I had to create a systemd service that watched the config path and sent a signal to reload the files. It roughly works, but it doesn't actually do the loop correctly.

However, the problem with self-modification is the tendency towards inoperable states. Does it automatically revert when a detrimental state is reached? How does it determine that a modification worked?
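To make the revert question concrete, here is a minimal sketch of a "keep it only if it scores better" loop. It is not taken from the linked project; `improve_with_revert`, `propose`, `score` and the `temperature` knob are hypothetical names, and the assumption is that a broken harness shows up as an exception during scoring.

```python
from typing import Callable, Dict, Tuple

Config = Dict[str, float]


def improve_with_revert(
    config: Config,
    propose: Callable[[Config], Config],  # hypothetical: returns a modified copy
    score: Callable[[Config], float],     # hypothetical: runs the eval tasks
    rounds: int,
) -> Tuple[Config, float]:
    """Accept a modification only if it scores strictly better than the last good config."""
    best = dict(config)
    best_score = score(best)
    for _ in range(rounds):
        candidate = propose(best)
        try:
            candidate_score = score(candidate)
        except Exception:
            # Inoperable state: drop the candidate, keep the last known-good config.
            continue
        if candidate_score > best_score:
            best, best_score = candidate, candidate_score
        # Otherwise the candidate is discarded, which is the automatic revert.
    return best, best_score


def toy_score(cfg: Config) -> float:
    """Toy stand-in for a real eval: negative temperature means the harness is broken."""
    if cfg["temperature"] < 0:
        raise ValueError("harness failed to start")
    return -abs(cfg["temperature"] - 0.3)
```

Under these assumptions "did the modification work" reduces to "did the eval score go up", so the loop is only as trustworthy as the eval tasks behind `score`.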

pohl•16m ago
The paper shows that it can. Take note this seems to be someone’s experiment. If it’s not working for you that’s probably because it’s not a polished product.
m3kw9•59m ago
This seems to be another over-optimization for AI that many are trying to get into. The LLMs improve, and your setup is deprecated, you wasted time optimizing for a slight edge. TL;DR: You trade time for slight edge.
mccoyb•26m ago
It has now become fashionable to dress oneself in the garb of science to sell dev environments ... for agents.

It has now become fashionable to claim much, and furnish little.

It has now become fashionable to fail to understand or state the core of your proposal in as few words as possible: instead of "genetic algorithm applied to the space of harnesses, parallelized by our infrastructure" we get "Three swaps. Same orchestrator. Same dashboard. The wiring is the thing."

We're cooked chat.
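For reference, "genetic algorithm applied to the space of harnesses" can be sketched in a few lines. This is a toy under stated assumptions, not the project's code: the knobs in `SPACE` and the `fitness` callable are hypothetical, and a real fitness call would mean running the agent on eval tasks.

```python
import random
from typing import Callable, Dict, List

Harness = Dict[str, object]

# Hypothetical harness knobs, invented for illustration only.
SPACE: Dict[str, List[object]] = {
    "planner": ["none", "todo_file", "judge_review"],
    "max_retries": [0, 1, 3],
    "context_strategy": ["summary", "raw_traces"],
}


def random_harness(rng: random.Random) -> Harness:
    return {knob: rng.choice(options) for knob, options in SPACE.items()}


def crossover(a: Harness, b: Harness, rng: random.Random) -> Harness:
    # Each knob is inherited from one of the two parents.
    return {knob: rng.choice([a[knob], b[knob]]) for knob in SPACE}


def mutate(h: Harness, rng: random.Random, rate: float = 0.2) -> Harness:
    # Each knob is resampled with probability `rate`.
    return {k: (rng.choice(SPACE[k]) if rng.random() < rate else v) for k, v in h.items()}


def evolve(
    fitness: Callable[[Harness], float],
    rng: random.Random,
    population: int = 8,
    generations: int = 10,
    keep: int = 2,  # elites carried over unchanged; must be at least 2
) -> Harness:
    pop = [random_harness(rng) for _ in range(population)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        elite = pop[:keep]
        children = []
        for _ in range(population - keep):
            a, b = rng.sample(elite, 2)
            children.append(mutate(crossover(a, b, rng), rng))
        pop = elite + children
    return max(pop, key=fitness)
```

Each `fitness` call within a generation is independent of the others, which is the part that can be farmed out to parallel sandboxes.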

adamgold7•20m ago
we need better RL
vmg12•21m ago
This is not how I've seen the term meta-harness be used. The common usage I've seen has been for a meta-harness to be a wrapper around an existing agent to give that agent a new ui or abilities.
visarga•12m ago
I did this too, ablating all the components in my coding agent harness. The insight from my meta-optimization loops was "have judge agents review the plan and implementation".

One of my own insights here is that you need to collect not just execution traces, but all the human-in-the-loop nudges and steering commands. They are one of the purest sources of feedback on coding agents when seen in context.

I agree with OP on the need to collect traces and compare them, not just scores. It is a much richer source of feedback.

If anyone is interested I have a slide deck about my approach: https://docs.google.com/presentation/d/1uSvMPInwIMxCNiN3GQBO...

The best is over: The fun has been optimized out of the Internet

https://muddy.jprs.me/posts/2026-05-03-the-best-is-over/
164•jprs•1h ago•104 comments

AI didn't delete your database, you did

https://idiallo.com/blog/ai-didnt-delete-your-database-you-did
228•Brajeshwar•1h ago•106 comments

iOS 27 is adding a 'Create a Pass' button to Apple Wallet

https://walletwallet.alen.ro/blog/ios-27-wallet-create-pass/
216•alentodorov•3h ago•174 comments

Async Rust never left the MVP state

https://tweedegolf.nl/en/blog/237/async-rust-never-left-the-mvp-state
335•pjmlp•8h ago•171 comments

Should I Run Plain Docker Compose in Production in 2026?

https://distr.sh/blog/running-docker-in-production/
202•pmig•5d ago•166 comments

Agents for Financial Services and Insurance

https://www.anthropic.com/news/finance-agents
6•louiereederson•37m ago•0 comments

AI Product Graveyard

https://tooldirectory.ai/ai-graveyard
157•StriverGuy•2h ago•69 comments

Docker 29 has changed its default image store for new installs

https://docs.docker.com/engine/storage/containerd
47•neitsab•3d ago•30 comments

When everyone has AI and the company still learns nothing

https://www.robert-glaser.de/when-everyone-has-ai-and-the-company-still-learns-nothing/
152•youngbrioche•6h ago•94 comments

Bun is being ported from Zig to Rust

https://github.com/oven-sh/bun/commit/46d3bc29f270fa881dd5730ef1549e88407701a5
662•SergeAx•14h ago•472 comments

Three Inverse Laws of AI

https://susam.net/inverse-laws-of-robotics.html
4•blenderob•15m ago•0 comments

Empty Screenings – Finds AMC movie screenings with few or no tickets sold

https://walzr.com/empty-screenings
246•MrBuddyCasino•11h ago•208 comments

Show HN: A Mutating Webhook to automatically strip PII from K8s logs

https://github.com/aragossa/pii-shield
3•aragoss•33m ago•1 comments

The first photo published in a newspaper

https://phsne.org/the-first-photograph-published-in-a-newspaper-1848/
16•geuis•2d ago•2 comments

Google Chrome silently installs a 4 GB AI model on your device without consent

https://www.thatprivacyguy.com/blog/chrome-silent-nano-install/
679•john-doe•8h ago•520 comments

Lessons for Agentic Coding: What should we do when code is cheap?

https://www.dbreunig.com/2026/05/04/10-lessons-for-agentic-coding.html
156•ingve•8h ago•156 comments

Hand Drawn QR Codes (2025)

https://sethmlarson.dev/hand-drawn-qr-codes
173•jollyjerry•11h ago•32 comments

Comparing the Z80 and 6502 to Their Relatives

https://bumbershootsoft.wordpress.com/2026/05/02/comparing-the-z80-and-6502-to-their-relatives/
27•ibobev•2d ago•0 comments

It's official: Utah is the U.S. state closest to banning VPNs

https://tech.yahoo.com/vpn/article/its-official-utah-is-the-us-state-closest-to-banning-vpns-1535...
26•giantg2•41m ago•14 comments

sRGB profile comparison

https://ninedegreesbelow.com/photography/srgb-profile-comparison.html
38•Retr0id•3d ago•9 comments

How OpenAI delivers low-latency voice AI at scale

https://openai.com/index/delivering-low-latency-voice-ai-at-scale/
457•Sean-Der•20h ago•136 comments

Show HN: I built a new word game, Wordtrak

https://wordtrak.com/blog/2026-05-05-I-built-a-new-word-game
33•qrush•3h ago•16 comments

CVE-2026-31431: Copy Fail vs. rootless containers

https://www.dragonsreach.it/2026/05/04/cve-2026-31431-copy-fail-rootless-containers/
155•averi•12h ago•85 comments

Farewell to a Giant of Botany

https://nautil.us/farewell-to-a-giant-of-botany-1280409
70•Brajeshwar•2d ago•5 comments

Train Your Own LLM from Scratch

https://github.com/angelos-p/llm-from-scratch
370•kristianpaul•11h ago•43 comments

Agent Skills

https://addyosmani.com/blog/agent-skills/
330•BOOSTERHIDROGEN•18h ago•162 comments

Mouse Pointer as a Mere Mortal

https://unsung.aresluna.org/mouse-pointer-as-a-mere-mortal/
67•zdw•2d ago•27 comments

The Frog for Whom the Bell Tolls

https://sethmlarson.dev/the-frog-for-whom-the-bell-tolls
34•anujbans•8h ago•13 comments

Does Employment Slow Cognitive Decline? Evidence from Labor Market Shocks

https://www.nber.org/papers/w35117
327•littlexsparkee•1d ago•341 comments