frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Some Simple Economics of AGI

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6298838
23•reasonableklout•2h ago

Comments

lowsong•1h ago
> we thank ChatGPT, Claude, Gemini, and Grok for tirelessly traversing the combinatorial space of this manuscript.

This is a premise so fanciful this might as well start with "assume a goose that lays golden eggs" that's then been extrapolated into 100+ pages of AI generated text.

delusional•1h ago
Lead author is a crypto guy. Having written any paper on crypto, especially as an economist, is negatively correlated with my interest in reading your arguments.
BobbyTables2•1h ago
Articles like this feel overly optimistic.

It would be ludicrous to assert that YouTube video viewing compresses the learning time, allowing one to become an Olympic coach in a sport without ever having played it.

Yet, it the article claims that humans can merely define intent and supervise the AI’s output - the same humans that used the AI to avoid the tedium and time commitment required to obtain mastery from actual work.

Perhaps I’m sorely mistaken. All those coworkers who went up the management chain after 2-3 years experience are the foremost technical experts in their field. They merely press a single button like George Jetson and the AI will figure out all the details!

slicendice•1h ago
18 emdashes in just the first 5 paragraphs is crazy work. Might as well just ask claude myself.
cadamsdotcom•1h ago
> For companies, the core strategic insight is that verification is no longer a mere compliance function, but a primary production technology—and increasingly, their most defensible one.

Sure, but the real wins are when the “intent-execution-verification sandwich” is done to the verification layer.

Have the agent produce its own verifiers according to a human supplied spec, then let the agent install it in its own workflow so it can later use it to check its own work.

Software development is a highly repeatable process as long as the agent is building - and verifying! - its own verification suite as it goes. By producing its own tooling to let itself define and codify “working software” and to later use that ever-growing verification suite to ask and get answers to that question, no human is needed for working software to be (eventually, after potentially lots of self-iteration) produced.

Test-driven development works for this.

It’s massive leverage - humans define high-level what the verification suite should contain (ie. plan mode), and the suite is built by the agent then used by that same agent to check its own work. Work that fails the suite’s checks can be fixed by the agent and re-checked as many times as needed before it is allowed to declare itself done. It’s just linting on epic steroids.

For non software work it is possible to get a similar result if hypothetically you can build a human-curated library of examples of “good work” and “bad work”, then have the agent build a verification suite using those.

For example, let’s say you want a blogging agent that never gives away its agentness.

You could prompt for a blog post and get something you can bang into shape by yelling at your agent over and over, or you could automate the part where you yell at the agent over and over!

Here’s how:

Gather a set of shitty AI prose that you hate seeing, and have an agent write a script that receives prose as input and will error out and reject if said prose containing those AI-isms. The prose-checker script needs to spit out the problem line numbers & offsets. That’s the most important thing as it will help any agent using this script - if it has a line number or an offset, it knows where to make targeted edits to improve its first draft.

When you provide that script to your blog post generating agent and tell it to refuse to stop and continue making improvements to its work until the prose-checker script does NOT error out, it can iterate as many times as it needs until the checker is satisfied. Only then is it going to present you its work. Voila! No more AI-isms! (At least, not the ones your checker disallowed)

You can do the same in a positive direction too - for example you could automate making prose sound more like you wrote it yourself.

Have your agent build you a script that invokes an LLM which, given samples of your writing style, can “grade” the prose based on closeness to yours. The grader needs to give feedback to the invoking agent to help it edge its output closer to the goal.

It’s powerful to think of AI as a tool-building tool that can use tools!

Both scripts are like 10-20 lines of code; the first wraps a regex and the second wraps `claude -p`. Or they could be one script. The point is, you’re no longer going to tell the original agent what to do - you’re giving it a prompt and a suite of validations - tools that let it check if it’s reached the goal you have for it, and which if it hasn’t, give guidance on how to inch closer.

Your job as a software engineer DID get taken - by you, the software verification specification engineer! And the agents DO write over 90% of the code. Because why would you code by typing when you can have so much more leverage using coding agents to codify your goal.

zarzavat•45m ago
This article is about current level AI, not future AGI. The common definition of AGI is intelligence that matches a human's, so there would be no need for a human to supervise every action of the AGI any more than a CEO supervises every action of their employees since by definition the human supervisor would do no better than an AGI performing the same role.

GLM 5.2 beats Claude in our benchmarks

https://semgrep.dev/blog/2026/we-have-mythos-at-home-glm-52-beats-claude-in-our-cyber-benchmarks/
702•jms703•12h ago•335 comments

HackerRank open sourced its ATS. My resume scored 90/100. Oh wait 74. No – 88

https://danunparsed.com/p/hackerrank-open-source-ats
228•sambellll•5h ago•52 comments

Age verification is just a precursor to automated attribution of speech

https://nonogra.ph/age-verification-is-just-a-precursor-to-attribution-of-speech-06-29-2026
277•arkhiver•3h ago•146 comments

Herdr: Agent multiplexer that lives in your terminal

https://github.com/ogulcancelik/herdr
30•mzehrer•2h ago•10 comments

Lore – give your coding agent the decisions your team made

https://github.com/itsthelore/rac-core
15•tcballard•2h ago•8 comments

Dissecting Apple's Sparse Image Format (ASIF)

https://schamper.dev/dissecting-apples-sparse-image-format-asif/
23•supermatou•14h ago•0 comments

Historical memory prices 1960-2026

https://dam.stanford.edu/memory-prices.html
267•vga1•12h ago•94 comments

5k menus from the New York Public Library’s Buttolph Collection (1880-1920)

https://pudding.cool/2026/06/menu-story/
354•xbryanx•16h ago•90 comments

I used Claude Code to get a second opinion on my MRI

https://antoine.fi/mri-analysis-using-claude-code-opus
402•engmarketer•14h ago•522 comments

Knowledge Distillation of Black-Box Large Language Models (2024)

https://arxiv.org/abs/2401.07013
80•babelfish•8h ago•13 comments

Deciphering Basmala

https://blog.plover.com/lang/bismillah.html
41•lordgrenville•4d ago•15 comments

Show HN: Zanagrams

https://zanagrams.com/
252•pompomsheep•15h ago•58 comments

TOP500 at ISC’26: We have a New Number 1 Supercomputer

https://chipsandcheese.com/p/top500-at-isc26-we-have-a-new-number
95•rbanffy•11h ago•57 comments

Tokenmaxxing is dead, long live tokenmaxxing

https://12gramsofcarbon.com/p/agentics-tech-things-tokenmaxxing
138•theahura•14h ago•166 comments

The Boeing 747 begins its final descent

https://www.theatlantic.com/magazine/2026/07/boeing-747-retirement/687304/
175•dbl000•3d ago•245 comments

The Baffling World of Masayoshi Son's Presentations (2020)

https://www.bloomberg.com/news/features/2020-06-23/golden-geese-and-unicorns-inside-the-eccentric...
50•phaser•2d ago•16 comments

Model Training as Code

https://aleph-alpha.com/en/blog/model-training-as-code/
52•peterBlue75•3d ago•10 comments

The Forgotten Castles of the Garamantes

https://www.wildmanlife.com/the-forgotten-castles-of-the-garamantes/
19•bookofjoe•4d ago•1 comments

Professor denounces mass AI fraud on an exam at Brown

https://english.elpais.com/education/2026-06-28/ai-fraud-at-brown-university-academic-integrity-i...
363•geox•14h ago•477 comments

The KIDS Act would require age checks to get online

https://www.eff.org/deeplinks/2026/06/kids-act-would-require-age-checks-get-online
423•bilsbie•18h ago•337 comments

Working around dragons with the Lemote Yeeloong laptop and OpenBSD

http://oldvcr.blogspot.com/2026/06/working-around-dragons-with-lemote.html
105•zdw•13h ago•29 comments

Librepods: AirPods liberated

https://github.com/librepods-org/librepods
345•rbanffy•12h ago•117 comments

Idler Magazine

https://www.idler.co.uk/
14•tomjakubowski•3d ago•2 comments

Daisugi, the Japanese technique of growing trees out of other trees (2020)

https://www.openculture.com/2020/10/daisugi.html
129•MaysonL•14h ago•41 comments

A way to exclude sensitive files issue still open for OpenAI Codex

https://github.com/openai/codex/issues/2847
195•pikseladam•18h ago•129 comments

Examining circuit boards from the Space Shuttle's I/O Processor

https://www.righto.com/2026/06/space-shuttle-io-processor-boards.html
100•pwg•14h ago•23 comments

A glitch in February of the year 0

https://28times.com/blog/2026-06-26-february-of-the-year-0
14•lukasgelbmann•2d ago•8 comments

More evidence is consistent with possible ancient life on Mars (2025)

https://www.cbc.ca/radio/quirks/more-evidence-of-life-on-mars-but-still-no-life-1.7649645
78•pseudolus•18h ago•77 comments

The curious case of the disappearing Polish S (2015)

https://aresluna.org/the-curious-case-of-the-disappearing-polish-s/
229•colinprince•18h ago•76 comments

Show HN: Bash4LLM+ – A lightweight, dependency-free Bash wrapper for LLM APIs

https://github.com/kamaludu/bash4llm/
46•kamaludu•11h ago•16 comments