Outcome-Based Reinforcement Learning to Predict the Future

https://arxiv.org/abs/2505.17989
96•bturtel•1d ago

Comments

ctoth•1d ago
Do you want paperclips? Because this is how you get paperclips!

Eliminate all agents, all sources of change, all complexity - anything that could introduce unpredictability, and it suddenly becomes far easier to predict the future, no?

JoshTriplett•1d ago
> Do you want paperclips? Because this is how you get paperclips!

Don't^W worry, there are many other ways of getting paperclips, and we're doing all of them.

sitkack•1d ago
Even explaining how not to get paperclips gets you paperclips when you can invert the loss function. Paperclips for everyone!
vlovich123•1d ago
I don't know. Paperclips are awful useful. Would it be so bad to build more of them?
throwaway71271•1d ago
https://www.decisionproblem.com/paperclips/index2.html go ahead :)
Ygg2•23h ago
That's all fun and games until paperclip maximizers start looking at your blood as a source of iron.
valine•1d ago
So instead of next-token prediction, it's next-event prediction. At some point this just loops around and we're back to teaching models to predict the next token in the sequence.
lumost•1d ago
Tokens are an awfully convenient way to describe an event.
phyalow•23h ago
Tokens are just discretized state representations.
ww520•1d ago
It’s the next state. So instead of spitting out words, it will spit out a whole movie, or a sequence of world states in a game or simulation.
amelius•22h ago
Why would you use RL if you're not going to control the environment, but just predict it?
TOMDM•19h ago
Because they're training a predictor, not an agent?
jldugger•22h ago
From the abstract

> A simple trading rule turns this calibration edge into $127 of hypothetical profit versus $92 for o1 (p = 0.037).

I'm lazy: is this hypothetical shooting fish in a barrel, or is it a real edge?
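
For anyone wondering what "a simple trading rule" over calibrated forecasts could look like, here is a rough sketch. It is not the paper's rule: the `edge` threshold, the flat $1 stake, and the toy `questions` data are all assumptions made up for illustration, and it ignores fees, slippage, and liquidity.

```python
def hypothetical_profit(bets, edge=0.05, stake=1.0):
    """Sum the profit of a threshold rule on binary prediction markets.

    bets: iterable of (model_prob, market_price, outcome), outcome in {0, 1}.
    edge and stake are illustrative assumptions, not values from the paper.
    """
    profit = 0.0
    for model_prob, market_price, outcome in bets:
        if not 0.0 < market_price < 1.0:
            continue  # skip degenerate prices to avoid dividing by zero
        if model_prob - market_price > edge:
            # Model thinks YES is underpriced: buy YES shares at market_price.
            # Each share pays 1 if the event happens.
            shares = stake / market_price
            profit += shares * outcome - stake
        elif market_price - model_prob > edge:
            # Model thinks YES is overpriced: buy NO shares at 1 - market_price.
            # Each share pays 1 if the event does not happen.
            shares = stake / (1.0 - market_price)
            profit += shares * (1 - outcome) - stake
        # Otherwise the disagreement is within the threshold: no trade.
    return profit


# Toy data (made up): (model probability, market price, resolved outcome)
questions = [
    (0.80, 0.50, 1),  # buy YES, wins
    (0.20, 0.50, 0),  # buy NO, wins
    (0.52, 0.50, 1),  # inside the threshold, no trade
    (0.75, 0.50, 0),  # buy YES, loses
]

print(hypothetical_profit(questions))
```

Whether an edge like this survives real execution costs is exactly the open question; the sketch only shows how a probability gap gets turned into a bet.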

nyrikki•21h ago
Note the 'hypothetical profit' part. I know of several groups looking for opportunities to skim off LLM traders by exploiting their limited sensitivity, limited expressiveness, and loss of tail data.

Predictive AI is problematic no matter what tool you use. Great at demoware that doesn't deliver.

I am sure there are use cases, but it would be augmentation, not a reliable approach by itself.

garbagecoder•1h ago
"a couple of wavy lines"

bzzzzz "sorry this isn't your lucky day"

Show HN: I rewrote my Mac Electron app in Rust

https://desktopdocs.com/?v=2025
263•katrinarodri•3h ago•201 comments

Compiler Explorer and the promise of URLs that last forever

https://xania.org/202505/compiler-explorer-urls-forever
151•anarazel•4h ago•67 comments

Compiling a Neural Net to C for a 1,744× speedup

https://slightknack.dev/blog/difflogic/
115•todsacerdoti•3h ago•31 comments

What does "Undecidable" mean, anyway

https://buttondown.com/hillelwayne/archive/what-does-undecidable-mean-anyway/
21•BerislavLopac•1h ago•9 comments

A toy RTOS inside Super Mario Bros. using emulator save states

https://prettygoodblog.com/p/what-threads-are-part-2
8•notorious_pgb•35m ago•2 comments

Ice Cream Replaced Booze in the US Navy

https://www.oldsaltblog.com/2025/05/how-ice-cream-replaced-booze-in-the-us-navy-2/
37•speckx•2h ago•31 comments

Visualize and debug Rust programs with a new lens

https://firedbg.sea-ql.org/
59•alex_hirner•3d ago•3 comments

Deepseek R1-0528

https://huggingface.co/deepseek-ai/DeepSeek-R1-0528
158•error404x•2h ago•47 comments

Show HN: Tesseral – Open-Source Auth

https://github.com/tesseral-labs/tesseral
99•ucarion•5h ago•41 comments

LLM Codegen go Brrr – Parallelization with Git Worktrees and Tmux

https://www.skeptrune.com/posts/git-worktrees-agents-and-tmux/
54•skeptrune•5h ago•34 comments

The Blowtorch Theory: A new model for structure formation in the universe

https://theeggandtherock.com/p/the-blowtorch-theory-a-new-model
99•surprisetalk•7h ago•78 comments

Collatz's Ant and Similarity of Landscapes

https://gbragafibra.github.io/2025/05/18/collatz_ant3.html
6•Fibra•3d ago•0 comments

GoGoGrandparent (YC S16) is hiring Back end Engineers

1•davidchl•3h ago

Mathematical Fiction

https://kasmana.people.charleston.edu/MATHFICT/default.html
49•the-mitr•3d ago•11 comments

As a developer, my most important tools are a pen and a notebook

https://hamatti.org/posts/as-a-developer-my-most-important-tools-are-a-pen-and-a-notebook/
302•ingve•14h ago•212 comments

Launch HN: MindFort (YC X25) – AI agents for continuous pentesting

27•bveiseh•4h ago•11 comments

Getting a Cease and Desist from Waffle House

https://www.jack.bio/blog/wafflehouse
185•lafond•5h ago•123 comments

xAI to pay telegram $300M to integrate Grok into the chat app

https://techcrunch.com/2025/05/28/xai-to-invest-300m-in-telegram-integrate-grok-into-app/
206•freetonik•5h ago•264 comments

The mysterious Gobi wall uncovered

https://phys.org/news/2025-05-secrets-mysterious-gobi-wall-uncovered.html
61•bikenaga•5h ago•14 comments

Show HN: Loodio 2 – A Simple Rechargable Bathroom Privacy Device

https://loodio.com/
49•testmasterflex•6h ago•53 comments

De-anonymization attacks against the privacy coin XMR

https://monero.forex/is-monero-totally-private-a-comprehensive-analysis-of-de-anonymization-attacks-against-the-privacy-coin/
135•DbigCOX•6h ago•71 comments

Show HN: My LLM CLI tool can run tools now, from Python code or plugins

https://simonwillison.net/2025/May/27/llm-tools/
473•simonw•23h ago•154 comments

Square Theory

https://aaronson.org/blog/square-theory
678•aaaronson•1d ago•127 comments

Implementing complex numbers and FFT with just datatypes (2023)

https://gist.github.com/VictorTaelin/5776ede998d0039ad1cc9b12fd96811c
22•surprisetalk•3d ago•2 comments

Building interactive web pages with Guile Hoot

https://spritely.institute/news/building-interactive-web-pages-with-guile-hoot.html
39•e12e•3d ago•5 comments

Show HN: Wetlands – a lightweight Python library for managing Conda environments

https://arthursw.github.io/wetlands/0.2.0/
25•arthursw•5h ago•36 comments

Show HN: AutoThink – Boosts local LLM performance with adaptive reasoning

373•codelion•18h ago•58 comments

A thought on JavaScript "proof of work" anti-scraper systems

https://utcc.utoronto.ca/~cks/space/blog/web/JavaScriptScraperObstacles
147•zdw•2d ago•174 comments

Homo erectus from the seabed, new archaeological discoveries in Indonesia

https://www.universiteitleiden.nl/en/news/2025/05/homo-erectus-from-the-seabed-new-archaeological-discoveries-in-indonesia
31•palmfacehn•2d ago•6 comments

The Level Design Book

https://book.leveldesignbook.com
284•keiferski•3d ago•55 comments