frontpage.

I got the highest score on ARC-AGI again swapping Python for English

https://jeremyberman.substack.com/p/how-i-got-the-highest-score-on-arc-agi-again
44•freediver•5h ago

Comments

pilooch•1h ago
Congrats, this solution resembles AlphaEvolve. Text serves as the high-level search space, and genetic mixing (MAP-Elites in AE) merges attempts at lower levels.
doctorpangloss•1h ago
You would be interested in DSPy.
Davidzheng•34m ago
Actually really promising stuff. I think a lot of the recent advances in the last 6mo - 1yr are in the outer loop (e.g. the Google Deep Think model which got IMO gold and the OAI IMO gold model both use substantive outer-loop search strategies [though it's unclear what these are], maybe to parallelize some generation/verification process). So there's no reason why we can't have huge advances in this area even outside of the industry labs, in my view (I'm uninformed in general, so take this comment with a large grain of salt).
modeless•31m ago
I've been testing LLMs on Sokoban-like puzzles (in the style of ARC-AGI-3) and they are completely awful at them. It really highlights how poor their memory is. They can't remember abstract concepts or rules between steps, even if they discover them themselves. They can only be presented with text describing such things which they have to re-read and re-interpret at every step.

LLMs are completely helpless on agentic tasks without a ton of scaffolding. But the scaffolding is inflexible and brittle, unlike the models themselves. Whoever figures out how to reproduce the functions of this type of scaffolding within the models, with some kind of internal test-time-learned memory mechanism, is going to win.

M4v3R•19m ago
I wonder if scaffolding synthesis is the way to go. Namely, the LLM itself first reasons about the problem and creates scaffolding for a second agent that will do the actual solving, all inside a feedback loop that adjusts the scaffolding based on results.
modeless•16m ago
In general I think the more of the scaffolding that can be folded into the model, the better. The model should learn problem solving strategies like this and be able to manage them internally.

GNU Midnight Commander

https://midnight-commander.org/
140•pykello•2h ago•80 comments

Notion API importer, with Databases to Bases conversion bounty

https://github.com/obsidianmd/obsidian-importer/issues/421
53•twapi•1h ago•5 comments

The Asus Gaming Laptop ACPI Firmware Bug: A Deep Technical Investigation

https://github.com/Zephkek/Asus-ROG-Aml-Deep-Dive
99•signa11•2h ago•36 comments

Shai-Hulud malware attack: Tinycolor and over 40 NPM packages compromised

https://socket.dev/blog/ongoing-supply-chain-attack-targets-crowdstrike-npm-packages
980•jamesberthoty•19h ago•779 comments

I just want an 80×25 console, but that's no longer possible

https://changelog.complete.org/archives/10881-i-just-want-an-80x25-console-but-thats-no-longer-po...
35•teddyh•2h ago•30 comments

Murex – An intuitive and content aware shell for a modern command line

https://murex.rocks/
9•modinfo•21m ago•0 comments

Things you can do with a Software Defined Radio (2024)

https://blinry.org/50-things-with-sdr/
738•mihau•16h ago•124 comments

How to make the Framework Desktop run even quieter

https://noctua.at/en/how-to-make-the-framework-desktop-run-even-quieter
255•lwhsiao•12h ago•73 comments

Doom crash after 2.5 years of real-world runtime confirmed on real hardware

https://lenowo.org/viewtopic.php?t=31
173•minki_the_avali•9h ago•58 comments

In Praise of Idleness (1932)

https://harpers.org/archive/1932/10/in-praise-of-idleness/
17•awanderingmind•49m ago•0 comments

Denmark close to wiping out cancer-causing HPV strains after vaccine roll-out

https://www.gavi.org/vaccineswork/denmark-close-wiping-out-leading-cancer-causing-hpv-strains-aft...
667•slu•12h ago•266 comments

About the security content of iOS 15.8.5 and iPadOS 15.8.5

https://support.apple.com/en-us/125142
291•jerlam•6h ago•118 comments

I got the highest score on ARC-AGI again swapping Python for English

https://jeremyberman.substack.com/p/how-i-got-the-highest-score-on-arc-agi-again
44•freediver•5h ago•6 comments

A dumb introduction to z3

https://asibahi.github.io/thoughts/a-gentle-introduction-to-z3/
174•kfl•1d ago•18 comments

Tuberculosis shaped Victorian fashion (2016)

https://www.smithsonianmag.com/science-nature/how-tuberculosis-shaped-victorian-fashion-180959029/
10•franze•1d ago•1 comment

AMD Open Source Driver for Vulkan project is discontinued

https://github.com/GPUOpen-Drivers/AMDVLK/discussions/416
45•haunter•6h ago•5 comments

Irssi: IRC Client in a Docker Image

https://hub.docker.com/_/irssi
37•razodactyl•5h ago•29 comments

CubeSats are fascinating learning tools for space

https://www.jeffgeerling.com/blog/2025/cubesats-are-fascinating-learning-tools-space
38•calcifer•3d ago•3 comments

Waymo has received our pilot permit allowing for commercial operations at SFO

https://waymo.com/blog/#short-all-systems-go-at-sfo-waymo-has-received-our-pilot-permit
624•ChrisArchitect•14h ago•607 comments

Show HN: A PSX/DOS style 3D game written in Rust with a custom software renderer

https://totenarctanz.itch.io/a-scavenging-trip
33•mvx64•4h ago•2 comments

I built my own phone because innovation is sad rn [video]

https://www.youtube.com/watch?v=qy_9w_c2ub0
228•Timothee•2d ago•45 comments

Normal-order syntax-rules and proving the fix-point of call/cc

https://okmij.org/ftp/Scheme/callcc-calc-page.html
5•Bogdanp•3d ago•0 comments

Slow social media

https://herman.bearblog.dev/slow-social-media/
59•rishikeshs•4h ago•43 comments

Bertrand Russell to Oswald Mosley (1962)

https://lettersofnote.com/2016/02/02/every-ounce-of-my-energy/
194•giraffe_lady•14h ago•94 comments

In Defense of C++

https://dayvster.com/blog/in-defense-of-cpp/
116•todsacerdoti•11h ago•190 comments

Meta RayBan AR glasses shows Lumus waveguide structures in leaked video

https://kguttag.com/2025/09/16/meta-rayban-ar-glasses-shows-lumus-waveguide-structures-in-leaked-...
79•speckx•12h ago•79 comments

Should we drain the Everglades?

https://rabbitcavern.substack.com/p/should-we-drain-the-everglades
81•ksymph•11h ago•81 comments

How Container Filesystem Works: Building a Docker-Like Container from Scratch

https://labs.iximiuz.com/tutorials/container-filesystem-from-scratch
138•lgunsch•3d ago•25 comments

Launch HN: Rowboat (YC S24) – Open-source IDE for multi-agent systems

https://github.com/rowboatlabs/rowboat
57•segmenta•13h ago•26 comments

Wait4X allows you to wait for a port or a service to enter the requested state

https://github.com/wait4x/wait4x
31•atkrad•3d ago•7 comments