Show HN: Paper Lantern – improving Autoresearch with research knowledge

https://www.paperlantern.ai/code
2•paperlantern•1h ago
Hi, we've been working on Paper Lantern, an MCP server that searches 2M+ CS research papers for coding agents. The coding agent describes its problem and Paper Lantern returns ranked techniques with implementation steps, hyperparameters, and failure modes.

We tested it on Karpathy's autoresearch framework, where the task is to find better LLM architectures and training configs. In autoresearch, the agent proposes an optimization, tries a 5-min training run, calculates the val loss, and then keeps the change if the val loss dropped or discards it if the val loss rose.
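
The keep / discard loop described above can be sketched roughly as follows. This is a toy illustration, not the autoresearch code: `keep_or_discard_loop`, `toy_val_loss`, and `make_toy_proposer` are hypothetical names, and the "training run" is replaced by a cheap made-up loss function so the sketch runs in milliseconds.

```python
import random
from typing import Callable, Dict, Tuple

Config = Dict[str, float]


def keep_or_discard_loop(
    base: Config,
    propose: Callable[[Config], Config],
    evaluate: Callable[[Config], float],
    steps: int,
) -> Tuple[Config, float]:
    """Greedy search: keep a proposed change only if it lowers the val loss."""
    best, best_loss = dict(base), evaluate(base)
    for _ in range(steps):
        candidate = propose(best)       # the agent proposes an optimization
        loss = evaluate(candidate)      # stand-in for the short training run
        if loss < best_loss:            # keep if val loss dropped
            best, best_loss = candidate, loss
        # otherwise discard the candidate and continue from the current best
    return best, best_loss


def toy_val_loss(cfg: Config) -> float:
    """Hypothetical loss with a minimum near lr = 0.01 (assumption, not real data)."""
    return (cfg["lr"] - 0.01) ** 2


def make_toy_proposer(seed: int) -> Callable[[Config], Config]:
    """Hypothetical proposer that randomly halves or doubles the learning rate."""
    rng = random.Random(seed)

    def propose(cfg: Config) -> Config:
        new = dict(cfg)
        new["lr"] = cfg["lr"] * rng.choice([0.5, 2.0])
        return new

    return propose


start = {"lr": 0.08}
best, best_loss = keep_or_discard_loop(start, make_toy_proposer(0), toy_val_loss, steps=20)
```

Because a change is only kept when the loss drops, the final loss can never be worse than the starting one. Since each change is judged on its own, a pair of changes that only helps when applied together can be missed.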

We compared a strong baseline agent (Opus 4.6 + web search) vs that same agent + Paper Lantern.

  - agent + Paper Lantern iterated to a config that got a much lower val loss on 5-min runs  

  - we trained the two final configs for 2 hours: the config from Paper Lantern got a 3.2% lower val loss

Two concrete examples:

  1. Both agents tried halving the batch size. The paper-access agent pulled a 2022 paper and scaled the learning rate by 1/sqrt(2) as the paper prescribed. It worked, and further halving kept working. The web-search agent made the same batch change, got worse loss, and moved on without diagnosing the LR.  

  2. The with-paper-lantern agent also implemented AdaGC (adaptive gradient clipping, arXiv 2502.11034, published Feb 2025) on the first try with no tuning. The baseline agent did not try it at all.
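
For the first example, the 1/sqrt(2) adjustment corresponds to what is commonly called square-root learning-rate scaling. A minimal sketch, assuming the learning rate is scaled by the square root of the batch-size ratio; the function name `scale_lr_sqrt` and the base values (`1e-3`, batch size 64) are made up for illustration and are not taken from the experiment:

```python
import math


def scale_lr_sqrt(base_lr: float, base_batch: int, new_batch: int) -> float:
    """Scale the learning rate by sqrt(new_batch / base_batch)."""
    if base_batch <= 0 or new_batch <= 0:
        raise ValueError("batch sizes must be positive")
    return base_lr * math.sqrt(new_batch / base_batch)


# Hypothetical starting point: lr = 1e-3 at batch size 64.
lr_half = scale_lr_sqrt(1e-3, base_batch=64, new_batch=32)     # one halving: lr * 1/sqrt(2)
lr_quarter = scale_lr_sqrt(1e-3, base_batch=64, new_batch=16)  # two halvings: lr * 1/2
```

Each further halving of the batch size multiplies the learning rate by another factor of 1/sqrt(2) under this rule.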

If you want to deep-dive:

  - (code) https://github.com/paperlantern-ai/autoresearch-experiment

  - (blog) https://www.paperlantern.ai/blog/autoresearch

If you want to try Paper Lantern yourself:

  - Quick setup: `npx paperlantern@latest`

Comments

parima08•1h ago
That's an impressive jump in performance by providing the agent with access to relevant literature.

Is there a breakdown of which wins came from hyperparameter values (where BO would likely match this) vs. wins from techniques the agent wouldn’t have tried without the paper?

paperlantern•1h ago
yes - the blog post has a figure showing all the improvements and how big they were.

also, sometimes the baseline agent tries the same idea but doesn't get as big a boost as the baseline + Paper Lantern agent. We studied it and found the reason: the baseline tries changes in isolation, whereas the research-backed ideas account for the interactions between parameters and suggest multiple changes at the same time, a combination the baseline agent never discovers.
