Show HN: Paper Lantern – on-demand techniques from 2M+ papers for coding agents

https://www.paperlantern.ai/code
2•paperlantern•1h ago
Paper Lantern (PL) is an MCP server that lets coding agents ask for personalized techniques and ideas from 2M+ CS research papers. Your coding agent tells PL what problem it is working on, PL finds the most relevant ideas from 100+ research papers, and PL returns them to your coding agent along with trade-offs and implementation instructions.

We had previously shown that this helps research work, and we wanted to understand whether it also helps everyday software engineering tasks. We built 9 tasks to measure this and compared a coding agent alone (Opus 4.6, the baseline) against the same coding agent with Paper Lantern access.

(Blog post with full breakdown: https://www.paperlantern.ai/blog/coding-agent-benchmarks)

Some interesting results:

1. We asked the agent to write tests that maximize mutation score (the fraction of injected bugs caught). The baseline caught 63% of injected bugs. Baseline + Paper Lantern found mutation-aware prompting from recent research (MuTAP, Aug 2023; MUTGEN, Jun 2025), which suggested enumerating every possible mutation via AST analysis and then writing tests to target each one. This caught 87%.

2. We asked the agent to extract legal clauses from 50 contracts. The baseline sent the full document to the LLM and correctly extracted 44% of clauses. Baseline + Paper Lantern found two papers from March 2026 (BEAVER for section-level relevance scoring, PAVE for post-extraction validation). Accuracy jumped to 76%.
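To make the first result more concrete, here is a minimal sketch of what "enumerating every possible mutation via AST analysis" could look like. It is an illustration under stated assumptions, not the code from MuTAP, MUTGEN, or the challenge repo: the operator-swap tables, `enumerate_mutations`, `mutation_score`, and the `clamp` example are all hypothetical, and only Python's standard `ast` module is used.

```python
import ast

# Hypothetical operator swaps for illustration; real mutation tools
# use much larger operator sets.
BINOP_SWAPS = {ast.Add: ast.Sub, ast.Sub: ast.Add,
               ast.Mult: ast.Div, ast.Div: ast.Mult}
CMP_SWAPS = {ast.Lt: ast.GtE, ast.GtE: ast.Lt,
             ast.Gt: ast.LtE, ast.LtE: ast.Gt,
             ast.Eq: ast.NotEq, ast.NotEq: ast.Eq}


def enumerate_mutations(source: str) -> list[dict]:
    """List every operator-swap mutation site found in `source`."""
    sites = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.BinOp) and type(node.op) in BINOP_SWAPS:
            ops = [node.op]
            swaps = BINOP_SWAPS
        elif isinstance(node, ast.Compare):
            ops = [op for op in node.ops if type(op) in CMP_SWAPS]
            swaps = CMP_SWAPS
        else:
            continue
        for op in ops:
            sites.append({
                "line": node.lineno,
                "original": type(op).__name__,
                "mutant": swaps[type(op)].__name__,
            })
    return sites


def mutation_score(killed: int, total: int) -> float:
    """Fraction of injected mutants that the test suite caught."""
    return killed / total if total else 0.0


# Hypothetical function under test.
EXAMPLE = """
def clamp(x, lo, hi):
    if x < lo:
        return lo
    if x > hi:
        return hi
    return x + 0
"""

if __name__ == "__main__":
    # Each printed site is one mutant a targeted test should kill.
    for site in sorted(enumerate_mutations(EXAMPLE), key=lambda s: s["line"]):
        print(site)
```

Each site in the list could then be handed to the agent as an explicit target ("write a test that fails if `<` on line 3 becomes `>=`"), and `mutation_score` gives the fraction of those mutants the resulting tests catch.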

Five of the nine tasks improved by 30-80%. The difference was technique selection. 10 of the 15 most-cited papers across all experiments were published in 2025 or later.

Everything is open source: https://github.com/paperlantern-ai/paper-lantern-challenges

Each experiment has its own README with detailed results and an approach.md showing exactly what Paper Lantern surfaced and how the agent used it.

Quick setup: `npx paperlantern@latest`

Comments

vunderba•51m ago
Nice job. I put together a similar system a while back - it's just a self-contained Go binary called paper-search along with an accompanying LLM skill to facilitate search, retrieval, and downloading of relevant academic papers using OpenAlex, Semantic Scholar, and arXiv.

In my experience it's been a better solution than just asking the LLM to search the web directly for this kind of information via search engine tooling.

Also just FYI the link provided in your Show HN (https://github.com/paper-lantern-ai/paper-lantern-challenges) is a 404. I think it should be:

https://github.com/paperlantern-ai/paper-lantern-challenges

paperlantern•12m ago
yes, definitely. i have noticed the same thing: customized solutions for research papers work better than LLM web search.

thanks for catching the link issue

in case you can try out our solution for code agents, i'd love to hear what you think of it...

parima08•1m ago
Trying this out on a valve engineering PDF extraction task — the repo claims 72% improvement on PDF extraction specifically, which is why I'm biting. Installed the MCP and the results were actually contextualized against what I'd already tried in the repo, which surprised me. Handing it to Claude Code now. Quick question: does it pull implementation details from the paper itself, or just the technique? Will report back.
