
Mappa – Fine-tune ANY multi-agent LLM system end-to-end with AI coaches

3•junyuren•1h ago
Blog: https://ltjed.github.io/MAPPA/
Paper: https://arxiv.org/abs/2601.23228
Code: https://github.com/ltjed/multiagent-coaching
Twitter: https://x.com/t_ed_li/status/2019114121250370021

Comments

junyuren•1h ago
Author here. Happy to answer questions.

The problem: when you have multiple LLM agents working together and something fails, which agent is responsible? Traditional RL gives you one reward at the end, so all agents share the blame equally.

Our approach: an external LLM (we used Gemini) watches each agent's actions and tool outputs, then assigns per-action scores. When agent 3 crashes because agent 1 forgot to save a file, the coach traces back through the tool outputs and blames agent 1, not agent 3.

This gives you dense training signal without needing ground truth labels. The coach provides the supervision.
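
To make the credit-assignment difference concrete, here is a toy sketch, not the MAPPA code. Everything in it is an assumption made for illustration: the `Action` fields, the `toy_coach` function, and the 0.0 / 0.5 / 1.0 score values are hypothetical, and a hand-written rule stands in for the LLM coach. It only contrasts one shared end-of-episode reward with per-action scores that trace a crash back to the step that caused it.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    """One step taken by one agent (hypothetical structure for this sketch)."""
    agent: str
    expected_files: set = field(default_factory=set)  # files this step should write
    saved_files: set = field(default_factory=set)     # files it actually wrote
    reads: set = field(default_factory=set)           # files it needs as input


def shared_reward(trajectory, final_reward):
    """Baseline: every action receives the same end-of-episode reward."""
    return [final_reward] * len(trajectory)


def toy_coach(trajectory):
    """Rule-based stand-in for an LLM coach: one score per action.

    When a step fails because an input file is missing, the earlier step
    that was supposed to write that file gets the blame (score 0.0) and
    the failing step gets a neutral 0.5. These values are assumptions.
    """
    scores = [1.0] * len(trajectory)
    available = set()
    for i, act in enumerate(trajectory):
        missing = act.reads - available
        if missing:
            culprit_found = False
            for j in range(i):
                # Trace back: which earlier step should have produced the file?
                if trajectory[j].expected_files & missing:
                    scores[j] = 0.0
                    culprit_found = True
            scores[i] = 0.5 if culprit_found else 0.0
        available |= act.saved_files
    return scores


# Agent 1 forgets to save data.csv; agent 3 later crashes trying to read it.
trajectory = [
    Action("agent_1", expected_files={"data.csv"}, saved_files=set()),
    Action("agent_2"),
    Action("agent_3", reads={"data.csv"}),
]

print(shared_reward(trajectory, 0.0))  # [0.0, 0.0, 0.0]
print(toy_coach(trajectory))           # [0.0, 1.0, 0.5]
```

With the shared reward all three agents are penalized identically. With per-action scores, the penalty lands on agent 1's step, which is the kind of dense signal described above.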

Practical angle: you use the API calls only during training. Afterward you have a team of local models that run offline. We tested with Qwen and LLaMA base models.

Results: +17pp on AIME math competition, +38% F1 on Kaggle-style data science tasks.

Hardware requirement is 2-8x 80GB GPUs depending on model size. Code is MIT licensed.

The framework is general - plug in your own agents, your own task, your own coach model.

ed_li•1h ago
Does MAPPA work for law?