Show HN: Do we need MCPs? Reverse-engineered Slack and Linear API for Evals & RL

https://www.agentdiff.dev/
5•hubertmarek•37m ago

Comments

hubertmarek•23m ago
I ran evals on the 40-task Linear API benchmark, and most frontier models scored surprisingly well:

- Claude Opus 4.5: 95% (38/40)
- GLM 4.6: 87.5% (35/40)
- Claude Sonnet 4.5: 85% (34/40)
- Claude Haiku 4.5: 82.5% (33/40)
- Kimi K2: 82.5% (33/40)
- Grok 4.1 Fast: 80% (32/40)
- GPT 5.1: 77.5% (31/40)
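
For anyone who wants to check the numbers, each percentage above is just passed tasks divided by the 40 tasks in the benchmark. A minimal Python sketch of that arithmetic follows; the names `passed`, `pass_rate`, and `rates` are made up for illustration and are not part of the benchmark's own tooling.

```python
# Passed-task counts copied from the list above; every model ran the same 40 tasks.
TOTAL_TASKS = 40

passed = {
    "Claude Opus 4.5": 38,
    "GLM 4.6": 35,
    "Claude Sonnet 4.5": 34,
    "Claude Haiku 4.5": 33,
    "Kimi K2": 33,
    "Grok 4.1 Fast": 32,
    "GPT 5.1": 31,
}


def pass_rate(n_passed, total=TOTAL_TASKS):
    """Return the pass rate as a percentage (hypothetical helper)."""
    return 100.0 * n_passed / total


rates = {model: pass_rate(n) for model, n in passed.items()}

# Print models from highest to lowest pass rate.
for model, rate in sorted(rates.items(), key=lambda kv: -kv[1]):
    print(f"{model}: {rate:g}% ({passed[model]}/{TOTAL_TASKS})")
```

With only 40 tasks, one task is worth 2.5 percentage points, so small gaps between models (for example 82.5% vs 80%) come down to a single task.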

This makes me wonder whether we really need to reinvent the wheel and build special interfaces (MCPs) for agents to interact with services, when they can just use the APIs as they are.

Feedback is more than welcome. Thanks!

Why AI Investments makes sense

https://www.sledgeworx.io/why-ai-investments-makes-sense/
1•Sevii•2m ago•0 comments

Show HN: Talk to your Google Calendar (read/create/edit/delete events by voice)

https://calendarflow.dev
1•Rostik312•3m ago•0 comments

AI Data Centers Can Tell Us Something About Credit Market Weakness

https://www.bloomberg.com/news/audio/2025-12-04/odd-lots-what-ai-reveals-about-credit-market-weak...
2•zerosizedweasle•5m ago•0 comments

China has invented a new way to do innovation

https://www.noahpinion.blog/p/china-has-invented-a-whole-new-way
1•bookofjoe•6m ago•0 comments

From No-Code to Full-Stack in 3 Weeks: How I Built FlickFuture

https://pieterhaasbroek.substack.com/p/from-no-code-to-full-stack-in-3-weeks
1•PantherCat•6m ago•0 comments

Show HN: Invest in ETFs and Stocks from Inside ChatGPT and Claude

https://dialog.treasury.app/
1•rothblatt•7m ago•0 comments

The high cost of free testing

https://engineering.block.xyz/blog/the-high-cost-of-free-testing
1•ktpsns•9m ago•0 comments

App Store Awards 2025

https://developer.apple.com/app-store/app-store-awards-2025/
1•soheilpro•9m ago•0 comments

Show HN: ToolDexo – 29 free online tools (calculator, converter, generator)

https://www.tooldexo.com/
1•WebCreator•10m ago•0 comments

Show HN: FluentUI Icons – Search 6k+ Microsoft Icons with MCP Support for Claude

https://github.com/KeenMate/fluentui-icons
1•OndrejValenta•10m ago•0 comments

Pornography company fined £1M by Ofcom for not having strong enough age checks

https://www.theguardian.com/society/2025/dec/04/pornography-site-fined-1m-for-not-having-strong-a...
1•nickcotter•10m ago•0 comments

Show HN: LLM-Infra-Lab – A minimal, reproducible lab for LLM systems

https://github.com/REICHIYAN/llm_infra_lab
1•Sai-HN•12m ago•0 comments

Macron warned US could 'betray' Ukraine in leaked leaders' call

https://www.politico.eu/article/european-leaders-warn-us-could-betray-ukraine-in-leaked-call/
3•belter•12m ago•0 comments

Show HN: OnlyRecipe 2.0 – I added all features HN requested – 4 years later

https://onlyrecipeapp.com/?url=https://www.allrecipes.com/turkish-pasta-recipe-8754903
2•AwkwardPanda•17m ago•1 comment

Handling the Cloudflare outage with infrastructure as code in 10 minutes

https://www.carneiro.pt/blog/2025-cloudflare-outage/
1•mig4ng•20m ago•1 comment

Autonomous Observability at Pinterest

https://medium.com/pinterest-engineering/autonomous-observability-at-pinterest-part-1-of-2-eb0ada...
1•meysamazad•21m ago•0 comments

What's the best way to expand the US electricity grid?

https://news.mit.edu/2025/best-way-to-expand-us-electricity-grid-1204
1•meysamazad•22m ago•1 comment

Minimum Viable Benchmark

https://blog.nilenso.com/blog/2025/11/28/minimum-viable-benchmark/
1•ath_ray•22m ago•0 comments

My First ProductHunt Launch Flopped: 11 Upvotes, 800 Visitors

https://meysam.io/blog/first-producthunt-launch-flopped/
2•meysamazad•23m ago•0 comments

Warning to lawyers helping LiP who submitted AI-generated authorities

https://www.lawgazette.co.uk/news/warning-to-lawyers-helping-lip-with-ai-generated-authorities/51...
2•ColinWright•25m ago•0 comments

Illusion of Consensus

https://bigthinkmedia.substack.com/p/the-illusion-of-consensus-is-powerful
1•fbn79•27m ago•0 comments

A VLA That Learns from Experience

https://www.pi.website/blog/pistar06
1•thunderbong•28m ago•0 comments

Show HN: Intrepid – A Visual Behavior Builder for Real Robotics Code

https://intrepid.ai/product/
1•frag•28m ago•0 comments

Duck Store for Hackers – New Modern Vulnerable Web App

https://duck-store.escape.tech
1•alexxxchr•28m ago•0 comments

Coupongogo: Remote-Controlled Crypto Stealer Targeting Developers on GitHub

https://www.rastersec.com/blog/coupongogo-cryptostealer
2•stnby•29m ago•0 comments

President Donald Trump Appears to Approve Kei Cars for the USA

https://www.roadandtrack.com/news/a69623655/president-donald-trump-kei-cars-usa/
3•eatonphil•30m ago•0 comments

Cursed circuits #2: switched capacitor lowpass

https://lcamtuf.substack.com/p/cursed-circuits-2-switched-capacitor
1•hasheddan•30m ago•0 comments

Inventing a new programming language for web development was a mistake

https://twitter.com/MatijaSosic/status/1996576283447480624
2•matijash•33m ago•0 comments

Spoon Bending

https://grantgumina.notion.site/Spoon-Bending-2bc8eeba3c2d80109957d0d28b66e559
2•gum_ina_package•34m ago•0 comments

Whence the Force of F = ma? I: Culture Shock

https://physicstoday.aip.org/opinion/whence-the-force-of-f-ma-i-culture-shock
2•o4c•34m ago•0 comments