frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: FB Album Downloader

https://chromewebstore.google.com/detail/fb-album-downloader/cgkgapbmopldaebaecjjgiopdlkkaolo
1•qwikhost•9m ago•0 comments

Adobe to Acquire Semrush

https://news.adobe.com/news/2025/11/adobe-to-acquire-semrush
1•rajeevk•11m ago•0 comments

#!magic, details about the shebang/hash-bang mechanism on various Unix flavours

https://www.in-ulm.de/%7Emascheck/various/shebang/
3•js2•18m ago•0 comments

Jack Conte on "an algorithm that doesn't rot your brain"

https://www.nytimes.com/2025/11/19/opinion/patreon-algorithms-social-media-internet.html
2•dougb5•26m ago•0 comments

Saudi Arabia's Prince Has Big Plans, but His Giant Fund Is Low on Cash

https://www.nytimes.com/2025/11/19/business/pif-saudi-arabia-fund-problems.html
3•mmooss•27m ago•2 comments

Detect Fake Google Reviews with This Open Source Project

https://github.com/doncoriolan/google-fake-reviews
1•mike_prixe•28m ago•0 comments

Grok-4.1 confirms missing guardrail for virus-hosting links (Nov 9 incident)

https://drive.google.com/drive/folders/15aT3TrXbqf8m8DjyRpzaMjK7w-3y9Vv3?usp=sharing
1•MarsFryCook•29m ago•0 comments

RealWorldProgrammer – How to scale blockchain workflows

https://realworldprogrammer.com/2025/11/18/overloading-the-node-the-hidden-engineering-behind-sca...
1•rwpdotcom•32m ago•0 comments

A better way to use MCP

https://rayai.com/blog/code-executing-agents-in-production
1•Amit_Patil_010•32m ago•0 comments

Who Is OpenAI's Auditor?

https://www.ft.com/content/3cff198e-25e5-481a-bd34-e26941e1d12d
2•mraniki•33m ago•1 comments

New Agentic Development Environment

https://app.principal-ade.com
1•fernandoramlugo•35m ago•0 comments

Ask HN: What are some modern technologies that you refuse to adopt?

4•catstor•40m ago•1 comments

Show HN: Quantum4J – A Pure Java Quantum Computing SDK

https://github.com/vijayanandg/quantum4j
1•vijayanandg•42m ago•1 comments

I wrote lyrics about dev life and had Suno turn them into an 80s glam-metal song [video]

https://www.youtube.com/watch?v=7mT-vaYieT4
3•ztp123•49m ago•1 comments

How AI will change software engineering – with Martin Fowler

https://www.youtube.com/watch?v=CQmI4XKTa0U
7•pramodbiligiri•57m ago•2 comments

Ask HN: Git Mirrors. Who's running one? What repos are you mirroring?

1•gooob•1h ago•0 comments

Don't Split My Data: I Will Use a Database (Not PostgreSQL) for My Data Needs

https://www.eloqdata.com/blog/2025/11/07/use-real-database-for-data-needs
1•iamlintaoz•1h ago•0 comments

Physicists demonstrate the speed of light with unprecedented accuracy

https://phys.org/news/2025-11-physicists-constancy-unprecedented-accuracy.html
3•stOneskull•1h ago•1 comments

I wish I were as interesting as my phone

https://lukaspet.substack.com/p/jelly-star
2•lukaspetersson•1h ago•0 comments

Ask HN: Agent evaluations, what is everything I should know?

3•akira_067•1h ago•1 comments

Show HN: Pusher's Maze – a browser-based puzzle game

https://pushersmaze.vercel.app/
1•gagarwal123•1h ago•4 comments

Company built an internal agent framework because agent frameworks suck

1•akira_067•1h ago•0 comments

The Secrets of Watch Regulation

https://www.youtube.com/watch?v=tadSi7KNBQw
1•o4c•1h ago•0 comments

RFC Hub

https://rfchub.app/
2•todsacerdoti•1h ago•0 comments

Enoch, a date-prediction AI-model, trained on C14-dated scroll samples

https://journals.plos.org/plosone/article?id=10.1371%2Fjournal.pone.0323185
1•felineflock•1h ago•0 comments

Show HN: AI Search Engineer in Telecoms for Research and Development

https://commsearch.info
3•niliu123•1h ago•1 comments

I Worked for Hyundai. What I Saw Will Shock You. [video]

https://www.youtube.com/watch?v=OKgurZ0CRDE
3•evanjrowley•1h ago•1 comments

Sheaf Topos Theory: A Powerful Setting for Lagrangian Field Theory

https://www.alphaxiv.org/abs/2504.08095
1•measurablefunc•1h ago•0 comments

Visually impaired students learn to make music as the Semi-Modulars synth band

https://www.texasstandard.org/stories/semi-modulars-band-texas-school-blind-visually-impaired/
1•1659447091•1h ago•0 comments

Why the Future Doesn't Need Us (2000)

https://www.wired.com/2000/04/joy-2/
2•mooreds•1h ago•1 comments
Open in hackernews

Ask HN: Agent evaluations, what is everything I should know?

3•akira_067•1h ago
I'm currently building coding agents, and wondering what the standard is for creating and running evals for most people? I gather that the tasks and their definitions will be dramatically different across domains and instances, so I'm not hoping for a one size fits all. Just... what actually works for you in practice?

Comments

adastra22•36m ago
The capabilities of the tool matter more. Claude Code, Codex, Cursor CLI all have different feature sets. This usually determines the choice more than base model capabilities.