frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: SafeShare – Clean tracking params locally (PWA and bookmarklets

https://j-ai-71.github.io/Supersystem/
1•safeshare•2m ago•0 comments

Ilya Sutskever – We're moving from the age of scaling to the age of research

https://youtu.be/aR20FWCCjAs?si=SD1bp8f5jOcUdl78
1•blufish•4m ago•0 comments

AI Tools Dashboard (Updated Daily)

https://phshort.com/ai
1•mohamadkk7•4m ago•0 comments

Show HN: God's Eye – Subdomain recon with local AI analysis

https://github.com/Vyntral/god-eye
1•vyntral•4m ago•0 comments

Larry Ellison Met with Trump to Discuss Which CNN Reporters They Plan to Fire

https://www.techdirt.com/2025/11/25/larry-ellison-met-with-trump-to-discuss-which-cnn-reporters-t...
3•throw0101a•4m ago•0 comments

Nimbalyst: WYSIWYG Markdown editor with visual diffs powered by Claude Code

https://github.com/Nimbalyst/nimbalyst
2•wek•6m ago•0 comments

DJI ROMO robot vacuum [video]

https://www.youtube.com/watch?v=Iv7BYURURRI
1•surprisetalk•7m ago•0 comments

Show HN: StepKit, an open and cross-platform durable execution standard

12•tonyhb•7m ago•1 comments

93% Faster Next.js in (Your) Kubernetes

https://blog.platformatic.dev/93-faster-nextjs-in-your-kubernetes
1•chrisdoc•9m ago•0 comments

Atlassian's DR simulation showed it lived in dependency hell

https://www.theregister.com/2025/11/25/atlassian_dependency_migration/
1•BerislavLopac•10m ago•0 comments

LJV – Lissajous Curve Music Visualization

https://github.com/ThatXliner/ljv
1•thatxliner•11m ago•0 comments

Economist get cold feet about high minimum wages

https://www.economist.com/finance-and-economics/2025/11/20/economists-get-cold-feet-about-high-mi...
1•gdudeman•12m ago•1 comments

Visualizing the Sorites Paradox via LLM Probability Logits

https://joshfonseca.com/blogs/sorites-paradox
1•vuciv•12m ago•1 comments

Show HN: We built an open source, zero webhooks payment processor

https://github.com/flowglad/flowglad
13•agreeahmed•13m ago•7 comments

HubSpot forms are under attack by bots–how do you protect CRM data?

1•rayyanabrar76•14m ago•0 comments

Tampa men charged with smuggling Nvidia chips to China

https://www.tampabay.com/news/crime/2025/11/21/china-computer-chip-smuggling-tampa-nvidia-ai-arre...
1•donsupreme•14m ago•1 comments

It Is OK to Say "CSS Variables" Instead of "Custom Properties"

https://blog.kizu.dev/css-variables/
1•eustoria•15m ago•0 comments

Lowercase head behaves differently in Git worktrees

https://www.brandonpugh.com/til/git/head-is-case-sensitive/
1•eustoria•16m ago•0 comments

Google attacking human thought with Gemini in Google Keep

4•fellowniusmonk•18m ago•0 comments

Modular Code with Reusable Standalone Modules

https://massimo-nazaria.github.io/reusable-standalone-modules.html
1•massimo-nazaria•20m ago•0 comments

Wave of mass brutality accompanied the collapse of first pan-European culture

https://www.science.org/content/article/headless-bodies-hint-why-europe-s-first-farmers-vanished
5•mzs•20m ago•0 comments

Show HN: Memory System Hitting 80.1% Accuracy on LoCoMo (Built in 4.5 Months)

https://github.com/vac-architector/VAC-Memory-System
2•ViktorKuz•21m ago•0 comments

Stop Telling Us XMPP Should Use JSON

https://www.process-one.net/blog/stop-telling-us-xmpp-should-use-json/
2•todsacerdoti•22m ago•0 comments

RDMA over Thunderbolt 5 on Apple Silicon – 14µs latency

https://twitter.com/anemll/status/1993182652204187929
2•anemll•23m ago•1 comments

FBI: Cybercriminals stole $262M by impersonating bank support teams

https://www.bleepingcomputer.com/news/security/fbi-cybercriminals-stole-262-million-by-impersonat...
3•fleahunter•23m ago•0 comments

Super-adjuvant nanoparticles for platform cancer vaccination

https://www.cell.com/cell-reports-medicine/fulltext/S2666-3791(25)00488-4
2•bookofjoe•25m ago•0 comments

Ilya Sutskever on Dwarkesh Patel's Podcast

https://www.dwarkesh.com/p/ilya-sutskever-2
3•piotrgrabowski•25m ago•0 comments

Show HN: ZenPaint, a pixel-perfect MacPaint recreation for the browser

https://zenpaint.org/
4•allthreespies•25m ago•0 comments

Pluribus: The audacity of the Breaking Bad creator's new TV show is incredible

https://www.theguardian.com/tv-and-radio/2025/nov/07/pluribus-review-breaking-bad-creators-tv-sho...
2•wslh•26m ago•0 comments

CSS Meets Voxel Art: Building a Rendering Engine with Stacked Grids – Codrops

https://tympanus.net/codrops/2025/03/03/css-meets-voxel-art-building-a-rendering-engine-with-stac...
2•thunderbong•27m ago•0 comments
Open in hackernews

Show HN: Agent Runner – open-source agent harness to benchmark real coding

https://www.designarena.ai/?arena=agents&harness=standard
3•grace77•1h ago
Hey HN! We built Agent Runner, a model-agnostic, open-source agent harness that executes the same prompt against two anonymized coding agents in parallel sandboxes. Each agent can make tool calls, edit multiple files, and self-correct through iterative reasoning. You pick the better result - this becomes the ground truth for the leaderboard.

Why we built it Traditional benchmarks often fall short for modern agentic systems: they rely on static tasks and only measure final outputs. But real coding agents modify multiple files across a repo, answer to user re-prompts, use tool calls, and recover from partial failures

What Agent Runner does You ask it to build anything Agent Runner kicks off two generations from different sandboxed LLM providers (OpenAI, Anthropic, Google, xAI, Mistral, Kimi, and more) Anonymized models make tool calls, multi-file edits, and cater to reprompts You pick your favorite - this preference powers the benchmark

Because different providers handle tool calls, prompts, and execution semantics differently, we worked with each provider to ensure configurations reflect intended behavior. These provider-specific setups remain private, but Agent Runner itself is open-source.

How to try it Kick off Agent Runner at https://www.designarena.ai/agentarena Repo at https://github.com/Design-Arena/agent-runner Use it as a CLI tool: https://pypi.org/project/agent-runner/ pip install agent-runner agentrunner run “create a nextjs replica of Discord”

We hope this provides a provider-agnostic, framework-agnostic, realistic benchmark for state-of-the-art coding agents.

Video demo: https://youtu.be/rdtiuCHatjs