Show HN: ModelKombat – Arena-style battles for coding models

https://astra.hackerrank.com/model-kombat
9•rvivek•3d ago
I'm Vivek, co-founder/CEO of HackerRank (YC S11). You may know us as a hiring tool for developers and companies.

Over the years, we have built up deep expertise in generating programming challenges, and we are now using that to make coding models better.

Our first launch is Model Kombat -- an arena where you can directly compare anonymized coding models, side by side, on real problems.

* Pick an arena (Java, Python, etc.)
* Each battle has 3 rounds: see the problem + two model outputs -> vote on which you’d actually prefer
* Leaderboards + problem statements are updated weekly.
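The post does not say how votes are turned into leaderboard rankings. Purely as an illustration, here is a minimal sketch that assumes an Elo-style rating update over pairwise votes; the function names, the starting rating of 1000, and the step size `K` are all hypothetical and not taken from Model Kombat.

```python
from collections import defaultdict

K = 32  # hypothetical update step; not from the post


def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A is preferred over B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_ratings(ratings, winner, loser, k=K):
    """Apply one vote: the preferred model gains exactly what the other loses."""
    exp_win = expected_score(ratings[winner], ratings[loser])
    delta = k * (1.0 - exp_win)
    ratings[winner] += delta
    ratings[loser] -= delta


def leaderboard(votes, start=1000.0):
    """Fold (winner, loser) votes into ratings, sorted best first."""
    ratings = defaultdict(lambda: start)  # assumed starting rating
    for winner, loser in votes:
        update_ratings(ratings, winner, loser)
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Each tuple is one vote: (preferred model, other model). Names are made up.
    votes = [("model_a", "model_b"), ("model_a", "model_c"), ("model_b", "model_c")]
    for name, rating in leaderboard(votes):
        print(f"{name}: {rating:.1f}")
```

Because every vote moves the same number of points from one model to the other, the total rating stays constant, which makes weekly snapshots easy to compare. Other aggregation schemes (for example, a Bradley-Terry fit over all votes) would work equally well here.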

We already have 400+ challenges live, and our goal is to evolve toward real-world, multi-file challenges and robust tooling for coding model evaluation.

Try it now at: modelkombat.com

We’d love your feedback.

Comments

rafiki6•3d ago
Pretty fun! A few questions:

1 - are you planning to let people write their own prompts?

2 - when will you share the model names?

rvivek•3d ago
1 -- yes, soon

2 -- every week; check back this Sat.

elou•3d ago
Super interesting to see an arena specific to coding. I love the retro throwback.

How are you deciding which challenges to start with for this first release?

StilesCrisis•1h ago
When I tried this, one of the two C++ models kept generating pages and pages of ideas for how it could implement the code, but just got stuck in a loop doing this and failed to actually implement anything.

Then again, I’ve seen interviews play out like that too.

Less is safer: How Obsidian reduces the risk of supply chain attacks

https://obsidian.md/blog/less-is-safer/
229•saeedesmaili•7h ago•88 comments

Things managers do that leaders never would

https://simonsinek.com/stories/5-things-managers-do-that-leaders-never-would-according-to-simon/
63•9x39•3h ago•28 comments

If all the world were a monorepo

https://jtibs.substack.com/p/if-all-the-world-were-a-monorepo
72•sebg•3d ago•17 comments

Hidden risk in Notion 3.0 AI agents: Web search tool abuse for data exfiltration

https://www.codeintegrity.ai/blog/notion
90•abirag•7h ago•24 comments

Feedmaker: URL + CSS selectors = RSS feed

https://feedmaker.fly.dev
97•mustaphah•7h ago•16 comments

A 3D-Printed Business Card Embosser

https://www.core77.com/posts/138492/A-3D-Printed-Business-Card-Embosser
41•surprisetalk•2d ago•8 comments

Ants that seem to defy biology – They lay eggs that hatch into another species

https://www.smithsonianmag.com/smart-news/these-ant-queens-seem-to-defy-biology-they-lay-eggs-tha...
350•sampo•16h ago•113 comments

Show HN: WeUseElixir - Elixir project directory

https://weuseelixir.com/
111•taddgiles•8h ago•15 comments

Internet Archive's big battle with music publishers ends in settlement

https://arstechnica.com/tech-policy/2025/09/internet-archives-big-battle-with-music-publishers-en...
291•coloneltcb•4d ago•118 comments

Show HN: Zedis – A Redis clone I'm writing in Zig

https://github.com/barddoo/zedis
74•barddoo•7h ago•56 comments

Ruby Central's Attack on RubyGems [pdf]

https://pup-e.com/goodbye-rubygems.pdf
618•jolux•21h ago•203 comments

Faster Argmin on Floats

https://algorithmiker.github.io/faster-float-argmin/
8•return_to_monke•1d ago•3 comments

The best YouTube downloaders, and how Google silenced the press

https://windowsread.me/p/best-youtube-downloaders
244•Leftium•16h ago•102 comments

Three-Minute Take-Home Test May Identify Symptoms Linked to Alzheimer's Disease

https://www.smithsonianmag.com/smart-news/three-minute-take-home-test-may-identify-symptoms-linke...
73•pseudolus•10h ago•30 comments

Starfront Observatories

https://starfront.space/
32•stefanpie•3d ago•5 comments

Kernel: Introduce Multikernel Architecture Support

https://lwn.net/ml/all/20250918222607.186488-1-xiyou.wangcong@gmail.com/
133•ahlCVA•13h ago•36 comments

An untidy history of AI across four books

https://hedgehogreview.com/issues/lessons-of-babel/articles/perplexity
92•ewf•10h ago•32 comments

Your very own humane interface: Try Jef Raskin's ideas at home

https://arstechnica.com/gadgets/2025/09/your-very-own-humane-interface-try-jef-raskins-ideas-at-h...
75•zdw•11h ago•12 comments

R MCP Server

https://github.com/finite-sample/rmcp
82•neehao•3d ago•11 comments

Shipping 100 hardware units in under eight weeks

https://farhanhossain.substack.com/p/how-we-shipped-100-hardware-units
116•M_farhan_h•1d ago•63 comments

Trump to impose $100k fee for H-1B worker visas, White House says

https://www.reuters.com/business/media-telecom/trump-mulls-adding-new-100000-fee-h-1b-visas-bloom...
911•mriguy•9h ago•1214 comments

Mini: Tonemaps (2023)

https://mini.gmshaders.com/p/tonemaps
37•bpierre•2d ago•7 comments

Show the Physics

https://interactivetextbooks.tudelft.nl/showthephysics/Introduction/About.html
154•pillars•3d ago•7 comments

Time Spent on Hardening

https://third-bit.com/2025/09/18/time-spent-on-hardening/
53•mooreds•9h ago•16 comments

The health benefits of sunlight may outweigh the risk of skin cancer

https://www.economist.com/science-and-technology/2025/09/17/the-health-benefits-of-sunlight-may-o...
234•petethomas•1d ago•204 comments

Xmonad seeking help for Wayland port (2023)

https://xmonad.org/news/2023/10/06/wayland.html
64•clircle•2d ago•41 comments

The Economic Impacts of AI: A Multidisciplinary, Multibook Review [pdf]

https://kevinbryanecon.com/BryanAIBookReview.pdf
52•cjbarber•9h ago•15 comments

Safepoints and Fil-C

https://fil-c.org/safepoints
76•matt_d•4d ago•41 comments

Revamping an Old TV as a Gift (2019)

https://blog.davidv.dev/posts/revamping-an-old-tv-as-a-gift/
68•deivid•14h ago•27 comments

Nostr

https://nostr.com/
336•dtj1123•23h ago•293 comments