frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Anthropic's original take home assignment open sourced

https://github.com/anthropics/original_performance_takehome
106•myahio•3h ago

Comments

koolba•1h ago
What is the actual assignment here?

The README only gives numbers without any information on what you’re supposed to do or how you are rated.

glalonde•1h ago
"Optimize the kernel (in KernelBuilder.build_kernel) as much as possible in the available time, as measured by test_kernel_cycles on a frozen separate copy of the simulator." from perf_takehome.py
vermilingua•48m ago
Think that means you failed :(
nice_byte•40m ago
+1

being cryptic and poorly specified is part of the assignment

just like real code

in fact, it's _still_ better documented an self contained than most of the problems you'd usually encounter in the wild. pulling on a thread to end up with a clear picture of what needs to be accomplished is like 90% of the job very often.

avaer•17m ago
It's definitely cleaner than what you will see in the real world. Research-quality repositories written in partial Chinese with key dependencies missing are common.

IMO the assignment('s purpose) could be improved by making the code significantly worse. Then you're testing the important stuff (dealing with ambiguity) that the AI can't do so well. Probably the reason they didn't do that is because it would make evaluation harder + more costly.

jackblemming•1h ago
Seems like they’re trying to hire nerds who know a lot about hardware or compiler optimizations. That will only get you so far. I guess hiring for creativity is a lot harder.

And before some smart aleck says you can be creative on these types of optimization problems: not in two hours, it’s far too risky vs regurgitating some standard set of tried and true algos.

rvz•47m ago
> Seems like they’re trying to hire nerds who know a lot about hardware or compiler optimizations. That will only get you so far. I guess hiring for creativity is a lot harder.

Good. That should be the minimum requirement.

Not another Next.js web app take home project.

tmule•47m ago
Your comments history suggests you’re rather bitter about “nerds” who are likely a few standard deviations smarter than you (Anthropic OG team, Jeff Dean, proof nerds, Linus, …)
jackblemming•37m ago
And they’re all dumber than John von Neumann, who cares?
margalabargala•3m ago
Transitively, you haven't thought the most thoughts or cared the most about anything, therefore we should disregard what you think and care about?
muglug•39m ago
If they're hiring performance engineers then they're hiring for exactly these sets of skills.

It's a take-home test, which means some people will spend more than a couple of hours on it to get the answer really good. They would have gone after those people in particular.

Analemma_•37m ago
This would be an inappropriate assignment for a web dev position, but I'm willing to bet that a 1% improvement in cycles per byte in inference (or whatever) saves Anthropic many millions of dollars. This is one case where the whiteboard assignment is clearly related to the actual job duties.
onion2k•34m ago
And before some smart aleck says you can be creative on these types of optimization problems: not in two hours, it’s far too risky vs regurgitating some standard set of tried and true algos.

You're both right and wrong. You're right in the sense that the sort of creativity the task is looking for isn't really possible in two hours. That's something that takes a lot of time and effort over years to be able to do. You're wrong because that's exactly the point. Being able to solve the problem takes experience. Literally. It's having tackled these sorts of problems over and over in the past until you can draw on that understanding and knowledge reasonably quickly. The test is meant to filter out people who can't do it.

I also think it's possible to interpret the README as saying humans can't do better than the optimizations that Claude does when Claude spends two hours of compute time, regardless of how long the human takes. It's not clear though. Maybe Claude didn't write the README.

mips_avatar•43m ago
Going through the assignment now. Man it’s really hard to pack the vectors right
avaer•40m ago
It's pretty interesting how close this assignment looks to demoscene [1] golf [2].

[1] https://en.wikipedia.org/wiki/Demoscene [2] https://en.wikipedia.org/wiki/Code_golf

It even uses Chrome tracing tools for profiling, which is pretty cool: https://github.com/anthropics/original_performance_takehome/...

nice_byte•37m ago
it's designed to select for people who can be trusted to manually write ptx :-)
greesil•39m ago
This is a knowledge test of GPU architecture?
avaer•36m ago
Kind of, but not any particular GPU.

The machine is fake and simulated: https://github.com/anthropics/original_performance_takehome/...

But presumably similar principles apply.

zeroCalories•37m ago
It shocks me that anyone supposedly good enough for anthropic would subject themselves to such a one sided waste of time.
mips_avatar•31m ago
It’s kind of an interesting problem.
browningstreet•31m ago
I’ve been sent the Anthropic interview assignments a few times. I’m not a developer so I don’t bother. At least at the time they didn’t seem to have technical but not-dev screenings. Maybe they do now.
sealeck•27m ago
Why is writing code to execute a program using the fewest instructions possible on a virtual machine a waste of time?
pclmulqdq•22m ago
I generally have a policy of "over 4 hours and I charge for my time." I did this in the 4-hour window, and it was a lot of fun. Much better than many other take-home assignments.
whateveracct•10m ago
4 hours continuous or no? I can't imagine finding 4 hours of straight focus.
sureglymop•27m ago
Having recently learned more about SIMD, PTX and optimization techniques, this is a nice little challenge to learn even more.

As a take home assignment though I would have failed as I would have probably taken 2 hours to just sketch out ideas and more on my tablet while reading the code before even changing it.

tucnak•25m ago
The snarky writing of "if you beat our best solution, send us an email and MAYBE we think about interviewing you" is really something, innit?
pvalue005•20m ago
I suspect this was released by Anthropic as a DDOS attack on other AI companies. I prompted 'how do we solve this challenge?' into gemini cli in a cloned repo and it's been running non-stop for 20 minutes :)
kristianpaul•12m ago
“If you optimize below 1487 cycles, beating Claude Opus 4.5's best performance at launch, email us at performance-recruiting@anthropic.com with your code (and ideally a resume) so we can be appropriately impressed and perhaps discuss interviewing.”

Anthropic's original take home assignment open sourced

https://github.com/anthropics/original_performance_takehome
107•myahio•3h ago•28 comments

Disaster planning for regular folks (2015)

https://lcamtuf.coredump.cx/prep/index-old.shtml
72•AlphaWeaver•2h ago•38 comments

A 26,000-year astronomical monument hidden in plain sight (2019)

https://longnow.org/ideas/the-26000-year-astronomical-monument-hidden-in-plain-sight/
415•mkmk•11h ago•89 comments

Libbbf: Bound Book Format, A high-performance container for comics and manga

https://github.com/ef1500/libbbf
13•zdw•1h ago•1 comments

Are arrays functions?

https://futhark-lang.org/blog/2026-01-16-are-arrays-functions.html
93•todsacerdoti•1d ago•54 comments

California is free of drought for the first time in 25 years

https://www.latimes.com/california/story/2026-01-09/california-has-no-areas-of-dryness-first-time...
323•thnaks•7h ago•163 comments

Show HN: Mastra 1.0, open-source JavaScript agent framework from the Gatsby devs

https://github.com/mastra-ai/mastra
134•calcsam•13h ago•43 comments

Instabridge has acquired Nova Launcher

https://novalauncher.com/nova-is-here-to-stay
165•KORraN•10h ago•110 comments

Which AI Lies Best? A game theory classic designed by John Nash

https://so-long-sucker.vercel.app/
87•lout332•7h ago•43 comments

The Unix Pipe Card Game

https://punkx.org/unix-pipe-game/
197•kykeonaut•13h ago•65 comments

Provably unmasking malicious behavior through execution traces

https://arxiv.org/abs/2512.13821
34•PaulHoule•7h ago•4 comments

Unconventional PostgreSQL Optimizations

https://hakibenita.com/postgresql-unconventional-optimizations
303•haki•15h ago•47 comments

The challenges of soft delete

https://atlas9.dev/blog/soft-delete.html
115•buchanae•8h ago•72 comments

IPv6 is not insecure because it lacks a NAT

https://www.johnmaguire.me/blog/ipv6-is-not-insecure-because-it-lacks-nat/
89•johnmaguire•10h ago•118 comments

Our approach to age prediction

https://openai.com/index/our-approach-to-age-prediction/
82•pretext•10h ago•151 comments

Who owns Rudolph's nose?

https://creativelawcenter.com/copyright-rudolph-reindeer/
26•ohjeez•5h ago•11 comments

The GDB JIT Interface

https://bernsteinbear.com/blog/gdb-jit/
8•surprisetalk•4d ago•2 comments

Proof of Concept to Test Humanoid Robots

https://thehumanoid.ai/humanoid-and-siemens-completed-a-proof-of-concept-to-test-humanoidrobots-i...
10•0xedb•5d ago•6 comments

Lunar Radio Telescope to Unlock Cosmic Mysteries

https://spectrum.ieee.org/lunar-radio-telescope
28•rbanffy•7h ago•1 comments

Verizon starts requiring 365 days of paid service before it will unlock phones

https://arstechnica.com/tech-policy/2026/01/verizon-starts-requiring-365-days-of-paid-service-bef...
79•voxadam•4h ago•63 comments

Maintenance: Of Everything, Part One

https://press.stripe.com/maintenance-part-one
91•mitchbob•10h ago•17 comments

The life of a playboy publisher who shaped 20th-century literature

https://www.washingtonpost.com/books/2026/01/09/bennett-cerf-biography-nothing-random-feldman-boo...
11•benbreen•3d ago•1 comments

Apples, Trees, and Quasimodes

https://systemstack.dev/2025/09/humane-computing/
40•entaloneralie•3d ago•3 comments

Building Robust Helm Charts

https://www.willmunn.xyz/devops/helm/kubernetes/2026/01/17/building-robust-helm-charts.html
51•will_munn•1d ago•0 comments

Show HN: Agent Skills Leaderboard

https://skills.sh
55•andrewqu•8h ago•18 comments

IP Addresses Through 2025

https://www.potaroo.net/ispcol/2026-01/addr2025.html
170•petercooper•16h ago•129 comments

Show HN: Aventos – An experiment in cheap AI SEO

https://www.aventos.dev/
14•JimsonYang•5d ago•8 comments

Fast Concordance: Instant concordance on a corpus of >1,200 books

https://iafisher.com/concordance/
41•evakhoury•4d ago•3 comments

Show HN: TopicRadar – Track trending topics across HN, GitHub, ArXiv, and more

https://apify.com/mick-johnson/topic-radar
23•MickolasJae•15h ago•4 comments

Ask HN: Do you have any evidence that agentic coding works?

144•terabytest•17h ago•138 comments