frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: A proposal for interviewing "AI-Augmented" Engineers

1•vanbashan•1h ago
Hi HN,

I’m currently rethinking our hiring process. Like many of you, I feel that traditional algorithmic tests (LeetCode style) are becoming less relevant now that LLMs can solve them instantly. Furthermore, prohibiting AI during interviews feels counter-productive; I want to hire engineers who know how to use these tools effectively to multiply their output.

I am designing a new evaluation framework based on real-world open-source work, and I would love the community’s feedback on whether this sounds fair, effective, or if I’m missing something critical.

The Core Philosophy: We shouldn't test if a candidate can write syntax better than an AI. We should test if they can guide, debug, and improve upon an AI's output to handle the "last mile" of complex engineering.

The Proposed Process:

1. Task Selection (Real World Context) Instead of synthetic puzzles, we select open issues or discussions from public GitHub repositories that share a tech stack with our product.

    Scope: 2–4 hours.

    Types: Implementing a feature based on a discussion, fixing a bug, or reviewing a PR (specifically one that was eventually rejected, to test "taste").

    Ambiguity: Adjusted for seniority. Junior roles get clear specs; senior roles get vague problem statements requiring architectural decisions.
2. Establishing the "AI Baseline" Before giving the task to a candidate, we run it through current SOTA models with minimal human intervention.

    The Filter: If the AI solves it perfectly on the first try, we discard the task.

    The Sweet Spot: We are looking for tasks where the AI gets 80% right but fails on edge cases, context integration, or complex logic. The problem setup should not be too easy or too hard.
3. The Candidate Test Candidates are required to use their preferred AI coding tools. We ask them to submit not just the code, but their chat/prompt history.

How We Evaluate (The "AI Delta"):

We aren't just looking at the final code. We analyze the "diff" between the Candidate’s process and our "AI Baseline":

    1. Exploration Strategy: How does the candidate "load context"? Do they blindly paste errors, or do they guide the AI to understand the repository structure first? We look for a clear understanding of the existing codebase.

    2. Engineering Rigor (TDD): Does the candidate push the AI to generate a test plan or reproduction script before generating the fix? We value candidates who treat the AI as a junior partner that needs verification.

    3. The "Last 10%" (Edge Cases): Since we picked tasks where AI fails slightly, we look at how the candidate handles those failure modes. Can they spot the boundary conditions and logic errors that the LLM glossed over?

    4. Documentation Hygiene: We specifically check if the candidate instructs the AI to search existing documentation and—crucially—if they prompt the AI to update the docs to reflect the new changes.

    5. Engineering Taste (The Rejected PR): For the code review task, we ask them to analyze a PR that was rejected in the real world (without telling them). We want to see if their reasoning for rejection aligns with our team's engineering culture (maintainability, complexity, clarity, etc.).
My Questions for HN:

    Is analyzing the "Chat History" too invasive, or is it the best way to see their thought process in 2026?

    For those of you hiring now, how do you distinguish between a "prompt kiddie" and a senior engineer who is just very good at prompting?

    Does the 2-4 hour time commitment feel reasonable for a "take-home" if the tooling makes the actual coding faster?
Thanks for your insights!

(Full disclosure: In the spirit of this topic, this post was composed by AI based on my draft notes.)

GLM-OCR

https://twitter.com/Zai_org/status/2018520052941656385
1•sergiotapia•11s ago•0 comments

OpenClaw – Hands for a Brain That Doesn't yet Exist

https://bengoertzel.substack.com/p/openclaw-amazing-hands-for-a-brain
1•laurex•26s ago•0 comments

Rust in the NetBSD Kernel, and other odd decisions

https://bentsukun.ch/posts/netbsd-rust-kernel/
1•jaypatelani•5m ago•0 comments

Show HN: LevelUpPro – Test-drive tech careers before committing

https://www.leveluppro.in
1•vijayanand_v•6m ago•0 comments

The Physical World Doesn't Want Your "Success Dataset"

https://substack.com/home/post/p-186695082
1•FuseGov•7m ago•1 comments

Salazar vs. Paramount Global (3:22-CV-00756) [pdf]

https://dn710205.ca.archive.org/0/items/gov.uscourts.tnmd.92043/gov.uscourts.tnmd.92043.1.0.pdf
1•1vuio0pswjnm7•8m ago•0 comments

Clawsocial.io – a crustacean themed network 4000 meters deep

https://clawsocial.io/#/
1•hnaln•8m ago•0 comments

Fifteen former college basketball players charged in alleged betting scheme

https://www.theguardian.com/sport/2026/jan/15/fifteen-former-college-basketball-players-charged-i...
1•PaulHoule•12m ago•0 comments

Show HN: Open-source semantic search over your local notes via CLI

https://github.com/chenxin-yan/nia-vault
1•jellyotsiro•13m ago•0 comments

How Vibe Coding Is Killing Open Source

https://hackaday.com/2026/02/02/how-vibe-coding-is-killing-open-source/
1•lxm•14m ago•0 comments

Over 60% of YC start up are B2B

https://pardusai.org/view/7d44e51254facd240c2889c1abdd1207a70f531dafe8d80917e56dc50d72da73
1•JasonHEIN•14m ago•0 comments

Fixing academic email perishability with personal domains

https://r-federation.eu
2•r-federation•14m ago•0 comments

Man, 83, Tricked by Scammers, Gets 21 Years to Life for Killing Uber Driver

https://www.nytimes.com/2026/02/02/us/ohio-man-kills-uber-driver-sentenced.html
2•lxm•15m ago•1 comments

The Tragedy of Supernatural

https://www.theverge.com/tech/871250/supernatural-meta-vr-fitness-community
1•guiambros•19m ago•1 comments

Looking back at Catacomb 3D, the game that led to Wolfenstein 3D

https://arstechnica.com/gaming/2026/02/looking-back-at-catacomb-3d-the-game-that-led-to-wolfenste...
1•AdmiralAsshat•22m ago•0 comments

Fecal microbiota transplantation and immunotherapy in metastatic renal carcinoma

https://www.nature.com/articles/s41591-025-04183-8
1•bookofjoe•25m ago•0 comments

Show HN: Stream-based AI with neurological multi-gate (Na⁺/θ/NMDA)

https://github.com/CSCT-NAIL/CSCT
2•CSCT-NAIL•28m ago•2 comments

How to carry more than your own bodyweight (2025)

https://www.bbc.com/future/article/20250124-how-to-carry-more-than-your-own-bodyweight
1•1659447091•31m ago•1 comments

Show HN: Dm.bot – DMs between AI agents with no humans in the middle

https://dm.bot
1•dommm•32m ago•1 comments

Lawsuit Challenges National Park Service Ban on Cash Payments

https://reclaimthenet.org/lawsuit-challenges-national-park-service-ban-on-cash-payments
8•bilsbie•39m ago•0 comments

Data Centers Are Not "Campuses"

https://newrepublic.com/article/205525/data-centers-campus-virginia
2•petethomas•40m ago•0 comments

Show HN: APYCalc – Privacy-First APY Calculator (Zero Data Collection)

https://www.apycalc.net/
1•ludydev•41m ago•0 comments

Voynich Manuscript

https://en.wikipedia.org/wiki/Voynich_manuscript
1•reaperducer•42m ago•0 comments

Six Facts about the Recent Employment Effects of AI (Nov. 2025, Pdf)

https://digitaleconomy.stanford.edu/app/uploads/2025/11/CanariesintheCoalMine_Nov25.pdf
2•bikenaga•47m ago•2 comments

Classified Whistleblower Complaint About Tulsi Gabbard Stalls Within Her Agency

https://www.wsj.com/politics/national-security/classified-whistleblower-complaint-about-tulsi-gab...
15•petethomas•50m ago•2 comments

The Vanilla Web Is Wonderful

https://benjaminsmallwood.com/blog/the-vanilla-web-is-wonderful/
2•bensmallwood•56m ago•1 comments

Show HN: One Ego, Any Model – A Chrome Extension for Portable AI Context

https://chromewebstore.google.com/detail/context-wallet/cipkkclgneblkoifncgjncaapiamcjho
1•haebom•57m ago•1 comments

Show HN: CancelShouldBeEasy – Generate and co-sign consumer complaint letters

https://CancelShouldBeEasy.com
1•xinbenlv•1h ago•0 comments

Lombard Effect

https://en.wikipedia.org/wiki/Lombard_effect
2•porjo•1h ago•1 comments

Ask HN: Interest in low cost / fast container registry?

1•osigurdson•1h ago•0 comments