frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Research-Backed Multi-Agent System for Autonomous Development

https://github.com/asklokesh/claudeskill-loki-mode
2•slogansand•17h ago
Hey HN, author here. Loki Mode orchestrates specialized AI agents to take a PRD to deployed product with zero human intervention. But what I'm most proud of is the research foundation - we implemented virtually every scientifically proven pattern from the 2025-2026 AI agent literature. From Anthropic:

Constitutional AI self-critique against principles Building Effective Agents evaluator-optimizer pattern Claude Code Best Practices explore-plan-code workflow Visible Extended Thinking (think, think hard, ultrathink levels) Effective Harnesses one-feature-at-a-time pattern

From DeepMind:

SIMA 2 self-improvement loops Gemini Robotics hierarchical reasoning (planner + executor) Scalable AI Safety debate-based verification

From OpenAI:

Agents SDK tracing, guardrails, tripwires Deep Research adaptive planning with backtracking AGENTS.md standardized instructions

From Academic Research:

CONSENSAGENT (ACL 2025): Blind review + Devil's Advocate when unanimous. 30% false positive reduction. GoalAct: Global planning → skill decomposition → local execution. 12%+ success rate improvement. A-Mem: Zettelkasten-style memory linking for episodic→semantic consolidation. Multi-Agent Reflexion: Structured debate (Implementer → Skeptic → Advocate → Synthesizer). Iter-VF: Verify answer only, not reasoning chain. Prevents context overflow.

From Industry:

NVIDIA ToolOrchestra: Three-reward signal (outcome/efficiency/preference), dynamic agent selection AWS Bedrock: Routing mode for simple tasks, supervisor mode for complex Boris Cherny's self-verification loop (2-3x quality improvement) Simon Willison's sub-agents for context isolation

From HN discussions:

"Zero companies without human in the loop" → confidence-based escalation Context curation beats automatic RAG Fresh contexts yield better results LLM-as-judge has shared blind spots → deterministic validation

The full acknowledgements with links to every paper/resource: https://github.com/asklokesh/claudeskill-loki-mode/blob/main... Run: claude --dangerously-skip-permissions then "Loki Mode with PRD at path/to/prd" Happy to discuss any of the research or architecture decisions.

Comments

slogansand•16h ago
[2.35.0] - 2026-01-08 Added - Anthropic Agent Harness Patterns & Claude Agent SDK Sources:

Effective Harnesses for Long-Running Agents - Anthropic Engineering Claude Agent SDK Overview - Anthropic Platform New Patterns:

One Feature at a Time (Rule #7 in Core Autonomy)

Work on exactly one feature per iteration Complete, commit, verify before moving to next Prevents over-commitment and ensures clean progress tracking E2E Browser Testing with Playwright MCP

Features NOT complete until verified via browser automation New Essential Pattern: Playwright MCP -> Automate browser -> Verify UI features visually Detailed verification flow added to SKILL.md Note: Playwright cannot detect browser-native alert modals Advanced Task Tool Parameters

run_in_background: Returns output_file path, output truncated to 30K chars resume: Continue interrupted agents with full context Use cases: Context limits, rate limits, multi-session work Fixed Release workflow: Use gh CLI instead of softprops action for atomic release creation

Making Tools Developers Actually Use – Michiel Borkent [video]

https://www.youtube.com/watch?v=119qVkHxPkM
1•adityaathalye•1m ago•0 comments

Ask HN: Why is Claude Code so cheap?

1•figassis•1m ago•0 comments

My Blessed Setup for Public Bookmarks

https://incoherenceofthe.net/blog/links.xml
1•pkal•4m ago•0 comments

Ask HN: How to stay relevant in the age of AI?

1•snow_mac•5m ago•0 comments

Google Guys Say Bye to California

https://www.nytimes.com/2026/01/09/technology/google-founders-california-wealth-tax.html
1•fleahunter•5m ago•0 comments

Latest SteamOS Beta Now Includes Ntsync Kernel Driver

https://www.phoronix.com/news/Steam-OS-Beta-NTSYNC
1•LorenDB•6m ago•0 comments

By 2030, 80% of Internet Traffic Will Be Agent-to-Service

https://www.silasreinagel.com/ai/agents/web/technology/future/2026/01/08/web-pages-are-not-the-fu...
2•SilasReinagel•9m ago•0 comments

Cloudspecs: Cloud Hardware Evolution Through the Looking Glass

http://muratbuffalo.blogspot.com/2026/01/cloudspecs-cloud-hardware-evolution.html
1•speckx•9m ago•0 comments

Boston Dynamics and Google DeepMind partners on AI-powered Atlas robots

https://scienceclock.com/boston-dynamics-google-deepmind-atlas-robots/
1•akg130522•10m ago•1 comments

Ask HN: How do you handle the quantity of AI content in your feeds?

1•jbms•12m ago•0 comments

Research finds women use generative AI less, due to moral concerns

https://www.unite.ai/research-finds-women-use-generative-ai-less-due-to-moral-concerns/
1•binning•13m ago•0 comments

Show HN: I built a free platform for calculators, generators and quizzes

https://ournethelps.com/
1•sanjeevkumardev•14m ago•1 comments

DHS Invokes Immigration Enforcement to Justify Gathering Americans' DNA

https://reason.com/2026/01/09/dhs-invokes-immigration-enforcement-to-justify-gathering-americans-...
5•pseudolus•14m ago•0 comments

Show HN: BuildFix– Semantic feature-extraction transfer between TypeScript repos

https://www.buildfix.dev/
1•RichBennett•16m ago•0 comments

Chemics – Python package for chemical engineering

https://github.com/wigging/chemics
1•nateb2022•17m ago•0 comments

Sodium-Ion Batteries Can Charge Faster Than Lithium-Ion Ones

https://www.tus.ac.jp/en/mediarelations/archive/20251217_7418.html
1•phyzix5761•19m ago•0 comments

Dyalog and AI // Stefan Kruger // DYNA Fall 2025 [video]

https://www.youtube.com/watch?v=H_wdKeJ8gt4
1•pillowshift•19m ago•1 comments

Show HN: CLIs Are All You Need for Agents

https://github.com/caesarnine/binsmith
1•binalpatel•21m ago•0 comments

Common food preservatives linked to cancer and type 2 diabetes

https://www.cnn.com/2026/01/07/health/food-preservatives-cancer-diabetes-wellness
1•koolhead17•22m ago•2 comments

3D-printed PCB made with liquid metal and PVA

https://www.tomshardware.com/3d-printing/3d-printed-pcb-made-with-pva-and-liquid-metal-is-fully-r...
2•r-bt•22m ago•0 comments

Deep Learning for Molecules and Materials

https://dmol.pub/index.html
1•abracos•24m ago•0 comments

Answer Set Programming (2019) [pdf]

https://www.cs.utexas.edu/~vl/teaching/378/ASP.pdf
1•todsacerdoti•25m ago•0 comments

Name That Part: 3D Part Segmentation and Naming

https://arxiv.org/abs/2512.18003
1•unisub_guy•26m ago•0 comments

Show HN: See how LLM providers will make money off of you

1•boh•26m ago•0 comments

Developers Are Solving the Wrong Problem

https://caseysoftware.com/blog/developers-are-solving-the-wrong-problem
1•speckx•27m ago•0 comments

Inlining – The Ultimate Optimisation

https://xania.org/202512/17-inlining-the-ultimate-optimisation
2•PaulHoule•28m ago•0 comments

AI Models Are Starting to Learn by Asking Themselves Questions

https://www.wired.com/story/ai-models-keep-learning-after-training-research/
1•ryan_j_naughton•28m ago•0 comments

Using and Managing Consents in an Express App (2023)

https://fusionauth.io/blog/consents-example
1•mooreds•29m ago•0 comments

McDonald's job post mention that you should lift up to 50 lbs as Qualifications

https://www.indeed.com/jobs?q=usa+tech&vjk=0fb6a6b75fcbfe00
1•danver0•29m ago•3 comments

Show HN: Fart Map

https://fart.mp
1•julien421•29m ago•0 comments