frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

BBC Journalist SEO-Hacks ChatGPT and Google's AI

https://www.bbc.com/future/article/20260218-i-hacked-chatgpt-and-googles-ai-and-it-only-took-20-m...
1•jrmg•14s ago•0 comments

Show HN: SeaRoutes, find the shortest navigable sea routes on the globe

https://searoutes.vercel.app/
1•aayushdutt•1m ago•0 comments

The Rise of the Financial Engineer

https://thefinancialengineer.substack.com/p/the-rise-of-the-financial-engineer
1•gemanor•1m ago•0 comments

Show HN: Next job comes from someone you barely know

https://github.com/navox-labs/network
1•nahrin•2m ago•0 comments

The Predatory Hegemon

https://www.foreignaffairs.com/united-states/predatory-hegemon-walt
1•akbarnama•2m ago•0 comments

US Draft Rules for Power over Nvidia's Global Sales

https://www.bloomberg.com/news/articles/2026-03-05/us-drafts-rules-for-sweeping-power-over-nvidia...
2•spenvo•3m ago•1 comments

A Guide to Wine Certification Programs

https://www.wineenthusiast.com/basics/guide-to-wine-certification-programs/
1•Anon84•3m ago•0 comments

Iranian strikes on Amazon data centers highlight industry's vulnerability

https://apnews.com/article/amazon-aws-data-center-uae-iran-bahrain-71066b0a822c4cfd88b61e3fe79af917
1•gritzko•4m ago•0 comments

The Download: The startup that says it can stop lightning, and inside OpenAI's

https://www.technologyreview.com/2026/03/03/1133900/the-download-the-startup-that-says-it-can-sto...
1•joozio•5m ago•0 comments

Building a Database on S3

http://muratbuffalo.blogspot.com/2026/03/building-database-on-s3.html
1•todsacerdoti•6m ago•0 comments

The largest open-source humanized voice library

https://github.com/jaymunshi/open-swara
1•Anon84•7m ago•0 comments

Congress Is Considering Abolishing Your Right to Be Anonymous Online

https://theintercept.com/2026/03/05/kosa-online-age-verification-free-speech-privacy/
4•cdrnsf•7m ago•0 comments

Olmo Hybrid

https://allenai.org/papers/olmo-hybrid
1•tosh•8m ago•0 comments

Show HN: RedDragon, LLM-assisted IR analysis of code across languages

https://github.com/avishek-sen-gupta/red-dragon
1•armorer•8m ago•1 comments

Exfiltrating passwords with no interaction using autofill

https://varun.ch/posts/autofill/
1•varun_ch•8m ago•0 comments

Show HN: Plought – Reduce noise in decision making

https://plought.app
1•robinoross•8m ago•0 comments

The Brand Age

https://paulgraham.com/brandage.html
2•bigwheels•9m ago•0 comments

We Only Accept Pre-Revenue Projects

https://www.leanvibe.io/blog/bp-1772314620433
1•LeanVibe•10m ago•1 comments

My application programmer instincts failed when debugging assembler

https://landedstar.com/blog/posts/how-my-application-programmer-instincts-failed-when-debugging-a...
1•lifefeed•10m ago•0 comments

Launch HN: Vela (YC W26) – AI for complex scheduling

4•Gobhanu•10m ago•0 comments

Which H100 Instance to Train Nanochat – Benchmarking PCIe, SXM, and NVL

https://bluenotebook.io/blog/h100-nanochat-training/
1•k2so•11m ago•1 comments

Düren's Hydrogen Bet: The Math Behind a Looming Liability

https://cleantechnica.com/2026/03/01/durens-hydrogen-bet-the-math-behind-a-looming-liability/
1•simonebrunozzi•13m ago•0 comments

Using Structured Light Scanning and Photogrammetry in Cultural Heritage

https://www.mdpi.com/2078-2489/17/3/237
1•PaulHoule•13m ago•0 comments

Financial AGI announced – outperforms human experts on 12 professional exams

https://portfoliopilot.com/technology/financial-agi
2•aharm•14m ago•1 comments

Most AI agent demos won't survive enterprise security review

1•ibreakthecloud•15m ago•0 comments

Show HN: Experiment- enforcing accessibility guardrails during AI UI generation

https://github.com/nadav-senseit/ai-accessibility-enforcement
1•nadav_senseit•15m ago•0 comments

Ask HN: Have you noticed how the number of 'Show HN' posts has skyrocketed?

1•nodivbyzero•15m ago•1 comments

CSUN Assistive Technology Conference 2026 files

https://daisy.org/news-events/articles/csun-conference-2026-files/
1•bryanrasmussen•15m ago•0 comments

Show HN: Chatddit.com Fresh off the vibe press

1•gitprolinux•16m ago•0 comments

I'm a Coin Boy, Too (2023)

https://taylor.town/coin-boys
1•surprisetalk•16m ago•0 comments
Open in hackernews

Show HN: OmoiOS–190K lines of Python to stop babysitting AI agents (Apache 2.0)

https://github.com/kivo360/OmoiOS
2•kanddle•1h ago
AI coding agents generate decent code. The problem is everything around the code - checking progress, catching drift, deciding if it's actually done. I spent months trying to make autonomous agents work. The bottleneck was always me.

Attempt 1 - Claude/GPT directly: works for small stuff, but you re-explain context endlessly.

Attempt 2 - Copilot/Cursor: great autocomplete, still doing 95% of the thinking.

Attempt 3 - continuous agents: keeps working without prompting, but "no errors" doesn't mean "feature works."

Attempt 4 - parallel agents: faster wall-clock, but now you're manually reviewing even more output.

The common failure: nobody verifies whether the output satisfies the goal. That somebody was always me. So I automated that job.

OmoiOS is a spec-driven orchestration system. You describe a feature, and it:

1. Runs a multi-phase spec pipeline (Explore > Requirements > Design > Tasks) with LLM evaluators scoring each phase. Retry on failure, advance on pass. By the time agents code, requirements have machine-checkable acceptance criteria.

2. Spawns isolated cloud sandboxes per task. Your local env is untouched. Agents get ephemeral containers with full git access.

3. Validates continuously - a separate validator agent checks each task against acceptance criteria. Failures feed back for retry. No human in the loop between steps.

4. Discovers new work - validation can spawn new tasks when agents find missing edge cases. The task graph grows as agents learn.

What's hard (honest):

- Spec quality is the bottleneck. Vague spec = agents spinning. - Validation is domain-specific. API correctness is easy. UI quality is not. - Discovery branching can grow the task graph unexpectedly. - Sandbox overhead adds latency per task. Worth it, but a tradeoff. - Merging parallel branches with real conflicts is the hardest problem. - Guardian monitoring (per-agent trajectory analysis) has rough edges still.

Stack: Python/FastAPI, PostgreSQL+pgvector, Redis (~190K lines). Next.js 15 + React Flow (~83K lines TS). Claude Agent SDK + Daytona Cloud. 686 commits since Nov 2025, built solo. Apache 2.0.

I keep coming back to the same problem: structured spec generation that produces genuinely machine-checkable acceptance criteria. Has anyone found an approach that works for non-trivial features, or is this just fundamentally hard?

GitHub: https://github.com/kivo360/OmoiOS Live: https://omoios.dev

Comments

kanddle•1h ago
Creator here. TL;DR: OmoiOS takes a feature description, generates structured specs with acceptance criteria, dispatches agents to isolated cloud sandboxes, validates each task autonomously, and produces a PR. You review the PR, not every intermediate step.

The core insight: AI coding tools are great at generating code, but someone still has to verify the output matches the goal. Usually that someone is you. OmoiOS automates that oversight loop.

How this compares to what you're probably using:

- vs Claude Code / Cursor: great interactive tools where you're in the loop. OmoiOS is for when you want to write the spec, approve the plan, and walk away.

- vs Codex: both produce PRs, but Codex is prompt-driven (individual tasks). OmoiOS is spec-driven (full feature lifecycle). Also open-source and not locked to one provider.

- vs Kiro: both spec-driven, but Kiro is a VS Code fork for interactive work. OmoiOS runs autonomously in the cloud. Also open-source, self-hostable, multi-model.

- vs CrewAI / LangGraph: agent frameworks (primitives). OmoiOS is an opinionated system — full lifecycle from spec to PR.

- vs Devin: OmoiOS is open-source, self-hostable, shows you the plan before executing. Devin is a black box.

Built with Claude Agent SDK + FastAPI + PostgreSQL + Next.js 15. Apache 2.0 — fork it, self-host it, build on it.

Happy to go deep on the spec pipeline, the validation loop, or the multi-agent coordination.

genxy•23m ago
The pervasive use of AI to write posts makes them exhausting to read.