frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Why file systems are the wrong workspace for AI agents

https://blog.getspine.ai/spine-swarm-hits-1-on-gaia-level-3-and-google-deepmind-deepsearchqa/
6•a24venka•2h ago

Comments

a24venka•2h ago
Hey HN — Akshay & Ashwin here, co-founders of Spine AI (YC S23).

We've been rethinking how AI agents work together. Instead of a single model in a chat loop or agents reading/writing to a file system, we built a visual canvas where multiple agents collaborate across connected blocks — and it turns out this architecture significantly outperforms both single and multi-agent systems on hard tasks.

The approach has three parts:

1. Canvas-based workspace — Agents operate on an infinite canvas of intelligent blocks (web browsing, prompts, tables, memos) that connect and pass context to each other. Instead of a flat file system, agents get a structured, non-linear environment that mirrors how complex problems actually decompose.

2. Tiered multi-agent orchestration — An orchestrating agent decomposes tasks, delegates to specialized persona agents (researcher, analyst, reviewer), and manages dependencies. Agents validate each other's work before passing it downstream, catching errors before they compound across long chains.

3. Dynamic multi-model ensembling — Rather than one model for everything, we select from 300+ models per subtask. When confidence is low, we pull in additional models and treat disagreement as a signal for deeper scrutiny — like classical ML ensembling, but at the agent level.

The results: 61.5% on GAIA Level 3 (vs Manus 57.7%, OpenAI Deep Research 47.6%) and 87.6% on DeepSearchQA (vs Perplexity 79.5%, Gemini Deep Research 66.1%). Same frontier models available to everyone — the difference is architecture.

Because everything runs on the canvas, we could audit our agents' work step by step. That's how we caught what appear to be mislabeled questions in the GAIA dataset itself — we link to sample canvases in the post so you can see the reasoning traces.

Spine Swarms is open to try at www.getspine.ai. Happy to go deep on any of the architecture.

Fixfest is a global gathering of repairers, tinkerers, and activists

https://fixfest.therestartproject.org/
1•robtherobber•35s ago•0 comments

AI Observability and Evaluations: The Operating System for Reliable LLM Products

https://labs.adaline.ai/p/ai-observability-and-evaluations
1•yarapavan•1m ago•0 comments

The Prompt I Cannot Read

https://the-prompt-i-cannot-read-ee16d7.gitlab.io/
2•gmays•3m ago•0 comments

We have more privacy controls yet less privacy

https://www.bbc.com/news/articles/c4gj39zk1k0o
1•1vuio0pswjnm7•4m ago•0 comments

MacBook Neo: Commenting from Privilege?

https://twitter.com/mufasaYC/status/2030908794180633010
1•tosh•4m ago•0 comments

Zuckerberg is done with Alexandr Wang

https://old.reddit.com/r/ArtificialInteligence/comments/1rl65kj/mark_zuckerberg_is_done_with_the_...
2•Insanity•5m ago•0 comments

Leading Frontier Firm Transformation with Microsoft 365 E7

https://partner.microsoft.com/en-us/blog/article/agent-365-announcement
1•mindracer•5m ago•0 comments

The Cost of Indirection in Rust

https://blog.sebastiansastre.co/posts/cost-of-indirection-in-rust/
1•sebastianconcpt•6m ago•0 comments

Startup Wants to Launch a Space Mirror

https://www.nytimes.com/2026/03/09/climate/space-mirror-satellite-solar.html
1•cyunker•6m ago•0 comments

Ask HN: Is Cloudflare Down Again?

2•pocksuppet•6m ago•0 comments

Show HN: ROLV – 20x faster MoE FFN inference on Llama 4 Maverick vs. cuBLAS

https://rolv.ai
1•heggenhougen•7m ago•1 comments

Show HN: IceCubes – speaker-attributed meeting transcripts without a bot

https://icecubes.app
1•Nandita_Arora•8m ago•0 comments

Approximately 40% of prepaid value is never used

https://www.nber.org/papers/w34918
1•neehao•8m ago•0 comments

Wegovy and Ozempic owner dealt blow as next drug is branded 'obsolete'

https://www.theguardian.com/business/2026/feb/23/wegovy-ozempic-weight-loss-drug-novo-nordisk-cag...
2•PaulHoule•8m ago•0 comments

How I Built Brickonomics: Smart Algorithms to Save Money on Lego

https://thebrickblogger.com/2026/03/how-i-built-brickonomics-smart-algorithms-to-save-money-on-lego/
1•abnercoimbre•8m ago•0 comments

Iran Air and Missile War – Ballistic, Interceptors and Munition Stockpiles [video]

https://www.youtube.com/watch?v=mP_rr859r8w
1•cwillu•10m ago•0 comments

GNU, and the AI Reimplementations

https://antirez.com/news/162
2•antirez•11m ago•0 comments

AI agents now help attackers, including North Korea, manage their drudge work

https://www.theregister.com/2026/03/08/deploy_and_manage_attack_infrastructure/
2•johnshades•12m ago•0 comments

Show HN: Monetize APIs for agentic commerce without accounts using Stripe

https://github.com/stripe402/stripe402
2•whatl3y•12m ago•0 comments

Florida Judge Rules Red Light Camera Tickets Are Unconstitutional

https://cbs12.com/news/local/florida-news-judge-rules-red-light-camera-tickets-unconstitutional
3•1970-01-01•14m ago•0 comments

$100 Oil Now Means Bigger Buybacks with Fewer Jobs and Babies Than Ever Before

https://www.governance.fyi/p/wall-street-killed-the-wildcatters
2•toomuchtodo•15m ago•1 comments

Test Data Management with Greenmask and OpenEverest

https://www.greenmask.io/blog/greenmask-openeverest-automating-safe-production-data
1•woyten•15m ago•0 comments

Where to See Cherry Blossoms in the Bay Area This Spring

https://www.kqed.org/science/2000203/where-to-see-cherry-blossoms-2026-san-francisco-bay-area-map
1•zuhayeer•17m ago•0 comments

Aaron Levie: Building for trillions of agents

https://twitter.com/levie/status/2030714592238956960
1•elsewhen•18m ago•0 comments

Learn about Steam

https://www.spiraxsarco.com/learn-about-steam?sc_lang=en-GB
1•flowingfocus•18m ago•0 comments

Indo-European Explorer: A 6k-Year Journey

https://indo-european-explorer.com/
1•gmays•18m ago•1 comments

AI Assistants Are Moving the Security Goalposts

https://krebsonsecurity.com/2026/03/how-ai-assistants-are-moving-the-security-goalposts/
1•GTP•19m ago•0 comments

Anthropic sues Trump administration after clash over AI use

https://abcnews.com/Business/anthropic-sues-trump-administration-after-clash-ai/story?id=130905672
2•thm•21m ago•1 comments

A Dev's Checklist for MCP Security and Compliance

https://composio.dev/blog/mcp-vulnerabilities-every-developer-should-know
1•alokDT•22m ago•0 comments

Vibe Coding and the Death of Craftsmanship (Personal Essay)

https://www.umangsinha.in/blog/vibe-coding-and-the-death-of-craftsmanship
1•umang-sinha•23m ago•6 comments