frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Autonomous AI benchmark-testing

https://www.jou-labs.com/proof
1•jodytornado•2h ago

Comments

jodytornado•2h ago
Show HN: I built a bare-metal OS with an autonomous reasoning engine in no_std Rust I've been working on this for the last 24 months as a solo developer and wanted to share what I've built and the results we just validated. What it is: PROTOS is a bare-metal operating system (no Linux, no Windows, no cloud) with an integrated reasoning engine called "EARL" (EPISTEMIC ADAPTIVE REASONING LAYER) that does autonomous multi-source intelligence fusion. Everything runs on a single laptop — no GPU, no network connection required. It's 174K+ lines of no_std Rust across 189 modules, including custom NVMe drivers, a filesystem, a knowledge base, the reasoning engine, autonomous learning systems, and safety controls. The core problem it solves: Current AI tools (LLMs included) can't explain their reasoning, give different answers to the same question, require cloud connectivity, hallucinate, and can't be audited. For defense and intelligence applications, this is a non-starter — DoD Directive 3000.09 requires that autonomous systems explain every decision with a verifiable audit trail. No existing AI system meets this standard for intelligence analysis. How it works: EARL uses symbolic reasoning rather than neural networks. It reads raw documents, automatically identifies entities (people, organizations, locations, weapons, financial transactions), discovers hidden connections across sources, learns new concepts from context, and detects when sources contradict each other. Every conclusion is backed by a cryptographically signed (SHA-256) evidence chain. Same inputs always produce the same outputs. What we just proved: We tested EARL against the buildup to Russia's 2022 invasion of Ukraine. We reconstructed 12 intelligence documents spanning satellite imagery analysis, intercepted comms, HUMINT reports, financial intelligence, OSINT, and technical weapons assessments from Oct 2021–Feb 2022. We defined 10 intelligence connections that Five Eyes agencies actually identified during this period (all verifiable against public sources — Maxar imagery, CRS reports, OSCE data, investigative journalism). EARL discovered 8 out of 10 autonomously. No training on the scenario, no pre-labeled data, no human guidance. It read raw text, built its own understanding, and independently arrived at conclusions that took the combined intelligence apparatus months to assemble. The system also correctly enforced safety constraints — connections below the confidence threshold were flagged but blocked from triggering autonomous action, exactly as 3000.09 requires. Technical details for the curious:

Pure no_std Rust, bare-metal execution Custom NVMe drivers and filesystem Symbolic reasoning engine (not neural/statistical) PMI-based knowledge representation with 15M+ edges Three-layer cognitive architecture: symbolic reasoning, metacognition, values/constraints Deterministic — reproducible results on every run Cryptographic forensics chain for full auditability Air-gap capable by design

My background: I'm a civil engineer who transitioned into systems programming after retirement. The technology has received preliminary validation from USASOC analysts who described the "reproducible reasoning" capability as disruptive. We're currently positioned for strategic acquisition and are open to strategic investment for formal DoD certification, team buildout, and field deployment optimization. Happy to answer technical questions about the architecture, the benchmark methodology, or the bare-metal Rust experience.

Can Elon Musk run AI in space?

https://www.economist.com/insider/inside-tech/can-elon-musk-really-run-ai-in-space
1•andsoitis•2m ago•0 comments

Show HN: Vis Pro – A Formula-Based Workout Program Editor

https://vis.fitness/pro
1•strongpigeon•4m ago•1 comments

Revisiting the Steam Controller

https://callmeo.live/blog/revisiting-the-steam-controller/
1•speckx•4m ago•0 comments

Opus 4.6 completed the Blender Donut Tutorial by watching it on YouTube

https://old.reddit.com/r/ClaudeAI/comments/1rdir26/i_had_opus_46_complete_the_entire_blender_donut/
1•bpierre•4m ago•0 comments

Devin 2.2

https://twitter.com/cognition/status/2026343816521994339
2•tosh•6m ago•0 comments

Show HN: Imsg-TUI – A Console App for Sending and Receiving iMessages

https://github.com/plotfi/imsg-tui
1•zer0zzz•6m ago•0 comments

Host Leadership

https://martinfowler.com/bliki/HostLeadership.html
1•rahimnathwani•6m ago•0 comments

Claude Code Remote Control

https://twitter.com/noahzweben/status/2026371260805271615
1•mfiguiere•7m ago•0 comments

Manjaro website off-line again due to lapsed certificate

https://distrowatch.com/dwres.php?resource=showheadline&story=20140
1•hexagonsuns•7m ago•0 comments

Agents of Chaos: a red team study of autonomous LLM agents with full access

https://www.researchgate.net/publication/401123335_Agents_of_Chaos
2•felineflock•9m ago•0 comments

Show HN: Datapoint – replacing mobile ads with data labelling tasks

https://trydatapoint.com/blog-page
1•chancemehmu•9m ago•0 comments

What spec-driven development gets wrong

https://www.augmentcode.com/blog/what-spec-driven-development-gets-wrong
1•thesleepypanda•9m ago•0 comments

npm i chat – One codebase, every chat platform

https://vercel.com/changelog/chat-sdk
1•MaxLeiter•9m ago•0 comments

The vulnerability of aging states (2023)

https://www.pnas.org/doi/10.1073/pnas.2218834120
1•measurablefunc•10m ago•0 comments

Show HN: Open-source EU AI Act compliance layer for AI agents (8/2026 deadline)

1•shotwellj•10m ago•0 comments

Continuous inhalation of essential oil increases gray matter volume in the brain

https://pubmed.ncbi.nlm.nih.gov/38331299/
1•rdgthree•11m ago•0 comments

Influencers are promoting peptides for better health. What does the science say?

https://www.npr.org/2026/02/23/nx-s1-5716162/peptides-science-muscle-growth-longevity-wellness
1•ck2•11m ago•0 comments

I got my phone bill down to $6.25/month after years of overpaying

1•huntsmans•12m ago•0 comments

Add drip email system with onboarding and coverage milestone emails

1•nishiohiroshi•12m ago•0 comments

Agents of Chaos

https://arxiv.org/abs/2602.20021
2•wslh•12m ago•0 comments

Show HN: GenogramAI – Create Genograms in Seconds

1•veritas9•12m ago•0 comments

Use Lyria 3 to create music tracks in the Gemini app

https://blog.google/innovation-and-ai/products/gemini-app/lyria-3/
1•bookofjoe•13m ago•0 comments

Show HN: Tools Are Lying to You

https://cloudstreet-dev.github.io/Your-Tools-Are-Lying-to-You/
2•DavidCanHelp•15m ago•1 comments

Show HN: Recall – A personal CRM you use over text messages

https://www.recall.life/
1•kyledotkyle•15m ago•0 comments

TAWS – The Amiga Workbench Simulation 0.40

https://www.taws.ch/WB.html
1•doener•15m ago•0 comments

Reframed – Open-source alternative to Screen Studio, have editor, auto-zoom

https://github.com/jkuri/Reframed
2•jkuri•17m ago•0 comments

Show HN: MacCoolinator – Putting the "Cool" in Mac

https://github.com/corylevine/MacCoolinator
2•coryxrx•17m ago•0 comments

Inequality aversion can be taught through learning of others' preferences

https://elifesciences.org/articles/102800
1•PaulHoule•18m ago•0 comments

simple timezone tracker

https://time.yaosamo.com/
1•yaosamo•18m ago•0 comments

The whole point of OpenAI's Responses API is to help them hide reasoning traces

https://www.seangoedecke.com/responses-api/
2•dkleinest•18m ago•0 comments