We built a strict AI due-diligence tool. Looking for technical criticism

1•Modular_Hallway•1h ago
We’re experimenting with something called Zeus and would love critical feedback.

The problem we’re targeting: AI evaluation today is mostly hype, cherry-picked benchmarks, and inconsistent model cards. It’s hard to reason about risk, uncertainty, and missing information before deploying or buying a model.

What Zeus does (MVP v0.1):
- Takes a minimal description of an AI model or AI-powered tool
- Generates standardized ModelCard-style metadata
- Runs a structured multi-expert analysis (performance, safety, systems, UX, innovation)
- Forces explicit disagreement where evidence conflicts
- Scores categories based only on disclosed evidence
- Outputs a threat/misuse model and improvement roadmap
- Produces deterministic, machine-readable JSON
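To make "forces explicit disagreement where evidence conflicts" concrete, here is a rough sketch of one possible reading. It is not Zeus's actual code: the `Assessment` type, the 0-10 score scale, the `threshold` parameter, and the `find_disagreements` helper are all hypothetical names made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Assessment:
    expert: str    # reviewing perspective, e.g. "performance" or "safety"
    category: str  # category being scored
    score: int     # hypothetical 0-10 scale (an assumption, not from the post)
    evidence: str  # the disclosed statement the score rests on


def find_disagreements(assessments, threshold=3):
    """Return categories whose expert scores differ by at least `threshold`."""
    by_category = {}
    for a in assessments:
        by_category.setdefault(a.category, []).append(a)

    conflicts = {}
    # Iterate in sorted order so the result is stable across runs.
    for category, items in sorted(by_category.items()):
        scores = [a.score for a in items]
        if max(scores) - min(scores) >= threshold:
            conflicts[category] = sorted(items, key=lambda a: a.expert)
    return conflicts
```

Under these assumptions, a category where two experts score the same disclosed evidence very differently would be reported with both positions side by side rather than averaged away.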

Constraints:
- No model execution
- No benchmarks
- No rankings
- Missing info is explicitly marked as “unknown”
- No assumptions or fabricated facts
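As a minimal sketch of how the "unknown" rule and deterministic JSON output could fit together: the field names in `REQUIRED_FIELDS` and both helper functions are assumptions for illustration, since the post does not publish Zeus's schema.

```python
import json

# Hypothetical field names; the real Zeus schema is not given in the post.
REQUIRED_FIELDS = (
    "name",
    "intended_use",
    "training_data",
    "evaluation",
    "limitations",
)


def build_card(disclosed):
    """Copy disclosed fields and mark every missing or empty one as "unknown"."""
    card = {}
    for field in REQUIRED_FIELDS:
        value = disclosed.get(field)
        # Nothing is inferred or filled in: absent evidence stays "unknown".
        card[field] = value if value else "unknown"
    return card


def to_deterministic_json(card):
    """Serialize so that equal input always yields byte-identical output."""
    return json.dumps(card, sort_keys=True, separators=(",", ":"), ensure_ascii=False)
```

Sorting keys and fixing the separators means the same description produces the same bytes regardless of the order fields were supplied in, which makes outputs easy to diff or hash.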

Think of it as a conservative due-diligence engine, not a judge of “best models.”

Questions we’re trying to answer before going further:
- Is evaluation without execution still useful?
- Does forced disagreement increase or decrease trust?
- Where would this actually fit in real workflows?

Brutal criticism welcome.

So I built a Neuro Symbolic AI, it remembers itself and you

https://signal-zero.ai
1•klietus•54s ago•1 comments

Build vs. buy for regulated clinical alerting systems?

https://actimi.com/signals
1•peppernub•1m ago•1 comments

Show HN: I built a fast cross-platform CLI network scanner via Python

https://github.com/mennylevinski/light-net-scanner
1•mennylevinski•5m ago•1 comments

Mesa's "Present Timing" Vulkan Driver Support Now Feature Complete

https://www.phoronix.com/news/Mesa-VK_EXT_present_timing
1•6581•7m ago•0 comments

Networking Is the Hydra of Kubernetes

https://redmonk.com/kholterhoff/2025/11/18/networking-is-the-hydra-of-kubernetes/
1•mooreds•8m ago•0 comments

Operation Catahoula Crunch

https://www.turtlediaries.net/p/operation-catahoula-crunch
1•mooreds•8m ago•0 comments

Can Trump's Peace Initiative Stop the Congo's Thirty-Year War?

https://www.newyorker.com/magazine/2025/12/01/can-trumps-peace-initiative-stop-the-congos-thirty-...
1•PaulHoule•9m ago•0 comments

Twelve Days of AI

https://www.12daysofai.app/
1•twitchard•9m ago•0 comments

Pay transparency set to disrupt companies' total rewards strategies

https://www.hr-brew.com/stories/2025/12/02/pay-transparency-total-rewards-strategies
1•mooreds•9m ago•0 comments

Importance of self reflection in a complex world

https://programmer.network/aleksandar/articles/daily-reflection-diary-a-self-inflicted-11
1•agjs•10m ago•1 comments

Show HN: CryptoBob – a market intelligence API for crypto trading systems

https://cryptobob.de/
1•jallewa•10m ago•0 comments

Show HN: Open-Source Notion MCP Server (TypeScript, SSE, Apify)

https://github.com/piskunproject/notion-mcp-server
1•piskunlab•11m ago•0 comments

Gnome Now Forbids Vibe Coded Extensions

https://gjs.guide/extensions/review-guidelines/review-guidelines.html
1•speckx•11m ago•0 comments

Show HN: PHP Claude Agents

https://github.com/claude-php/claude-php-agent
1•dalemhurley•11m ago•0 comments

Supersized data centers are coming. See how they will transform America

https://www.washingtonpost.com/climate-environment/interactive/2025/giant-data-centers-energy-pol...
1•voxleone•12m ago•0 comments

Show HN: Stimm – Low-Latency Voice Agent Platform (Python/WebRTC)

https://github.com/stimm-ai/stimm
1•etienne_l•12m ago•1 comments

Show HN: Far RAG API – Semantic Search for Federal Regulations (OpenAPI)

1•blueskyline•14m ago•0 comments

Show HN: ZKS – A Split-Key Mesh Protocol for Private File Transfer (Rust/WASM)

https://github.com/cswasif/zks
1•cswasif•14m ago•0 comments

Job apocalypse? AI is creating brand new occupations

https://www.economist.com/business/2025/12/14/job-apocalypse-humbug-ai-is-creating-brand-new-occu...
2•bookofjoe•15m ago•1 comments

Show HN: DeepAudit – open-source auditing agent (LLMs and Static Analysis)

https://github.com/lintsinghua/DeepAudit
1•lintsinghua•17m ago•0 comments

Show HN: FocusFour – Eisenhower Matrix on Apple Reminders

https://www.focusfour.app/
1•qzcanoe•17m ago•0 comments

Strategy Before Metrics

https://fastwonderblog.com/2025/06/24/strategy-before-metrics/
2•gpi•18m ago•0 comments

LG TV users baffled by unremovable Microsoft Copilot installation

https://www.tomshardware.com/service-providers/tv-providers/lg-tv-update-adds-non-removable-micro...
1•speckx•18m ago•0 comments

Show HN: Ace interviews with AI, real time feedback

https://intermock.com
1•michelutti•19m ago•0 comments

Lalanne Hippo Bar Sells for Record-Breaking $31.4M

https://galeriemagazine.com/lalanne-hippo-bar-sells-for-record-breaking-31-4-million-at-sothebys/
1•olalonde•19m ago•0 comments

Being a SysAdmin Is Hard

https://about.tree.ht/blog/treehut-outages-december-2025
1•gpi•20m ago•0 comments

China's AI Power Play: Cheap Electricity from Biggest Grid

https://www.wsj.com/tech/china-ai-electricity-data-centers-d2a86935
1•JumpCrisscross•23m ago•0 comments

AI is a modeling problem – a non LLM idea

https://theantagonistai.substack.com/p/ai-is-a-modeling-problem
1•theantagonistai•23m ago•0 comments

Constructor Theory: expressing laws in terms of what is possible or impossible

https://www.constructortheory.org/
1•tesserato•24m ago•0 comments

Bass OS

https://bassos.navotpala.tech/
1•rcarmo•24m ago•0 comments