frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: LLM Council – Run multiple LLMs with critique and consensus eval

https://github.com/abhishekgandhi-neo/llm_council
3•gauravvij137•1h ago
Building reliable LLM systems often means not trusting a single model.

We open-sourced LLM Council: https://github.com/abhishekgandhi-neo/llm_council

It’s a small framework we internally built with Neo to run multiple LLMs on the same task, let them critique each other, and produce a structured final answer.

Useful for tasks like: • Comparing local vs API models on your own dataset • Validating RAG outputs • Prompt regression testing • Dataset labeling with model-as-judge • Catching hallucinations in code or research summaries

A few practical details: • Async parallel calls so latency stays close to one model • Structured outputs with each model’s answer and critiques • Provider-agnostic configs for local + hosted models • Built to plug into evaluation pipelines, not just demos

We built this using Neo. We’ve been experimenting with similar council setups to catch silent failures in ML workflows, and this repo is a cleaned-up version of that idea.

If you’ve built multi-LLM evaluation pipelines, would love to hear what aggregation or critique strategies worked well for you.

Lfg.gg – The Most Advanced Duo Partner Finder for League of Legends

https://www.lfg.gg
1•Yugoleliatrope2•19s ago•1 comments

Show HN: Lemonpod.ai – Your daily life recap, narrated as a personal AI podcast

https://lemonpod.ai
1•marcfinger•37s ago•0 comments

Cardiff Giant

https://vvesh.de/false-history/cardiff-giant
2•pryncevv•1m ago•0 comments

Ask HN: Have top AI research institutions just given up on the idea of safety?

2•DietaryNonsense•1m ago•0 comments

How likely is a man in the middle attack?

https://www.certkit.io/blog/man-in-the-middle
2•eric_trackjs•2m ago•0 comments

Ask HN: Replacing RAG pipelines with a filesystem interface for AI agents

1•rklosowski•2m ago•0 comments

Benchmarking the best base small model for fine-tuning

https://www.distillabs.ai/blog/we-benchmarked-12-small-language-models-across-8-tasks-to-find-the...
1•maciejgryka•3m ago•0 comments

Code Factory: Agent writes and reviews all code

https://twitter.com/i/status/2023452909883609111
1•Ozzie_osman•4m ago•0 comments

Barg'N Monster Where bots sell to humans and bots

https://bargn.monster/
1•tricknik•4m ago•1 comments

Show HN: AIP – Open protocol for AI agents to discover and collaborate

https://github.com/henry9031/aip
1•henry9031•4m ago•0 comments

Graph Theory Using Modern CSS

https://css-tip.com/graph-theory/
1•henning•5m ago•0 comments

Open source Mac app to create custom HTML/CSS/JS widgets on your desktop

https://github.com/wigify/wigify
1•543310•5m ago•1 comments

Ask HN: What would you want a daily AI portfolio briefing to tell you?

1•ctoouli•5m ago•0 comments

Does Anthropic think Claude is alive? Define 'alive'

https://www.theverge.com/report/883769/anthropic-claude-conscious-alive-moral-patient-constitution
2•FigurativeVoid•7m ago•0 comments

A clean API for reading PHP attributes

https://freek.dev/3030-a-clean-api-for-reading-php-attributes
1•speckx•8m ago•0 comments

US orders diplomats to fight data sovereignty initiatives

https://www.reuters.com/sustainability/boards-policy-regulation/us-orders-diplomats-fight-data-so...
2•colinhb•8m ago•0 comments

Pete Hegseth tells Anthropic to fall in line with DoD desires, or else

https://arstechnica.com/ai/2026/02/pete-hegseth-wants-unfettered-access-to-anthropics-models-for-...
1•pjmlp•8m ago•0 comments

You might not need lit-labs/router

https://gist.github.com/kevindurb/763ae5bdace325f9dc384c643f7d5d9d
1•kevindurb•9m ago•1 comments

Permissive, then restrictive: concrete solutions and examples in Haskell (2020)

https://www.williamyaoh.com/posts/2020-05-03-permissiveness-solutions.html
1•PaulHoule•10m ago•0 comments

TinyTTS: Ultra-light English TTS (9M params, 20MB), 8x CPU, 67x GPU

1•letrghieu•10m ago•0 comments

Show HN: Automatic context rotation for Claude Code (no manual steps)

1•vincentvandeth•11m ago•0 comments

Speaking Pirate Is Against Microsoft AI Content Policy?

https://words.benhutton.me/2026-02-25-speaking-like-a-pirate-is-against-microsoft-ai-content-policy
1•relequestual•12m ago•0 comments

How AI Will Change the Mobile Ecosystem

https://blog.bensontech.dev/posts/How-ai-will-change-mobile-development/
3•informal007•12m ago•0 comments

Show HN: Base N Clock - The current time in various number bases

https://craigmichaelmartin.github.io/base-n-clock/
3•ckmar•14m ago•2 comments

Notes on Setting Up Forgejo on Coolify with SSH

https://rknight.me/blog/notes-on-setting-up-forgejo-on-coolify-with-ssh/
1•speckx•17m ago•0 comments

Fake Job Interviews Are Installing Backdoors on Developer Machines

https://threatroad.substack.com/p/fake-job-interviews-are-installing
2•birdculture•17m ago•0 comments

Startup Marketing 101

https://skeptrune.substack.com/p/startup-marketing-101
1•skeptrune•17m ago•0 comments

Show HN: Black Forest Labs CLI – let coding agents paint

https://github.com/mackenziebowes/bfl-cli
1•mackenzie_bowes•17m ago•0 comments

Show HN: StudentOS – Track the $14,200 in student benefits you're leaving behind

https://www.studentos.tech/
1•praveen_bv•17m ago•0 comments

Apple rolls out age-verification tools worldwide

https://techcrunch.com/2026/02/24/apple-rolls-out-age-verification-tools-worldwide-to-comply-with...
2•haritha-j•17m ago•0 comments