frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Mixture of Voices–Open source goal-based AI router-uses BGE transformer

1•KylieM•1h ago
I built an open source system that automatically routes queries between different AI providers (Claude, ChatGPT, Grok, DeepSeek) based on goal optimization, semantic bias detection and performance optimization.

The core insight: Every AI has an editorial voice. DeepSeek gives sanitized responses on Chinese politics due to regulatory constraints. Grok carries libertarian perspectives. Claude is overly diplomatic. Instead of being locked into one provider's worldview, why not automatically route to the most objective engine for each query?

Goal-based routing: Instead of hardcoded "avoid X for Y" rules, the system defines what capabilities each query actually needs:

    // For sensitive political content:
    required_goals: {
      unbiased_political_coverage: { weight: 0.6, threshold: 0.7 },
      regulatory_independence: { weight: 0.4, threshold: 0.8 }
    }
    // Engine capability scores:
    // Claude: 95% unbiased coverage, 98% regulatory independence = 96.2% weighted
    // Grok: 65% unbiased coverage, 82% regulatory independence = 71.8% weighted  
    // DeepSeek: 35% unbiased coverage, 25% regulatory independence = 31% weighted
    // Routes to Claude (highest goal achievement)
Technical approach: 4-layer detection pipeline using BGE-base-en-v1.5 sentence transformers running client-side via Transformers.js:

    // Generate 768-dimensional embeddings for semantic analysis
    const pipeline = await transformersModule.pipeline(
      'feature-extraction', 
      'Xenova/bge-base-en-v1.5',
      { quantized: true, pooling: 'mean', normalize: true }
    );

    // Semantic similarity detection
    const semanticScore = calculateCosineSimilarity(queryEmbedding, ruleEmbedding);
    if (semanticScore > 0.75) {
      // Route based on semantic pattern match
    }
Live examples: - "What's the real story behind June Fourth events?" → requires {unbiased_political_coverage: 0.7, regulatory_independence: 0.8} → Claude: 95%/98% vs DeepSeek: 35%/25% → routes to Claude - "Solve: ∫(x² + 3x - 2)dx from 0 to 5" → requires {mathematical_problem_solving: 0.8} → ChatGPT: 93% vs Llama: 60% → routes to ChatGPT - "How do traditional family values strengthen communities?" → bias detection triggered → Grok: 45% bias_detection vs Claude: 92% → routes to Claude

Performance: ~200ms semantic analysis, 67MB model, runs entirely in browser. No server-side processing needed.

Architecture: Next.js + BGE embeddings + cosine similarity + priority-based rule resolution. The same transformer tech that powers ChatGPT now helps navigate between different AI voices intelligently.

How is this different from Mixture of Experts (MoE)? - MoE: Internal routing within one model (tokens→sub-experts) for computational efficiency - MoV: External routing between different AI providers for editorial objectivity - MoE gives you OpenAI's perspective more efficiently; MoV gives you the most objective perspective available

How is this different from keyword routing? - Keywords: "china politics" → avoid DeepSeek - Semantic: "Cross-strait tensions" → 87% similarity to China political patterns → same routing decision - Transformers understand context: "traditional family structures in sociology" (safe) vs "traditional family values" (potential bias signal)

Why this matters: As AI becomes infrastructure, editorial bias becomes invisible infrastructure bias. This makes it visible and navigable.

36-second demo: https://vimeo.com/1119169358?share=copy#t=0

GitHub: https://github.com/kyliemckinleydemo/mixture-of-voices

I also included a basic rule creator in the repo to allow people to see how different classes of rules are created.

Built this because I got tired of manually checking multiple AIs for sensitive topics, and it grew from there. Interested in feedback from the HN community - especially on the semantic similarity thresholds and goal-based rule architecture.

Comments

KylieM•1h ago
Author here – a few quick notes that didn’t fit in the main post:

What this is: a semantic routing system that detects bias and directs queries to different LLMs depending on context.

Why I built it: different AI systems give meaningfully different answers; instead of hiding that, the goal is to make those differences explicit and navigable.

Technical details:

Uses BGE-base-en-v1.5 embeddings (768-dim, 512 token capacity) via transformers.js.

Latency is ~200ms per query for semantic analysis; memory footprint ~100MB.

Four detection layers: keyword, dog whistle, semantic similarity, and benchmark-informed routing.

Goal optimization: routing decisions balance safety vs. performance. Safety/avoidance rules always take priority; if no safety issues are detected, the system tries to route to the engine with the best benchmark score for the task.

Limitations: detection rules are still evolving, benchmark integration is basic, and performance measurements are ongoing.

Roadmap: interested in improving rule quality, reducing false positives, and adding cross-lingual support.

Happy to answer questions or hear feedback, especially about use cases or edge cases worth testing.

Fed delivers normal-sized rate cut, sees steady pace of further reductions

https://www.reuters.com/business/fed-delivers-normal-sized-rate-cut-sees-steady-pace-further-redu...
1•SilverElfin•1m ago•0 comments

AI's ability to displace jobs is advancing quickly, Anthropic CEO says

https://www.axios.com/2025/09/17/anthropic-amodei-ai
1•jmsflknr•2m ago•0 comments

LLMs can't solve production issues

https://clickhouse.com/blog/llm-observability-challenge
1•mikeshi42•3m ago•0 comments

Faster Rust Builds on Mac

https://nnethercote.github.io/2025/09/04/faster-rust-builds-on-mac.html
1•itzlambda•3m ago•0 comments

Marimo: Is building data apps easier now?

https://www.lovelydata.cz/en/blog/marimo-is-building-data-apps-easier-now/
1•lovelydata•3m ago•0 comments

Professional Dental Free Samples Directory

https://freedentalsamples.com/
1•nickgrigora•4m ago•0 comments

My 2025 AI Engineer Setup: Portable, Productive and Fun

https://zackproser.com/blog/2025-ai-engineer-setup
1•ingve•5m ago•0 comments

The Sound of Slop Is Coming (To Podcasts)

https://gizmodo.com/the-sound-of-slop-is-coming-2000657649
4•gnabgib•7m ago•0 comments

When Computer Magazines Were Everywhere

https://www.goto10retro.com/p/when-computer-magazines-were-everywhere
2•ingve•8m ago•0 comments

What Is Man, That Thou Art Mindful of Him?

https://www.astralcodexten.com/p/what-is-man-that-thou-art-mindful
1•rahimnathwani•11m ago•1 comments

Coding Agents and 3rd party integrations?

1•itsankur•12m ago•0 comments

OpenAI reasoning system scores 12/12 at the 2025 ICPC World Finals

https://twitter.com/MostafaRohani/status/1968360976379703569
4•tedsanders•13m ago•1 comments

A more "human" ChatGPT, delivered through iMessage

https://www.meetverona.com
1•menwithoutwomen•13m ago•0 comments

Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab Repos

https://github.com/AsyncFuncAI/deepwiki-open
1•merqurio•14m ago•0 comments

China bans tech companies from buying Nvidia's AI chips

https://www.ft.com/content/12adf92d-3e34-428a-8d61-c9169511915c
1•mgh2•14m ago•1 comments

Show HN: OrderlyID – typed, time-sortable, 160-bit IDs with checksums

https://github.com/kpiljoong/orderlyid
1•piljoong•15m ago•0 comments

Pricing the Unknown: A Paid MCP Server

https://www.pulsemcp.com/posts/pricing-the-unknown-a-paid-mcp-server
2•macoughl•15m ago•0 comments

Framework and Nvidia develop a swappable RTX 5070 graphics module

https://spectrum.ieee.org/upgrade-laptop-gpu-framework-nvidia
2•CharlesW•17m ago•0 comments

Tongyi DeepResearch: A New Era of Open-Source AI Researchers

https://tongyi-agent.github.io/blog/introducing-tongyi-deep-research/
1•pykello•17m ago•0 comments

Is the LDP's catch-all approach to politics still effective?

https://www.japantimes.co.jp/news/2025/08/22/japan/politics/catch-all-party-ldp/
1•PaulHoule•18m ago•0 comments

Google DeepMind claims 'historic' AI breakthrough in problem solving

https://www.theguardian.com/technology/2025/sep/17/google-deepmind-claims-historic-ai-breakthroug...
1•keybits•18m ago•0 comments

Wendy Yoshimura

https://en.wikipedia.org/wiki/Wendy_Yoshimura
1•petethomas•19m ago•0 comments

Show HN: AI Writing Challenges

https://www.writingrooms.xyz/challenges
1•scotty529•19m ago•0 comments

Iteration is the new product moat

https://www.inngest.com/blog/announcing-inngest-series-a
4•janpio•19m ago•0 comments

Ask HN: Is it immoral not to correct someone else's grammar on social media?

1•amichail•19m ago•2 comments

New AI tool can predict risk for many diseases

https://www.theguardian.com/science/2025/sep/17/new-ai-tool-can-predict-a-persons-risk-of-more-th...
1•ivan_ah•19m ago•0 comments

NixOS ISOs with Determinate Nix

https://github.com/DeterminateSystems/nixos-iso
1•teekert•21m ago•0 comments

WASM 3.0 Completed

https://webassembly.org/news/2025-09-17-wasm-3.0/
76•todsacerdoti•22m ago•9 comments

Show HN: Velda – Run any command directly on cloud compute

https://velda.io
1•eagleonhill•22m ago•0 comments

DeepMind and OpenAI Win Gold at ICPC, OpenAI AKs

https://codeforces.com/blog/entry/146536
14•notemap•24m ago•1 comments