I’m the co-founder of Neutral News AI: a site that tries to answer a simple question:
“What actually happened here, across multiple biased sources, and can we check the claims against the original articles?”
Link: https://neutralnewsai.com
Analyzer: https://neutralnewsai.com/analyzer
No signup needed to read the news or run a basic analysis.
What it does
• Crawls multiple outlets (left / center / right + wires / gov sites) for the same story.
• Generates a short, neutral summary constrained to those sources (no extra web search).
• Extracts atomic claims (events, numbers, quotes) from the draft.
• Uses an MNLI model to test each claim against the underlying articles (a simplified sketch of this step follows the list):
• entailment → “Supported”
• contradiction → “Refuted”
• neutral → “Inconclusive”
• Surfaces a “receipt ledger” per article: claim text, verdict, quote, source, timestamp.
• Exposes the underlying models on an Analyzer page where you can paste any URL and get:
• political bias score,
• sentiment / subjectivity,
• readability metrics,
• a rough credibility signal.
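To make the verdict mapping concrete, the per-passage check looks roughly like this. It's a minimal sketch rather than the production code: the checkpoint name (microsoft/deberta-large-mnli) is a stand-in for the actual DeBERTa-based model, and the label handling is simplified.

    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    # Stand-in checkpoint for illustration; the production model is a different DeBERTa-based MNLI checkpoint.
    MODEL = "microsoft/deberta-large-mnli"
    tokenizer = AutoTokenizer.from_pretrained(MODEL)
    model = AutoModelForSequenceClassification.from_pretrained(MODEL).eval()

    VERDICTS = {"ENTAILMENT": "Supported", "CONTRADICTION": "Refuted", "NEUTRAL": "Inconclusive"}

    def check_claim(claim: str, evidence_passage: str) -> tuple[str, float]:
        """Return (verdict, confidence) for one claim against one evidence passage."""
        # MNLI convention: the evidence passage is the premise, the extracted claim is the hypothesis.
        inputs = tokenizer(evidence_passage, claim, truncation=True, return_tensors="pt")
        with torch.no_grad():
            probs = model(**inputs).logits.softmax(dim=-1)[0]
        idx = int(probs.argmax())
        label = model.config.id2label[idx].upper()
        return VERDICTS.get(label, "Inconclusive"), float(probs[idx])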
Stack and models
• Backend: Python, PostgreSQL.
• Crawling / aggregation: scheduled scrapers + RSS + manual curated source lists.
• Bias / propaganda detection: transformer-based classifiers fine-tuned on public political news datasets, plus some hand-engineered features (e.g., source-level priors, readability, sentiment). In offline tests I get 93% accuracy on bias detection (happy to share more detail if people care).
• Claim extraction: sentence segmentation + a lightweight classifier to label check-worthy clauses (counts, quotes, time-bound events, entity claims).
• Fact-checking: MNLI model (currently DeBERTa-based) over (claim, evidence-passage) pairs, with heuristics to merge verdicts across multiple snippets (sketched after this list).
• Frontend: Angular + server-rendered news pages for speed and SEO.
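The snippet-merging step is roughly the following heuristic. Again a simplified sketch: the 0.7 threshold and the disagreement rule are illustrative, not the exact production rules.

    def merge_verdicts(per_passage: list[tuple[str, float]], threshold: float = 0.7) -> str:
        """Collapse per-passage (verdict, confidence) pairs into one verdict for a claim."""
        supported = [c for v, c in per_passage if v == "Supported" and c >= threshold]
        refuted = [c for v, c in per_passage if v == "Refuted" and c >= threshold]
        if supported and refuted:
            return "Inconclusive"  # sources disagree: surface the conflict rather than pick a side
        if supported:
            return "Supported"
        if refuted:
            return "Refuted"
        return "Inconclusive"

Each (verdict, confidence) pair comes from the per-passage check sketched above; the winning quote, source, and timestamp are what end up in the receipt ledger.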
The methodology is documented in more detail here:
https://neutralnewsai.com/methodology
What I’m unsure about
• How far I can push MNLI-style models before needing a more explicit retrieval-augmented system or custom architectures.
• Whether my current claim extraction approach is good enough for high-stakes use, or if I should move to a more formal information extraction pipeline.
• How to expose uncertainty and failure modes in a way that’s actually useful for non-technical readers.
Why I’m posting
I’d like feedback from this community on:
• ML / NLP choices you strongly disagree with.
• Evaluation: what would be a more convincing test suite or benchmark?
• UI/UX for showing “supported/refuted/inconclusive” without overselling model confidence.
I’m very open to critique. If you think this is conceptually wrong or socially dangerous, I’d also like to hear that argument.
Thanks for reading, Marcell