frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Perspectives – I wanted AI to challenge my thinking, not validate it

https://getperspectives.app
2•Jamium•2w ago
I built Perspectives because I got tired of ChatGPT agreeing with everything I said.

Ask any LLM to "consider multiple perspectives" and you get hedged consensus. The model acknowledges trade-offs exist, then settles on a moderate position that offends nobody. Useful for summaries. Useless for decision making.

Perspectives forces disagreement. 8 personas with fundamentally incompatible frameworks debate your question through a structured protocol, then vote using Single Transferable Vote to surface where they actually land. The output is a PDF report synthesising all of it.

How it works

Blind Proposals: Each persona generates a position without seeing the others. This prevents the "anchoring problem" where early responses shape later ones, bypassing the default sycophancy of LLMs.

Interrogation of Blind Proposals: Proposals face structured challenges from 3 opposing personas. A "high-empathy" persona (e.g., The Idealist) will be challenged by a "low-empathy" cluster (e.g., The Pragmatist). This reveals exactly where arguments buckle under pressure.

Discussion & Voting: Personas can debate (optional) before ranking preferences via STV. This highlights first-choice winners and preference flows rather than simple majority rule.

Analysis/Prediction Report: The final PDF structures recommendations first, followed by supporting analysis (factual background, risk assessment, evidence quality).

Two Operational Modes

Analysis Mode ("What should we do?"): Evaluates options and surfaces trade-offs. Output is qualitative judgment.

Prediction Mode ("What will happen?"): Generates probability estimates with resolution criteria.

Feedback Loops

Most AI agent projects have no way to measure whether their outputs are actually good. Users provide subjective feedback, which is noisy and unreliable. The system optimises for seeming useful rather than being useful.

Prediction Mode creates an objective feedback loop. When a prediction resolves, I can measure accuracy.

I'm integrating Polymarket as the verification source. Run a question through Perspectives, record the predictions, compare against actual outcomes when they resolve. Over time, this builds calibration data showing which methodologies perform best for different question types.

Persona Sets

Different decisions need different analytical lenses. Four built-in sets:

Philosophical (Default): Best for ethical dilemmas and strategic decisions.

Business-Focused: Best for commercial decisions.

Product-Focused: Best for product development.

Forecaster: Optimised for Prediction Mode.

Technical Details

LLM Support: Supports any OpenAI/Anthropic compatible API (Claude, OpenRouter, Ollama, Grok, etc.).

Web Search: Optional integration for grounding debates in recent events.

Output: Single PDF report per query.

What I'm Looking For

I've been building this solo and could use external feedback on a few things:

1. Does the blind proposal mechanism actually produce better disagreement?

2. Is the interrogation protocol overkill or useful? The structured challenge/response/verdict cycle generates rich data, but adds latency (dependant on concurrency settings).

3. What decisions would you run through this?

4. Do you use ChatGPT or similar systems to make decisions?

5. Do you find "chain of thought" output useful for tracking reasoning?

Links

Perspectives: https://getperspectives.app

Dev blog: https://blog.jmatthews.uk

Example Analysis Report (Is it viable to run a nation where all laws expire after 10 years and must be re-passed?): https://drive.google.com/file/d/1hsJOWsQDAtVOqOKF6_a_Q1jYOlB...

Example Prediction Report (Will Kraken IPO by 31st March 2026?): https://drive.google.com/file/d/1m3RedFtv8lKgFqf1_rvzl8W6cTs...

Happy to answer any questions in this thread.

OpenClaw ClawHub Broken Windows Theory – If basic sorting isn't working what is?

https://www.loom.com/embed/e26a750c0c754312b032e2290630853d
1•kaicianflone•51s ago•0 comments

OpenBSD Copyright Policy

https://www.openbsd.org/policy.html
1•Panino•1m ago•0 comments

OpenClaw Creator: Why 80% of Apps Will Disappear

https://www.youtube.com/watch?v=4uzGDAoNOZc
1•schwentkerr•5m ago•0 comments

What Happens When Technical Debt Vanishes?

https://ieeexplore.ieee.org/document/11316905
1•blenderob•6m ago•0 comments

AI Is Finally Eating Software's Total Market: Here's What's Next

https://vinvashishta.substack.com/p/ai-is-finally-eating-softwares-total
1•gmays•7m ago•0 comments

Computer Science from the Bottom Up

https://www.bottomupcs.com/
1•gurjeet•7m ago•0 comments

Show HN: I built a toy compiler as a young dev

https://vire-lang.web.app
1•xeouz•9m ago•0 comments

You don't need Mac mini to run OpenClaw

https://runclaw.sh
1•rutagandasalim•10m ago•0 comments

Learning to Reason in 13 Parameters

https://arxiv.org/abs/2602.04118
1•nicholascarolan•12m ago•0 comments

Convergent Discovery of Critical Phenomena Mathematics Across Disciplines

https://arxiv.org/abs/2601.22389
1•energyscholar•12m ago•1 comments

Ask HN: Will GPU and RAM prices ever go down?

1•alentred•12m ago•0 comments

From hunger to luxury: The story behind the most expensive rice (2025)

https://www.cnn.com/travel/japan-expensive-rice-kinmemai-premium-intl-hnk-dst
2•mooreds•13m ago•0 comments

Substack makes money from hosting Nazi newsletters

https://www.theguardian.com/media/2026/feb/07/revealed-how-substack-makes-money-from-hosting-nazi...
5•mindracer•14m ago•1 comments

A New Crypto Winter Is Here and Even the Biggest Bulls Aren't Certain Why

https://www.wsj.com/finance/currencies/a-new-crypto-winter-is-here-and-even-the-biggest-bulls-are...
1•thm•14m ago•0 comments

Moltbook was peak AI theater

https://www.technologyreview.com/2026/02/06/1132448/moltbook-was-peak-ai-theater/
1•Brajeshwar•15m ago•0 comments

Why Claude Cowork is a math problem Indian IT can't solve

https://restofworld.org/2026/indian-it-ai-stock-crash-claude-cowork/
1•Brajeshwar•15m ago•0 comments

Show HN: Built an space travel calculator with vanilla JavaScript v2

https://www.cosmicodometer.space/
2•captainnemo729•15m ago•0 comments

Why a 175-Year-Old Glassmaker Is Suddenly an AI Superstar

https://www.wsj.com/tech/corning-fiber-optics-ai-e045ba3b
1•Brajeshwar•15m ago•0 comments

Micro-Front Ends in 2026: Architecture Win or Enterprise Tax?

https://iocombats.com/blogs/micro-frontends-in-2026
1•ghazikhan205•18m ago•0 comments

These White-Collar Workers Actually Made the Switch to a Trade

https://www.wsj.com/lifestyle/careers/white-collar-mid-career-trades-caca4b5f
1•impish9208•18m ago•1 comments

The Wonder Drug That's Plaguing Sports

https://www.nytimes.com/2026/02/02/us/ostarine-olympics-doping.html
1•mooreds•18m ago•0 comments

Show HN: Which chef knife steels are good? Data from 540 Reddit tread

https://new.knife.day/blog/reddit-steel-sentiment-analysis
1•p-s-v•18m ago•0 comments

Federated Credential Management (FedCM)

https://ciamweekly.substack.com/p/federated-credential-management-fedcm
1•mooreds•19m ago•0 comments

Token-to-Credit Conversion: Avoiding Floating-Point Errors in AI Billing Systems

https://app.writtte.com/read/kZ8Kj6R
1•lasgawe•19m ago•1 comments

The Story of Heroku (2022)

https://leerob.com/heroku
1•tosh•19m ago•0 comments

Obey the Testing Goat

https://www.obeythetestinggoat.com/
1•mkl95•20m ago•0 comments

Claude Opus 4.6 extends LLM pareto frontier

https://michaelshi.me/pareto/
1•mikeshi42•21m ago•0 comments

Brute Force Colors (2022)

https://arnaud-carre.github.io/2022-12-30-amiga-ham/
1•erickhill•24m ago•0 comments

Google Translate apparently vulnerable to prompt injection

https://www.lesswrong.com/posts/tAh2keDNEEHMXvLvz/prompt-injection-in-google-translate-reveals-ba...
1•julkali•24m ago•0 comments

(Bsky thread) "This turns the maintainer into an unwitting vibe coder"

https://bsky.app/profile/fullmoon.id/post/3meadfaulhk2s
1•todsacerdoti•25m ago•0 comments