frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Perspectives – I wanted AI to challenge my thinking, not validate it

https://getperspectives.app
1•Jamium•1h ago
I built Perspectives because I got tired of ChatGPT agreeing with everything I said.

Ask any LLM to "consider multiple perspectives" and you get hedged consensus. The model acknowledges trade-offs exist, then settles on a moderate position that offends nobody. Useful for summaries. Useless for decision making.

Perspectives forces disagreement. 8 personas with fundamentally incompatible frameworks debate your question through a structured protocol, then vote using Single Transferable Vote to surface where they actually land. The output is a PDF report synthesising all of it.

How it works

Blind Proposals: Each persona generates a position without seeing the others. This prevents the "anchoring problem" where early responses shape later ones, bypassing the default sycophancy of LLMs.

Interrogation of Blind Proposals: Proposals face structured challenges from 3 opposing personas. A "high-empathy" persona (e.g., The Idealist) will be challenged by a "low-empathy" cluster (e.g., The Pragmatist). This reveals exactly where arguments buckle under pressure.

Discussion & Voting: Personas can debate (optional) before ranking preferences via STV. This highlights first-choice winners and preference flows rather than simple majority rule.

Analysis/Prediction Report: The final PDF structures recommendations first, followed by supporting analysis (factual background, risk assessment, evidence quality).

Two Operational Modes

Analysis Mode ("What should we do?"): Evaluates options and surfaces trade-offs. Output is qualitative judgment.

Prediction Mode ("What will happen?"): Generates probability estimates with resolution criteria.

Feedback Loops

Most AI agent projects have no way to measure whether their outputs are actually good. Users provide subjective feedback, which is noisy and unreliable. The system optimises for seeming useful rather than being useful.

Prediction Mode creates an objective feedback loop. When a prediction resolves, I can measure accuracy.

I'm integrating Polymarket as the verification source. Run a question through Perspectives, record the predictions, compare against actual outcomes when they resolve. Over time, this builds calibration data showing which methodologies perform best for different question types.

Persona Sets

Different decisions need different analytical lenses. Four built-in sets:

Philosophical (Default): Best for ethical dilemmas and strategic decisions.

Business-Focused: Best for commercial decisions.

Product-Focused: Best for product development.

Forecaster: Optimised for Prediction Mode.

Technical Details

LLM Support: Supports any OpenAI/Anthropic compatible API (Claude, OpenRouter, Ollama, Grok, etc.).

Web Search: Optional integration for grounding debates in recent events.

Output: Single PDF report per query.

What I'm Looking For

I've been building this solo and could use external feedback on a few things:

1. Does the blind proposal mechanism actually produce better disagreement?

2. Is the interrogation protocol overkill or useful? The structured challenge/response/verdict cycle generates rich data, but adds latency (dependant on concurrency settings).

3. What decisions would you run through this?

4. Do you use ChatGPT or similar systems to make decisions?

5. Do you find "chain of thought" output useful for tracking reasoning?

Links

Perspectives: https://getperspectives.app

Dev blog: https://blog.jmatthews.uk

Example Analysis Report (Is it viable to run a nation where all laws expire after 10 years and must be re-passed?): https://drive.google.com/file/d/1hsJOWsQDAtVOqOKF6_a_Q1jYOlB...

Example Prediction Report (Will Kraken IPO by 31st March 2026?): https://drive.google.com/file/d/1m3RedFtv8lKgFqf1_rvzl8W6cTs...

Happy to answer any questions in this thread.

Show HN: Local-First AI Video Upscaler with CPU Fallback

https://github.com/pratik227/upscale_video_4k
1•pratik227•1m ago•0 comments

'Askers' vs. 'Guessers' (2010)

https://www.theatlantic.com/national/2010/05/askers-vs-guessers/340891/
1•BoorishBears•3m ago•0 comments

Elastic grid by Juan Pinkus, 2021 (2023)

https://elasticgrid.pinkus.link/
1•surprisetalk•3m ago•0 comments

Let's Build an Atmospheric Web

https://jimray-bsky.leaflet.pub/3mcxq7tyx522r
1•danabramov•3m ago•0 comments

Burnout is breaking a sacred pact

https://usefulfictions.substack.com/p/burnout-is-breaking-a-sacred-pact
1•surprisetalk•3m ago•0 comments

Betting on the Millennium Problems

https://news.manifold.markets/p/betting-on-the-millennium-problems
1•surprisetalk•3m ago•0 comments

Caves of Qud: A Review

https://www.mattkeeter.com/blog/2025-12-29-qud/
1•surprisetalk•3m ago•0 comments

Show HN: CyberCage – On-device PII detection for AI tools (text and images)

https://cybercage.io/
1•cybercageio-dev•3m ago•0 comments

Show HN: Erlauncher - A lightweight app launcher written in Zig for X11

https://github.com/ErlantzCalvo/Erlauncher
1•erlfire•4m ago•0 comments

'Test-Time Matching' method lets AI models improve with use

https://news.ucr.edu/articles/2026/01/21/making-ai-smarter-without-more-training-data
1•giuliomagnifico•6m ago•0 comments

Alibaba to IPO AI Chipmaking Unit T-Head

https://www.bloomberg.com/news/articles/2026-01-22/alibaba-is-said-to-plan-ipo-for-ai-chipmaking-...
1•dworks•13m ago•0 comments

Two Twisty Shapes Resolve a Centuries-Old Topology Puzzle

https://www.quantamagazine.org/two-twisty-shapes-resolve-a-centuries-old-topology-puzzle-20260120/
2•ibobev•13m ago•0 comments

My Claude.md for enterprise grade software

https://github.com/heyojules/agent-context
3•gintokinx•21m ago•1 comments

A 'time capsule' for cells stores the secret experiences of their past

https://www.nature.com/articles/d41586-026-00116-8
1•XzetaU8•22m ago•0 comments

Claude finds 353 zero-days on Packagist

https://sansec.io/research/claude-finds-353-zero-days-packagist
4•gwillem•24m ago•1 comments

Turning blockchain wallets into readable stories

https://kettaro.com/
1•chainbuilder•25m ago•1 comments

Are your memories illusions? New study disentangles the Boltzmann brain paradox

https://phys.org/news/2026-01-memories-illusions-disentangles-boltzmann-brain.html
1•pseudolus•26m ago•0 comments

Why should GNOME support server side decorations?

https://blister.zip/posts/gnome-ssd/
1•Jyaif•26m ago•0 comments

Skill Gateway: Intelligent skill selection system that reduces token consumption

https://github.com/onurkanbakirci/skills-gateway
1•onurkanbkrc•26m ago•0 comments

Witcher 3 Quests style TODO App

https://karthikeshwar1.github.io/ToDo-Witcher3Style/
1•rcrKnight•27m ago•1 comments

Ancient handprints may represent some of the first rock art

https://www.science.org/content/article/these-ancient-handprints-may-represent-some-world-s-first...
1•pseudolus•27m ago•0 comments

Why Extreme Posts Go Viral

https://www.tekushi.com/research/why-extreme-post-go-viral
1•laurent_molter•33m ago•1 comments

The rise of 'micro' apps: non-developers are writing apps instead of buying them

https://techcrunch.com/2026/01/16/the-rise-of-micro-apps-non-developers-are-writing-apps-instead-...
1•bookofjoe•34m ago•0 comments

Resource Signal: Reactive async data loading for Python

https://reaktiv.bui.app/docs/resource-guide.html
1•buibuibui•40m ago•1 comments

What is a split network for recruiters?

https://talnet.co/space/blog/what-is-a-split-network-for-recruiters/
1•bouia•40m ago•0 comments

Netflix's Masterclass: Engineering Live Streaming at Scale

https://streaminglearningcenter.com/encoding/netflixs-masterclass-engineering-live-streaming-at-s...
2•breve•43m ago•0 comments

Show HN: RepoSquirrel – New release with deep PagerDuty and Git visualizations

https://github.com/reposquirrel/reposquirrel
1•reposquirrel•46m ago•0 comments

Show HN: It's a free vulnerability database that publishes fresh CVE data daily

https://www.threatroad.com/
2•levberg•54m ago•0 comments

We will ban you and ridicule you in public if you waste our time on crap reports

https://curl.se/.well-known/security.txt
138•latexr•54m ago•78 comments

Starting a Startup at 25, 35, or 45 Is Not the Same Decision

https://nauma.ai/blog/p/starting-a-startup-at-25-35-or-45
1•iliyanko•57m ago•0 comments