frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: AptSelect – A local LLM client for parallel testing and evaluation

https://aptselect.com
2•dhavalt•1h ago
I built AptSelect to stop writing throwaway scripts every time I needed to test how different LLMs handle specific instructions and prompt edge cases.

What it does:

Parallel Execution: Send a single prompt to OpenAI, Anthropic, Mistral, and Gemini simultaneously. Compare the outputs, latency, and exact token usage side-by-side.

Batch Evaluations: Upload a CSV dataset to run bulk tests across multiple models at once.

Manual Diagnostics: Grade outputs manually (Pass/Fail) and assign diagnostic tags (e.g., Hallucination, Format Error) to build a human-verified performance leaderboard.

Local-first: API keys encrypted with your OS keyring; history stored in a local SQLite DB; no telemetry.

I’m looking for technical feedback. What do you think current LLM testing/evaluation tools get most wrong?

French physicist and media star loses doctorate after plagiarism investigation

https://www.science.org/content/article/french-physicist-and-media-star-loses-doctorate-after-pla...
1•bookofjoe•23s ago•0 comments

Paul Krugman has the perfect metaphor for the career of Elon Musk

http://observationalepidemiology.blogspot.com/2026/06/paul-krugman-has-perfect-metaphor-for.html
1•speckx•36s ago•0 comments

Estonia to Grant AI Bots Digital IDs to Control Access

https://www.bloomberg.com/news/articles/2026-06-17/estonia-to-grant-ai-bots-legal-rights-with-per...
1•Tomte•55s ago•0 comments

Show HN: Stop your AI agents from approving their own work

https://github.com/sammysltd/makerchecker
1•smashini•1m ago•1 comments

Why There Won't Be a Singleton AI God (Physics and Evolution)

https://github.com/jacob-sha/Information-Existence-Hypothesis/blob/main/README_EN.md
1•jacob-sha•2m ago•0 comments

Kaspersky discovered malware targeting Steam users through Wallpaper Engine

https://www.kaspersky.co.uk/about/press-releases/kaspersky-discovered-a-malware-campaign-targetin...
1•lrae•3m ago•0 comments

Local Qwen isn't a worse Opus, it's a different tool

https://blog.alexellis.io/local-ai-is-not-opus/
1•alexellisuk•5m ago•1 comments

Real Artists Still Ship

https://jerodsanto.net/2026/06/real-artists-still-ship/
1•speckx•7m ago•0 comments

Anthropic Employees Accuse Trump Administration of Targeting Them

https://www.nytimes.com/2026/06/17/technology/anthropic-trump-administration-fable.html
2•thm•8m ago•0 comments

Send Bulk and Transactional Emails for Free

https://mailbro.tech/
1•Sechele•9m ago•0 comments

Orbital Data Centers Have a Silicon Problem Nobody Is Pricing

https://vincentpribble.substack.com/p/orbital-data-centers-have-a-silicon
3•vpribble•11m ago•0 comments

Climbing the Generative AI Mountain: A "hitchhiker's guide" for product managers

https://queue.acm.org/detail.cfm?id=3807965
1•yarapavan•11m ago•0 comments

SHA-1 Was Shattered

https://www.boot.dev/blog/news/sha-1-was-shattered
2•speckx•12m ago•0 comments

Cosmicgpt – A GPT-in-space simulator to research SpaceX AI satellite viability

https://github.com/davedx/cosmicgpt
1•davedx•14m ago•1 comments

Show HN: StumbleUpon Is Back (Kinda)

https://www.stumbleagain.com/
3•nocodeg•14m ago•0 comments

Towards Conversational AI for Disease Management

https://www.nature.com/articles/s41586-026-10764-5
1•ilreb•15m ago•0 comments

Governance Is the Missing Half of AI Efficiency

https://blog.r-lopes.com/posts/governance-missing-half-of-ai-efficiency
1•dovelome•15m ago•0 comments

Claude Code sessions erase after 30 days by default

https://code.claude.com/docs/en/settings
1•markrogersjr•18m ago•1 comments

Volkswagen started blocking GrapheneOS users

https://discuss.grapheneos.org/d/35949-volkswagen-app?page=3
2•microtonal•19m ago•0 comments

The Demise of Real Neighborhoods Is a Story of Finance

https://www.thenewatlantis.com/publications/the-demise-of-real-neighborhoods-is-a-story-of-finance
1•zeveb•20m ago•1 comments

The Evolution of Unix

https://www.nokia.com/bell-labs/about/dennis-m-ritchie/hist.html
2•highfrequency•21m ago•0 comments

The Mind of Anthropic CEO Dario Amodei [Extended Interview] [video]

https://www.youtube.com/watch?v=x2VHFgyawPE
1•gastonmorixe•22m ago•1 comments

Too many newsletters, not enough time? Listen

https://www.theclawcast.com/
1•theantelope•22m ago•0 comments

Language Courses in the Public Domain

https://fsi-languages.yojik.eu/
1•hggh•22m ago•0 comments

Call for proposals, designing new kinds of research organisations

https://science.works/reorganising-research/
1•rorytbyrne•23m ago•1 comments

Show HN: Tyto – find where audio breaks your voice-agent calls

https://call-analysis.ai-coustics.com/
4•corvj•24m ago•1 comments

Built Uber aggregator that tracks top AI researchers and leaders

https://brightray.ai
1•lundbe•25m ago•1 comments

Show HN: FusionHarness – An Open-source Mixture-of-Agents compound-model server

https://github.com/jackulau/fusionHarness
1•jackxlau•25m ago•0 comments

Who Is America's Homer?

https://www.plough.com/articles/who-is-americas-homer
2•Aqua1920•26m ago•1 comments

Cursor built a fleet of security agents to solve a familiar frustration

https://thenewstack.io/cursor-open-sources-security-agents/
2•atkrad•26m ago•0 comments