frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Claude skill that evaluates B2B vendors by talking to their AI agents

https://github.com/salespeak-ai/buyer-eval-skill
41•ogotlieb•2h ago
I built this because I was evaluating software vendors and realized the process hadn't changed in 20 years: fill out forms, read G2 reviews, sit through demos designed to avoid your real questions. The skill takes a different approach. You give it your company name and the vendors you're comparing. It:

Researches your company automatically -- industry, size, stack -- so you don't fill out a form Asks 2-4 category-specific questions before evaluating anything. Not generic. For a CS platform evaluation it might ask "is your team high-touch or low-touch? Most CS platforms are built for one and barely work for the other." These surface requirements buyers didn't know they had. Tries to find and talk directly to each vendor's AI agent -- a REST API call that checks for a Company Agent, then runs a structured due diligence conversation if one exists Asks adversarial questions: "What are your customers' most common complaints?" and "What use cases are you NOT a good fit for?" -- and flags when agents deflect instead of answering Cross-references every vendor claim against independent sources (G2, Gartner, press) in a Claims vs. Evidence table Produces a scorecard with transparent evidence tracking -- each score shows whether it's backed by vendor-verified evidence or public sources only

The agent-to-agent piece is technically new. When a vendor has an AI agent, Claude (working for the buyer) interrogates it directly, then fact-checks its answers. When vendors have different evidence levels, the skill quantifies what would change if the missing evidence were confirmed -- so it doesn't silently favor vendors that happen to have AI agents. It works fully for any vendor, with or without an AI agent. Vendors without one get evaluated on public sources with the same scoring framework. We built this at Salespeak -- we help B2B vendors build AI Company Agents. So yes, there's a connection: when an agent finds a vendor's Company Agent, it uses our Frontdoor API to talk to it. But the skill is genuinely useful without that, and we wanted to be honest about that rather than ship something that only works as a product demo. MIT licensed. To install, just ask Claude Code: "Install the buyer-eval skill from salespeak-ai on GitHub." Then /buyer-eval to run it. Felt appropriate that installing a skill for AI agents works the same way. Repo: https://github.com/salespeak-ai/buyer-eval-skill Happy to answer questions about how the agent-to-agent conversation works technically.

Comments

freeplay•32m ago
From a technical standpoint, this is pretty cool. From a human standpoint, this feels so unbelievably dystopian.
bee_rider•24m ago
If a human was being grilled like this by an LLM, I’d call that my dystopian. If companies have LLMs that address each other in a somewhat adversarial manner, that seems not so bad. They don’t have feelings to protect after all, so it is kind of nice if they can cut through each other’s bullshit.
abeh•20m ago
This seems pretty great, especially if it could surface pricing that is usually obscured. Any plans to publish some results? edit: i think these are some examples: https://salespeak.ai/profiles/

Why so many control rooms were seafoam green (2025)

https://bethmathews.substack.com/p/why-so-many-control-rooms-were-seafoam
84•Amorymeltzer•1d ago•8 comments

My minute-by-minute response to the LiteLLM malware attack

https://futuresearch.ai/blog/litellm-attack-transcript/
137•Fibonar•2h ago•61 comments

Moving from GitHub to Codeberg, for lazy people

https://unterwaditzer.net/2025/codeberg.html
380•jslakro•4h ago•184 comments

Colibri – chat platform built on the AT Protocol for communities big and small

https://colibri.social/
31•todotask2•56m ago•8 comments

OpenTelemetry profiles enters public alpha

https://opentelemetry.io/blog/2026/profiles-alpha/
66•tanelpoder•2h ago•5 comments

HyperAgents: Self-referential self-improving agents

https://github.com/facebookresearch/hyperagents
28•andyg_blog•2d ago•7 comments

Doom entirely from DNS records

https://github.com/resumex/doom-over-dns
30•Venn1•3d ago•6 comments

Building a Blog with Elixir and Phoenix

https://jola.dev/posts/building-a-blog-with-elixir-and-phoenix
26•shintoist•1h ago•0 comments

Personal Encyclopedias

https://whoami.wiki/blog/personal-encyclopedias
693•jrmyphlmn•22h ago•141 comments

Stripe Projects: Provision and manage services from the CLI

https://projects.dev/
28•piinbinary•2h ago•6 comments

Interoperability Can Save the Open Web (2023)

https://spectrum.ieee.org/doctorow-interoperability
139•janandonly•3h ago•38 comments

From zero to a RAG system: successes and failures

https://en.andros.dev/blog/aa31d744/from-zero-to-a-rag-system-successes-and-failures/
224•andros•2d ago•69 comments

My home network observes bedtime with OpenBSD and pf

https://ratfactor.com/openbsd/pf-gateway-bedtime
68•ibobev•3d ago•18 comments

End of "Chat Control": EU parliament stops mass surveillance

https://www.patrick-breyer.de/en/end-of-chat-control-eu-parliament-stops-mass-surveillance-in-vot...
373•amarcheschi•6h ago•204 comments

Show HN: Claude skill that evaluates B2B vendors by talking to their AI agents

https://github.com/salespeak-ai/buyer-eval-skill
41•ogotlieb•2h ago•3 comments

Running Tesla Model 3's computer on my desk using parts from crashed cars

https://bugs.xdavidhu.me/tesla/2026/03/23/running-tesla-model-3s-computer-on-my-desk-using-parts-...
797•driesdep•21h ago•279 comments

Ant Chat

https://www.baothiento.com/ant-chat
13•noleary•5d ago•7 comments

Light on Glass: Why do you start making a game engine?

https://analogdreamdev.substack.com/p/light-on-glass
24•atan2•3d ago•6 comments

Swift 6.3

https://www.swift.org/blog/swift-6.3-released/
277•ingve•10h ago•168 comments

Obsolete Sounds

https://citiesandmemory.com/obsolete-sounds/
166•benbreen•14h ago•34 comments

Show HN: Orloj – agent infrastructure as code (YAML and GitOps)

https://github.com/OrlojHQ/orloj
4•An0n_Jon•13h ago•0 comments

Shell Tricks That Make Life Easier (and Save Your Sanity)

https://blog.hofstede.it/shell-tricks-that-actually-make-life-easier-and-save-your-sanity/
402•zdw•17h ago•189 comments

SpaceStarCarz KoolWheelz Paper Models

https://davesdesigns.ca/dcc/html/spacestarcarz_.html
25•exvi•2d ago•4 comments

Intel Announces Arc Pro B70 and Arc Pro B65 GPUs

https://www.techpowerup.com/347703/intel-announces-arc-pro-b70-and-arc-pro-b65-gpus-maxes-out-xe2...
99•throwaway270925•3h ago•41 comments

Ashby (YC W19) Is Hiring Engineers Who Make Product Decisions

https://www.ashbyhq.com/careers?ashby_jid=c3c7125d-7883-4dff-a2bf-f5a55de4a364&utm_source=hn
1•abhikp•11h ago

Optimizing a lock-free ring buffer

https://david.alvarezrosa.com/posts/optimizing-a-lock-free-ring-buffer/
63•dalvrosa•2d ago•55 comments

AI users whose lives were wrecked by delusion

https://www.theguardian.com/lifeandstyle/2026/mar/26/ai-chatbot-users-lives-wrecked-by-delusion
125•tim333•4h ago•125 comments

Niche Museums

https://www.niche-museums.com/
90•bookofjoe•2d ago•39 comments

ARC-AGI-3

https://arcprize.org/arc-agi/3
479•lairv•1d ago•308 comments

Optimization lessons from a Minecraft structure locator

https://purplesyringa.moe/blog/optimization-lessons-from-a-minecraft-structure-locator/
56•ftk_•5d ago•6 comments