frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: API Ingest – Agentic Search (Inter) API Docs

https://github.com/mohidbt/api-ingest
2•mohidbutt•1h ago
1. CC / Codex dont handle API Docs well enough

No matter what I do, I run into bad requests with claude, day in, day out.

Its making up arguments, misunderstands required types, and misses fields in the requests. And when it catches its issues, the then inititated web search usually ends fuzzy scraped information, that yields even more issues.

Context7 helps. Its better than starting only with the LLM's vague (mis)understanding from pretraining. But it only does semantic search. And often times, semantic search is not precise enough for hyper-precision needed for API requests: CC runs into the same misunderstanding issues as above. And burns tons of tokens in the process.

2. What about Determistic Search in OpenAPI Specs?

In my opinion agents need 1) understanding the damn thing holistically, and 2) ability to do some type of agentic search within the docs.

Thankfully, we do have magnificiently standardized formats for API schemas, most notably OpenAPI/Swagger. Why is no one (to my best knowledge) making use of it?

As I need to work a lot with APIs, I started to build something myself few months ago. In the end its a simple python script that splits the JSON/YAML/RAML/etc files into a) a holistic overview ("manifest"), and b) indexed chunks (by endpoints, tags, and schemas) md files. Agents can access via MCP. It takes a) convert local files, or b) community-converted files, and give the agent the capability to do agentic search on the specs. You can check it out out here, and hook up the MCP server: https://github.com/mohidbt/api-ingest

3. Should we benchmark this? // Feel free to contribute!

WDYT? I am thinking about quantifively corroborating my assumptions, by doing some type of evals. And yes, this by endpoint indexing approach also has many limitations. I.e. when the individual chunks are themselves way too big to load fully into context.

Geniunely curious about all your thoughts

PS: Yes, for many - especially AI-tech - companies, we already have agent optimized API doc formats, like llms.txt in the docs, or skills built for using the APIs; and thats wonderful! But whats with, i.e. Semantic Scholar Graph APIs? What do you do if core CC & Context7 fail? Check out this example: https://github.com/mohidbt/api-ingest/tree/main#opus-47-exam...

Comments

brianwmunz•21m ago
OpenAPI spec indexing is a good idea...semantic search is good for general API questions, but often sucks at specific questions about exact requirements for fields, etc. We've built a lot of connectors at my company and have had this problem.. the agent makes up arguments or misses required types because it's doing too much inference instead of running against an actual schema. I think benchmarking correctness for each endpoint (did the agent construct a valid request on the first try) would be the most useful thing to eval.

Exit Payout Scenarios

https://www.thesaasceo.com/p/your-exit-payout-scenarios
1•sanketbhasin•48s ago•0 comments

US turns to Ukrainian counter-drone tech after Iran attacks, sources say

https://www.reuters.com/business/aerospace-defense/us-turns-ukrainian-counter-drone-tech-after-ir...
1•mikhael•1m ago•0 comments

Show HN: AthleteData – AI coach for endurance athletes that messages you first

https://www.athletedata.health
1•fliellerjulian•1m ago•0 comments

USVC: A new fund by AngelList that broadens access to venture capital

https://usvc.com/
1•bpierre•2m ago•0 comments

RoboLab: Robot- and policy-agnostic simulation benchmarking

https://research.nvidia.com/labs/srl/projects/robolab/
1•dagli•3m ago•0 comments

Show HN: Google Docs MCP that works

https://github.com/dbuxton/google-docs-mcp
1•dbuxton•3m ago•0 comments

Show HN: Free Live Speech Translator

https://timleland.com/live-speech-translator/
2•TimLeland•3m ago•0 comments

SpaceX is working with Cursor and has an option to buy the startup for $60B

https://techcrunch.com/2026/04/21/spacex-is-working-with-cursor-and-has-an-option-to-buy-the-star...
1•hislaziness•3m ago•1 comments

Hi

2•Samsung-A•6m ago•0 comments

How Health Workers Can Love Their Devices

https://za.virtualhospitalsafrica.org/blog/how-health-workers-can-love-their-devices
1•wweiss1230•7m ago•0 comments

Features everyone should steal from npmx

https://nesbitt.io/2026/04/16/features-everyone-should-steal-from-npmx.html
1•speckx•8m ago•0 comments

Building Ridgeline, part 1: I have too many dashboards

https://www.xydac.com/blog/building-ridgeline-part-1/
1•xydac•9m ago•0 comments

World Models will push the frontier for LLMs

https://lucrbvi.bearblog.dev/world-models-will-push-the-frontier/
1•lucrbvi•9m ago•0 comments

AI wants composition, not chat

https://linuxtoaster.com/blog/against-the-chat-box.html
1•dirk94018•10m ago•0 comments

Tolaria

https://tolaria.md/
1•handfuloflight•11m ago•0 comments

Luddites and AI Datacenters

https://www.seangoedecke.com/luddites-and-ai-datacenters/
1•Brajeshwar•11m ago•0 comments

Show HN: Map – Receipts and rollback for AI agents

https://github.com/DeadpxlStudio/ModelActionProtocol
1•Dahvay•11m ago•0 comments

White paper: Enphase universal bidirectional EV charger

https://enphase.com/download/iq-bidirectional-ev-charger-whitepaper
1•malchow•13m ago•0 comments

DCP-AI – Portable accountability layer for AI agents (post-quantum)

https://github.com/dcp-ai-protocol/dcp-ai
1•dnaranjo•13m ago•0 comments

'Finding Satoshi' Makes the Case for Hal Finney, Len Sassaman as BTC Co-Creators

https://decrypt.co/365075/finding-satoshi-makes-the-case-for-hal-finney-len-sassaman-as-bitcoin-c...
1•tromp•14m ago•1 comments

Self-hosted agent pipeline that turns Google Trends into blog posts

https://hyperscale.top/
1•abdu_g•15m ago•0 comments

The world changes its rules. And rarely gives you a warning

https://comuniq.xyz/post?t=978
1•01-_-•16m ago•0 comments

Teen Cannabis Use Linked to Slower Cognitive Development

https://today.ucsd.edu/story/largest-us-study-finds-teen-cannabis-use-linked-to-slower-cognitive-...
1•gmays•20m ago•1 comments

Startup Photreon aims to produce hydrogen with sunlight

https://www.heise.de/en/news/Startup-Photreon-aims-to-produce-hydrogen-with-sunlight-11257703.html
1•car•21m ago•0 comments

Outplaying elite table tennis players with an autonomous robot

https://www.nature.com/articles/s41586-026-10338-5
2•wslh•22m ago•0 comments

Ears and Eyes – Searchable database of cases of physical surveillance devices

https://www.notrace.how/earsandeyes/
1•libroot•22m ago•0 comments

I'm Using AI to Navigate AI Code Review Author Steffen Froehlich

https://www.docusign.com/blog/the-duck-talks-back-how-im-using-ai-to-navigate-ai-code-review
1•dweiss85•23m ago•0 comments

Fighting Internet Contracts One Library at a Time

https://www.techdirt.com/2026/04/21/you-cant-vote-out-amazon-web-services-fighting-adhesion-contr...
1•HotGarbage•23m ago•0 comments

A Field Guide to Bugs

https://www.stephendiehl.com/posts/field_guide_to_bugs/
1•ibobev•24m ago•0 comments

Meta workers outraged over internal software tracking keystrokes, mouse movement

https://nypost.com/2026/04/22/business/meta-workers-outraged-over-software-tracking-keystrokes-mo...
1•1vuio0pswjnm7•24m ago•1 comments