Our API takes natural-language prompts like "Nvidia M&A history" and returns visual answers and grounding text sourced from real-time, structured data (example: https://trytako.com/card/YHloo1Ea7GRnBr_s5r6s/).
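To give a sense of the developer experience, here's a rough sketch of a call in Python. The endpoint path, auth header, and response field names are simplified for illustration and aren't the actual API contract:

    import requests

    API_KEY = "your-api-key"

    resp = requests.post(
        "https://trytako.com/api/v1/search",       # illustrative endpoint
        headers={"Authorization": f"Bearer {API_KEY}"},
        json={"query": "Nvidia M&A history"},      # natural-language prompt
        timeout=10,
    )
    resp.raise_for_status()
    card = resp.json()

    # Illustrative response shape: an embeddable visual answer plus
    # grounding text tied to the underlying structured sources.
    print(card.get("card_url"))        # e.g. https://trytako.com/card/...
    print(card.get("grounding_text"))  # citation-backed text for an LLM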
Most AI systems can't effectively reason about real-time, structured data. One reason is access: much of the most valuable information is trapped in databases web crawlers can't index. Google solves this with a team of 2,000+ engineers who ingest data (stocks, sports, etc.) into a proprietary Knowledge Graph. Our goal is to offer developers the same knowledge search + visualization primitives Google has built, tailored for AI use cases and delivered via API.
We seek to augment LLMs' capabilities, which means most of our biggest technical challenges stem from how little we get "for free" from LLMs. For example, RAG architectures that generate final outputs with LLMs are too slow and introduce accuracy issues we can't tolerate. Instead, we've built a Generative Augmented Search (GAS) architecture that uses an LLM (currently Llama 3.3 70B on Cerebras) to analyze input queries (~200 ms) but relies on deterministic retrieval for most output generation. The data in our knowledge graph generally isn't available in LLMs or on the web, so we have to acquire it directly from sources (including licensing it from authoritative providers like S&P Global). A limitation of this approach is that some developers want the flexibility of LLM analysis across web sources, even if it means tolerating non-authoritative sourcing and some hallucination. We're working on solutions to that now.
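To make the GAS split concrete, here's a toy sketch of the shape of the pipeline (all names and data illustrative): the LLM's only job is to turn the prompt into a structured intent, and everything downstream is deterministic lookup and templating, so every fact in the output traces back to a row in the graph:

    from dataclasses import dataclass

    @dataclass
    class Intent:
        entity: str
        topic: str

    def parse_query(prompt: str) -> Intent:
        # In the real system this step is an LLM call (Llama 3.3 70B
        # on Cerebras, ~200 ms). Stubbed here for a runnable example.
        entity, _, topic = prompt.partition(" ")
        return Intent(entity=entity, topic=topic)

    # Stand-in for the licensed knowledge graph (e.g. S&P Global data).
    KNOWLEDGE_GRAPH = {
        ("Nvidia", "M&A history"): [
            {"target": "Mellanox", "year": 2020},
            {"target": "Arm (deal abandoned)", "year": 2022},
        ],
    }

    def answer(prompt: str) -> str:
        intent = parse_query(prompt)
        rows = KNOWLEDGE_GRAPH.get((intent.entity, intent.topic), [])
        # Deterministic generation: no LLM touches the output, so
        # nothing can be hallucinated at this stage.
        lines = [f"{r['year']}: {r['target']}" for r in rows]
        return "\n".join(lines) or "No authoritative data found."

    print(answer("Nvidia M&A history"))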
Curious to hear how other people are fighting hallucination.
I'd love your feedback on the product, and I'm happy to discuss it or answer questions about the tech stack.