Show HN: Airweave – Let agents search any app

https://github.com/airweave-ai/airweave
93•lennertjansen•5h ago
Hey HN, we're Lennert and Rauf. We’re building Airweave (https://github.com/airweave-ai/airweave), an open-source tool that lets agents search and retrieve data from any app or database. Here’s a general intro: https://www.youtube.com/watch?v=EFI-7SYGQ48, and here’s a longer one that shows more real-world use cases, examples of how Airweave is used by Cursor (0:33) and Claude desktop (2:04), etc.: https://youtu.be/p2dl-39HwQo

A couple of months ago we were building agents that interacted with different apps, and we were frustrated when they struggled to handle vague natural language requests like "resolve that one Linear issue about missing auth configs", "if you get an email from an unsatisfied customer, reimburse their payment in Stripe", or "what were the returns for Q1 based on the financials sheet in gdrive?". The agent would inefficiently chain together loads of function calls to find the data, or not find it at all and hallucinate.

We also noticed that despite the rise of MCP creating more desire for agents to interact with external resources, the majority of agent dev tooling focused on function calling and actions instead of search. We were annoyed by the lack of tooling that enabled agents to semantically search workspace or database contents, so we started building Airweave first as an internal solution. Then we decided to open-source it and pursue it full time after we got positive reactions from coworkers and other agent builders.

Airweave connects to productivity tools, databases, or document stores via their APIs and transforms their contents into searchable knowledge bases, accessible through a standardized interface for the agent. The search interface is exposed via REST or MCP. When using MCP, Airweave essentially builds a semantically searchable MCP server on top of the resource. The platform handles the entire data pipeline from connection and extraction to chunking, embedding, and serving. To ensure knowledge is current, it has automated sync capabilities, with configurable schedules and change detection through content hashing.
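
As an illustration of change detection through content hashing, here is a minimal sketch. It is not Airweave's implementation: the function names, the in-memory dictionaries, and the choice of SHA-256 are all assumptions made for the example. The idea is to hash each item on every sync and only re-process items whose hash differs from the one stored last time.

```python
import hashlib


def content_hash(text: str) -> str:
    """Return a stable SHA-256 hex digest of an item's text content."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def detect_changes(source_items: dict, stored_hashes: dict):
    """Compare freshly fetched items against hashes saved by the previous sync.

    Both arguments map item ID -> value (hypothetical in-memory stand-ins for
    a source API and a hash table kept by the sync job).

    Returns (to_upsert, to_delete, new_hashes):
    - to_upsert: IDs that are new or whose content changed
    - to_delete: IDs that no longer exist in the source
    - new_hashes: the hash table to persist for the next sync
    """
    new_hashes = {item_id: content_hash(text) for item_id, text in source_items.items()}
    to_upsert = sorted(
        item_id
        for item_id, digest in new_hashes.items()
        if stored_hashes.get(item_id) != digest
    )
    to_delete = sorted(set(stored_hashes) - set(new_hashes))
    return to_upsert, to_delete, new_hashes
```

Only the IDs in `to_upsert` would need to be chunked and embedded again, which is what keeps scheduled syncs cheap when most of the source is unchanged.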

We built it with support for white-labeled multi-tenancy to provide OAuth2-based integration across multiple user accounts while maintaining privacy and security boundaries. We're also actively working on permission-awareness (i.e., RBAC on the data) for the platform.

We're happy to share learnings and get insights from your experiences. Looking forward to your comments!

Comments

reneBond•4h ago
Had meetings with a ton of MCP-server providers, and no one came close to Airweave’s retrieval accuracy. I even tried Zapier and similar large companies, and they didn’t come near Airweave. Highly, highly recommend if you need third-party integrations for your AI agents or workflows. Love the team too: cracked, cool, kind, and always there to support their customers (they even took one of their customers’ dogs on a walk when they couldn’t lol)
risyachka•1h ago
Noob here - why would MCP providers have good accuracy?

Don’t they basically just wrap existing APIs in the MCP protocol?

om0agarwal•4h ago
Looks like a great product! Do you integrate with data lakes (Snowflake/Fabric)?
raufakdemir•4h ago
Hi, co-founder here. No Snowflake or Fabric yet. We do support some popular regular SQL connectors. We are working towards an async distributed processing architecture that should allow us to process >50M row datasets, but we're still looking for a strong use-case signal here. What would you like to do with it?
rkhanna23•4h ago
this is so powerful - MCP on steroids :). what are the next use cases you're looking to build?
raufakdemir•3h ago
We're mostly focused on getting this right - better than any other tool atm. We are evaluating ideas like mapped RBAC, self-updating deep research and other tools for agent builders, but it should first be very clear to us that devs actually need it :D
ayxliu•4h ago
I was looking everywhere for some solution like this. Finally! Curious, do you guys integrate with internal data sources within a company?
bartjanjorna•4h ago
How is this different from regular MCP servers?
raufakdemir•4h ago
Co-founder here. The platform provides MCP or REST endpoints on top of searchable information. The tool is specifically geared towards agents that want to perform actions on external systems (through an MCP server, for example) but get confused about which objects to interact with. Airweave provides a robust interface for this.

You can compare it to how coding agents like Cursor work. This is the usual pattern you see:
- The first step is reading your prompt.
- Then it goes through all the attached files and searches your codebase.
- The last step is to make code file edits.

Non-coding agents that use "regular" MCP servers completely miss the second part. It's very hard to go from a natural language instruction to a chain of API calls that actually works and doesn't end up in hallucination.
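
The search-before-act pattern described above can be sketched as follows. This is a toy, not Airweave's API: the `Issue` type, the `search` and `resolve_issue` functions, and the word-overlap ranking (a stand-in for real semantic search over embeddings) are all hypothetical and exist only to keep the example runnable.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Issue:
    id: str
    title: str


def search(query: str, issues: list[Issue]) -> Issue | None:
    """Toy retrieval: pick the issue whose title shares the most words with the query.

    A real system would compare embeddings; word overlap is an assumption
    made here so the sketch has no external dependencies.
    """
    query_words = set(query.lower().split())
    best, best_score = None, 0
    for issue in issues:
        score = len(query_words & set(issue.title.lower().split()))
        if score > best_score:
            best, best_score = issue, score
    return best


def resolve_issue(request: str, issues: list[Issue]) -> str:
    """Search first, then act on a concrete ID instead of guessing one."""
    match = search(request, issues)
    if match is None:
        return "no matching issue found; ask the user to clarify"
    # Hypothetical action step: a real agent would call the tracker's API here.
    return f"resolve {match.id}"
```

For example, given issues titled "Fix login page styling" and "Missing auth configs in staging", the request "resolve that one Linear issue about missing auth configs" is grounded to the second issue before any action is taken, and a request that matches nothing produces a clarification instead of a guessed ID.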

valianter•4h ago
Is chat always the best interface for all of these apps? I feel like search is the natural first step, but chat-based search has been around for a while. Feel like an MCP-based version of Glean/Onyx/Moveworks/Dashworks is interesting, but unsure how much better it makes the product. Curious to see why your product is better
raufakdemir•4h ago
Co-founder here. The Airweave interface is agnostic to the downstream use case it's applied in. Most current developers actually don't build it into a chat interface at all. Instead they fold it into their agents to give them access to user data. At first sight enterprise search looks quite similar, but this is a building block for developers to set up integrations for their internal agent / agent product.
brene•4h ago
Pretty cool stuff. How does it deal with self-hosted data sources? can it run inside a VPC and talk to my RDS instances directly?
raufakdemir•3h ago
You can self-host Airweave on Docker or Kubernetes within your VPC. We eventually want to move towards AWS/Azure/GCP marketplace offerings that should make this easier for you. RDS should work, as long as your instance uses a PSQL/MySQL dialect.
pomarie•3h ago
Pretty cool – when does it make sense to use this vs n8n?
raufakdemir•3h ago
n8n is a good example of a tool that Airweave can enhance. n8n allows (no-code) developers to set up pre-determined automations, but as soon as you want to process non-deterministic text into action on an app, you will still need a way to search the app. Example: you have an n8n workflow that gets you on track with Linear tickets. You hook it into a text-based human interface in which the user says: "I just created a task about database migration on Linear, can you start doing the preparations for it?". Airweave can 1. find that damn ticket, 2. give additional context on database migrations based on what else it finds in the integrated systems.
nishanthooda•3h ago
Nice - does it have role based access controls built in?
raufakdemir•3h ago
I assume you're talking about the data layer (not the control plane)? We are currently in PoC phase for mapping the role graphs from source systems (Asana, Google Drive) to our internal role model, but this is still in the works. The way developers work around this atm is by configuring a connection on a subset of the source info. Example: only make Airweave sync info from the `Shared Drive/Marketing/Branding` path
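
A sketch of what such a path-based workaround could look like. The `in_sync_scope` helper is hypothetical and is not an Airweave configuration option. It compares whole path components rather than raw string prefixes, so a sibling folder such as `Branding-old` is not synced by accident.

```python
from pathlib import PurePosixPath


def in_sync_scope(file_path: str, allowed_root: str) -> bool:
    """True if file_path is allowed_root itself or sits somewhere beneath it.

    Hypothetical helper: compares path components, not string prefixes.
    """
    path = PurePosixPath(file_path)
    root = PurePosixPath(allowed_root)
    return path == root or root in path.parents
```

With `allowed_root` set to `Shared Drive/Marketing/Branding`, a file at `Shared Drive/Marketing/Branding/logo.png` is in scope, while `Shared Drive/Marketing/Branding-old/logo.png` and `Shared Drive/Finance/q1.xlsx` are not.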
mike_d•31m ago
How do you handle data retention? For example, say that you suck in the information of a California resident and the company is obligated by law to delete it on request. How do you ensure no derivative data exists within your model?
flockonus•3h ago
fyi there is a project whose name is spelled and pronounced almost the same as yours - arweave.org
raufakdemir•3h ago
Lol, good to know. Thanks
1317•1h ago
it's also the name of a mattress
modelorona•3h ago
Looks cool! How are you thinking about pricing it?
lennertjansen•2h ago
Hi, co-founder here. Until now it's been custom deployments for customers with additional B2B/enterprise features. We're also releasing a managed service for a flat-fee subscription.
howmayiannoyyou•1h ago
If we want to integrate our SaaS apps into Airweave, is there an app exchange or directory for doing so?
