frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Cognee – Open-Source AI Memory Layer That Remembers Context

https://github.com/topoteretes/cognee
8•vasa_•1d ago
Hey HN! We're Vasilije, Laszlo and Lazar, the authors of a new paper and part of https://www.cognee.ai. cognee let’s you build memory layers for AI applications and agents, allowing them to personalize results, connect various data sources, and add custom rules. This enables AI apps to deliver increasingly accurate responses, we reached almost 90% on standard industry benchmarks as you can see here https://github.com/topoteretes/cognee/tree/main/evals and our paper can be accessed at: https://arxiv.org/abs/2505.24478 and collab is in the repository

LLMs don't remember context well—they can’t ingest your data and keep it in memory. This limitation leads to lacking interactions, a lack of accuracy, and the inability to connect your data sources cheaply because developers must include long, unmanaged context in every prompt.

When we were building RAGs we saw that we can’t find the data we need and that there are too many knobs to turn in RAG frameworks. We had to tweak many parameters and also could not specify the rules we wanted the data to follow. No ontologies, no rulesets, no state or good data engineering practices and a lot of manual work. That’s why cognee

cognee builds memory that combines graph, vector, and relational stores. Here is how it works:

Adding data: When you use cognee with your AI App, it can take in any message, string, S3 bucket or even a relational database and automatically ingest it

Managing information: cognee sorts this information into semantic graph: - It extracts entities and connections between things. - It embeds the data in the vector store, It enriches data with custom ontologies you provide, that help ground the graphs and make them more reliable. - The overall information is stored in many layers of a graph and vector store that allows for finding similar information later using a variety of types of searches.

Retrieving data: When given an input query, cognee searches for and retrieves related stored information by leveraging a combination of graph traversal techniques, vector similarity, and COT techniques. It can use the internal benchmarking system to make sure that your pipelines are returning only accurate data when you need it.

---

cognee introduces a self-improving group of memory layers that cover various topics, data sources and can be customized. This reduces the need to build everything from scratch, and you can use our primitives to get started and move more quickly. On the other hand, you can do everything yourself, from start to end. We’ve designed the system to be modular and extensible.

We’ve open-sourced cognee —specifically the framework and various vector and graph database adapters, as well as our default data pipeline, cognify—under the Apache 2.0 license. This includes the ability to add, cognify, and retrieve data within your AI applications and also extend it with custom components that are just pure Python.

However, many keep features that are optimized for production use, as a part of our paid platform. We release these functionalities too, including in the last few weeks permission management + distributed pipelines(dev). These are a part of our open-source package and are available to those who in production environments need things like rate limiting and credential management. We will release a paid offering for developers to get easy API access to our platform.

Our automation tooling allows us to optimize the pipelines and find the best combination of parameters that answer questions our stakeholders have.

We’d love to hear what you think! Please feel free to try our demo, check out the code, read the research paper, and share thoughts or suggestions with us. Your feedback will help shape where we take cognee from here!

Comments

dovlex•1d ago
Hey, this looks super interesting - nice work!

Couple questions:

How does cognee handle temporal evolution of memory (e.g. when facts change over time)? Curious what’s still an open problem here because AFAIK this is really hard to solve.

Is cognee meant to power agent memory that builds up evolving contextual data about individual users and agents acting on their behalf? Say you track evolving user preferences in some domain space e.g. travel. If so, how do you handle user-specific memory separation—especially in multi-user or multi-agent setups?

vasa_•1d ago
Hi, founder of cognee here

We have temporal resolution mechanisms we are building, and framework is generalizable enough to build any custom logic. We have a few ideas and some things will be posted there soon.

cognee has notion of nodesets which work similar to tags: https://docs.cognee.ai/core-concepts/node-sets

And also we have graphs per user now available. So, user permisions + graph filtering

How Much Energy Does It Take to Think?

https://www.quantamagazine.org/how-much-energy-does-it-take-to-think-20250604/
1•pseudolus•35s ago•0 comments

Show HN: Heynds – Write 3x Faster – AI Voice and Writing Assistant (Mac/Windows)

https://www.heynds.com/en
1•pierremouchan•1m ago•0 comments

Gbe_fork – Steam emulator that emulates steam online features

https://github.com/Detanup01/gbe_fork
1•WhereIsTheTruth•3m ago•0 comments

Show HN: A rude AI that will roast your website SEO

https://seomode.co/tools/seo-roast
1•thisismehrab•4m ago•0 comments

Biggest boom since Big Bang: Most energetic explosions in universe uncovered

https://phys.org/news/2025-06-biggest-boom-big-astronomers-uncover.html
1•pseudolus•4m ago•0 comments

De Bruijn's Combinatorics

https://vixra.org/abs/1208.0223
1•fanf2•5m ago•0 comments

Meta is massively suspending Instagram accounts based on flawed AI

https://www.reddit.com/r/Instagram/s/J5Kd18juRF
1•100c1p43r•5m ago•1 comments

Digital Twins in Data Centers: Revolution or Passing Trend?

https://www.powernet.es/blog/gemelos-digitales-cpd
1•Joseblasco20•6m ago•1 comments

China's Hundred Lens War

https://www.chinatalk.media/p/chinas-ar-arms-race
1•ZeljkoS•10m ago•0 comments

Nick Bostrom's "gravely misinformed" notion of superintelligence (2024)

https://caiml.org/dighum/announcements/digital-humanism-salon-capital-and-the-computer-by-lachlan-kermode-2024-06-24/
1•breezykermo•11m ago•1 comments

Ask HN: I don't understand what problems ORMs solve

3•iondodon•21m ago•3 comments

Ask HN: Almost 3 years since ChatGPT. What tools do you use?

1•break_the_bank•33m ago•2 comments

Ask HN: Has ChatGPT been trained on Hacker News comments?

1•leftcenterright•33m ago•0 comments

How does browser automation with browsergpt by civai work

https://app.vearn.co/q/how-does-browser-automation-with-browsergpt-really-work-for-people-who-want-to-save-time-online-and-do-stuff-hands-free
1•usecodenaija•38m ago•0 comments

The Writers on the Leaves of the Trees That Surround the Palace Hathel

https://medium.com/luminasticity/the-writers-on-the-leaves-of-the-trees-that-surround-the-palace-hathel-c32dd25b26ab
1•bryanrasmussen•40m ago•0 comments

Free Prompt Engineering Chrome Extension

https://chromewebstore.google.com/detail/promptjesus/haaecanojfcjlbjbalioknghgchlglnl
1•zigmazigma•42m ago•0 comments

Ask HN: Best Low-Power, Budget-Friendly, and Capable Home Server Setup?

1•johnnykree•43m ago•0 comments

Amazon to invest $10B in North Carolina to expand cloud, AI infra

https://www.reuters.com/business/retail-consumer/amazon-invest-10-billion-north-carolina-expand-cloud-ai-infrastructure-2025-06-04/
2•Kevvv•44m ago•0 comments

Ask HN: How can LLMs boost my developer experience?

2•rich_sasha•45m ago•0 comments

Statement on California State Senate Advancing Dangerous Surveillance Bill

https://www.eff.org/deeplinks/2025/06/statement-california-state-senate-advancing-dangerous-surveillance-bill
3•mdp2021•46m ago•0 comments

From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning

https://arxiv.org/abs/2505.17117
2•ggirelli•48m ago•0 comments

Poison everywhere: No output from your MCP server is safe

https://www.cyberark.com/resources/threat-research-blog/poison-everywhere-no-output-from-your-mcp-server-is-safe
2•nor0x•50m ago•0 comments

Everabyte Cloud Secure Storage

https://everabyte.com
1•tojor27•52m ago•1 comments

Elon Musk shared my photos without credit, and then suspended my account

https://www.reddit.com/r/mildlyinfuriating/s/rwA6ORbw4r
7•nixass•53m ago•2 comments

HipScript: CUDA in Web Browser

https://lights0123.com/blog/2025/01/07/hip-script/
1•walterbell•53m ago•0 comments

How to get started with writing tech video essays

1•sonderotis•54m ago•0 comments

How a chicken.png file made me $100k

https://substack.com/inbox/post/165250741
4•pompomsheep•59m ago•0 comments

You can now present content from your camera feed in Google Meet

https://www.neowin.net/news/you-can-now-present-content-from-your-camera-feed-in-google-meet/
1•bundie•1h ago•0 comments

The Consensus Machine

https://psychip.net/entry/the-consensus-machine
1•psychip•1h ago•0 comments

Air Lab – A portable and open air quality measuring device

https://networkedartifacts.com/airlab/simulator
27•256dpi•1h ago•7 comments