Show HN: RAG in 3 Lines of Python

41•init0•2mo ago

Got tired of wiring up vector stores, embedding models, and chunking logic every time I needed RAG. So I built piragi.

  from piragi import Ragi

  kb = Ragi(["./docs", "./code/**/*.py", "https://api.example.com/docs"])

  answer = kb.ask("How do I deploy this?")

That's the entire setup. No API keys required - runs on Ollama + sentence-transformers locally.
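
For anyone new to RAG (retrieval-augmented generation), here is a rough sketch of the plumbing those three lines replace: embed the documents, retrieve the chunks closest to the question, and put them in the prompt sent to the LLM. This is a toy illustration, not piragi's code. The bag-of-words `embed` function stands in for a real embedding model, and every name here (`embed`, `cosine`, `retrieve`, `build_prompt`) is hypothetical.

```python
import math
import re
from collections import Counter

def embed(text):
    # Stand-in for a real embedding model: a bag-of-words count vector.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(question, chunks, top_k=2):
    # Rank chunks by similarity to the question and keep the best top_k.
    q = embed(question)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:top_k]

def build_prompt(question, chunks):
    # The retrieved chunks become the context the LLM is asked to answer from.
    context = "\n".join(f"- {c}" for c in retrieve(question, chunks))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

# Hypothetical pre-chunked documents.
chunks = [
    "To deploy, run the deploy script from the project root.",
    "The API returns JSON and requires an auth token.",
    "Logs are written to the logs directory every hour.",
]
prompt = build_prompt("How do I deploy this?", chunks)
```

A real pipeline would swap in a sentence-transformers model for `embed`, a vector store for the linear scan, and send `prompt` to an LLM.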

What it does:

  - All formats - PDF, Word, Excel, Markdown, code, URLs, images, audio

  - Auto-updates - watches sources and refreshes in the background, so queries never wait on a re-index

  - Citations - every answer includes sources

  - Advanced retrieval - HyDE, hybrid search (BM25 + vector), cross-encoder reranking

  - Smart chunking - semantic, contextual, hierarchical strategies

  - OpenAI compatible - swap in GPT/Claude whenever you want
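
To give a feel for what "hybrid search (BM25 + vector)" means, here is a minimal sketch of one common way to merge a keyword ranking with an embedding ranking: reciprocal rank fusion. It is an assumption that this resembles what piragi does internally, since the post does not say. The function name, the constant `k=60`, and the sample rankings are all illustrative.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))

bm25_ranking = ["deploy.md", "faq.md", "api.md"]      # hypothetical keyword results
vector_ranking = ["faq.md", "setup.md", "deploy.md"]  # hypothetical embedding results
fused = reciprocal_rank_fusion([bm25_ranking, vector_ranking])
```

Documents that rank well in both lists rise to the top, while a document found by only one retriever still makes the list. A cross-encoder reranker would then rescore the top few fused results.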

Quick examples:

  # Filter by metadata
  answer = kb.filter(file_type="pdf").ask("What's in the contracts?")

  # Enable advanced retrieval
  kb = Ragi("./docs", config={
      "retrieval": {
          "use_hyde": True,
          "use_hybrid_search": True,
          "use_cross_encoder": True
      }
  })


  # Use OpenAI instead  
  kb = Ragi("./docs", config={"llm": {"model": "gpt-4o-mini", "api_key": "sk-..."}})

Install:

  pip install piragi
  PyPI: https://pypi.org/project/piragi/

Would love feedback. What's missing? What would make this actually useful for your projects?

Comments

atoav•2mo ago

What is missing is a definition of RAG. The whole page uses this acronym like it has to be clear what it is. It is not.

Not that people can't google it, but it is just friendlier to answer the question a good chunk of people looking at the site will have. It also lets users who need what RAG provides, but don't know what it is called, discover it more easily.

sigwinch•2mo ago

It’s enough for the demo to use a variable named ‘kb’, which harkens to the business logic jargon. I don’t see benefit for expanding technical jargon once the business jargon has been uttered.

freakynit•2mo ago

Just tested it. Worked brilliantly well after fixing a minor issue.

My question: any plans to add graph (RDF) support in the near future?

Thanks..

nubg•2mo ago

Great documentation and dev-friendly feature set. Looks promising.

whattheheckheck•2mo ago

Nice, can you make it work with AWS Bedrock?

jimmySixDOF•2mo ago

Chunking on hierarchy is good, async is built in, and there is a cross-encoder mode. I like this project's Keep It Simple Stupid approach without skimping on functions, even a basic graph triple. Using this to fill out a PoC mockup could be worth it vs. dummy data and just drawing a cloud.
