frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Ragctl – document ingestion CLI for RAG (OCR, chunking, Qdrant)

https://github.com/datallmhub/ragstudio
4•ahsekka•10h ago
Hi HN — sharing ragctl, an open-source CLI for the most failure-prone part of RAG pipelines: document ingestion, OCR, parsing/cleaning, and chunking.

Vector DB setup is fairly standardized now, but getting high-quality, consistent text + metadata into it still takes a lot of brittle glue code. ragctl aims to make that “pre-vector” step repeatable: turn messy documents into retrieval-ready chunks in a few commands.

Features • Multi-format input: PDF, DOCX, HTML, images • OCR for scanned/image-based docs • Semantic chunking (LangChain) • Batch runs with retries + error handling • Output: direct ingestion into Qdrant (for now)

Looking for feedback • DX: is the CLI intuitive? • Performance / edge cases: weird PDFs, mixed layouts, tables • Roadmap: which connectors (S3, Slack, Notion) or vector stores should be next?

Repo: https://github.com/datallmhub/ragstudio Happy to answer questions about the architecture and chunking approach.

Show HN: Tonbo – an embedded database for serverless and edge runtimes

https://github.com/tonbo-io/tonbo
26•ethegwo•6d ago•6 comments

Show HN: Turn raw HTML into production-ready images for free

https://html2png.dev
87•alvinunreal•9h ago•41 comments

Show HN: Semantic Coverage – A tool to visualize RAG blind spots using UMAP

https://github.com/aashirpersonal/semantic-coverage
2•aashirpersonal•50m ago•1 comments

Show HN: I built a tool that creates videos out of React code

https://github.com/outscal/video-generator
2•mayankkgrover•57m ago•0 comments

Show HN: CodinIT, local open-source Lovable alternative (Electron desktop app)

https://github.com/codinit-dev/codinit-dev
13•Gerome24•4d ago•2 comments

Show HN: CineCLI – Browse and torrent movies directly from your terminal

https://github.com/eyeblech/cinecli
312•samsep10l•1d ago•101 comments

Show HN: Cosmofy – bundle your Python code for Linux/Windows/MacOS

https://github.com/metaist/cosmofy
7•metaist•5h ago•1 comments

Show HN: SatoriDB – embedded vector database written in Rust

3•joeeverjk•1h ago•1 comments

Show HN: Yapi – FOSS terminal API client for power users

https://yapi.run/blog/what-is-yapi
44•jamiepond•2d ago•16 comments

Show HN: Jmail – Google Suite for Epstein files

https://www.jmail.world
1530•lukeigel•3d ago•350 comments

Show HN: C-compiler to compile TCC for live-bootstrap

https://github.com/FransFaase/MES-replacement
67•fjfaase•6d ago•23 comments

Show HN: Python SDK – forecasting with foundation time-series and tabular models

https://github.com/S-FM/faim-python-client
41•ChernovAndrei•6d ago•16 comments

Show HN: Books mentioned on Hacker News in 2025

https://hackernews-readings-613604506318.us-west1.run.app
602•seinvak•2d ago•211 comments

Show HN: Netrinos – A keep it simple Mesh VPN for small teams

https://netrinos.com
91•pcarroll•4d ago•65 comments

Show HN: Ragctl – document ingestion CLI for RAG (OCR, chunking, Qdrant)

https://github.com/datallmhub/ragstudio
4•ahsekka•10h ago•0 comments

Show HN: HN Wrapped 2025 - an LLM reviews your year on HN

https://hn-wrapped.kadoa.com?year=2025
308•hubraumhugo•3d ago•152 comments

Show HN: Kapso – WhatsApp for developers

https://kapso.ai/
27•aamatte•16h ago•15 comments

Show HN: Rust/WASM lighting data toolkit – parses legacy formats, generates SVGs

https://eulumdat.icu
50•holg•2d ago•5 comments

Show HN: Agentica – 200 reqs/day for free, data not used to train our LLMs

https://agentica.genlabs.dev
3•GenLabs-AI•13h ago•0 comments

Show HN: An easy way of broadcasting radio around you (looking for feedback)

https://github.com/dpipstudio/botwave
36•douxx•6d ago•17 comments

Show HN: RenderCV – Open-source CV/resume generator, YAML to PDF

https://github.com/rendercv/rendercv
97•sinaatalay•2d ago•41 comments

Show HN: WalletWallet – create Apple passes from anything

https://walletwallet.alen.ro/
438•alentodorov•2d ago•111 comments

Show HN: Shittp – Volatile Dotfiles over SSH

https://github.com/FOBshippingpoint/shittp
134•sdovan1•2d ago•85 comments

Show HN: Lume.js – 1.5KB React alternative using zero custom syntax

https://sathvikc.github.io/lume-js/
7•sathvikchinnu•15h ago•2 comments

Show HN: CarryFit – Open-source carry-on compliance checker for 170 airlines

https://carryon.fit/
3•axeluser•15h ago•1 comments

Show HN: The Official National Train Map Sucked, So I Made My Own

https://www.bdzmap.com/
76•Pavlinbg•2d ago•24 comments

Show HN: A kids book that introduces authorization and permissions concepts

https://authzed.com/resources/dibs-and-the-magic-library
11•samkim•17h ago•1 comments

Show HN: DeepSearch – a high-performance SMB directory scanner in Rust

https://github.com/dohuyhoang93/DeepSearch
16•dohuyhoangvn93•1d ago•3 comments

Show HN: "What Should I Build?" A directory of what people want

https://www.whatshouldibuild.online/
4•emil154•18h ago•3 comments

Show HN: Openinary – Self-hosted image processing like Cloudinary

https://github.com/openinary/openinary
5•fheysen•19h ago•4 comments