An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•10mo ago

Comments

zljdanceholic•10mo ago
How can we find the key information we need in 10,000+ pages of PDFs within 2.5 hours? And for fact-checking, how do we make sure answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge. It implements a high-performing RAG system in 4,500+ lines of code, which can seem overwhelming to newcomers who are just beginning to learn this technology. To help you get started quickly (and to motivate myself to learn its ins and outs), I've created a complete tutorial on it.

The tutorial includes a complete diagram of the workflow and the tools it uses: Docling for parsing PDFs, LangChain for chunking text, FAISS for vector indexing and similarity search, and ChatGPT as the LLM.
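To make the workflow above a bit more concrete, here is a minimal sketch of the chunk, embed, and search stages, with each chunk carrying its page number so an answer can cite where it came from. This is not the project's code: it uses only the Python standard library, a toy bag-of-words vector stands in for a real embedding model, and a brute-force cosine search stands in for FAISS. The names `Chunk`, `chunk_pages`, `embed`, and `search`, and the sample pages, are made up for illustration.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    text: str
    page: int  # page number kept so answers can cite their source


def chunk_pages(pages, max_words=40):
    """Split each page into fixed-size word windows, remembering the page."""
    chunks = []
    for page_no, text in enumerate(pages, start=1):
        words = text.split()
        for i in range(0, len(words), max_words):
            chunks.append(Chunk(" ".join(words[i:i + max_words]), page_no))
    return chunks


def embed(text):
    """Toy bag-of-words vector (assumption: a real system calls an embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(chunks, query, k=2):
    """Brute-force top-k similarity search (the role a vector index plays)."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(embed(c.text), q), reverse=True)
    return ranked[:k]


# Made-up sample pages standing in for parsed PDF text.
pages = [
    "Revenue grew in 2023 driven by cloud subscriptions.",
    "The board approved a dividend of two dollars per share.",
    "Risk factors include currency fluctuations and supply chain delays.",
]
hits = search(chunk_pages(pages), "What dividend did the board approve?", k=1)
print(hits[0].page, hits[0].text)
```

Because the page number travels with every chunk, whatever the LLM is shown as context can be traced back to a specific page, which is the basis for page-level references.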

I also outline the code flow, showing the execution logic across the multiple Python files, where beginners can easily get lost. Each file is shown in a different color. The point is not to have you memorize all of these file relationships; it works better to read the source code yourself and use the outline as a reference whenever you get lost.

Ilya Rice's original project was designed to answer questions about companies' annual reports, so it supports only three response formats for that challenge: a name, a number, or a boolean. Technical material, however, calls for open-ended questions such as "How does RoPE work?" to understand a concept. I therefore modified the system logic to fit this need by adding a custom AnswerWithRAGContextExplanationPrompt class and by automatically matching the most relevant chapter and its pages through a search across all FAISS databases (retrieving only the top-1 result). The final performance is demonstrated below (not cherry-picked, only tested once).

- What are the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.
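The chapter-matching step described above (query every per-chapter database, keep only its top-1 hit, then pick the best chapter) could look roughly like the sketch below. It is an illustration under stated assumptions, not the tutorial's implementation: a toy token-overlap score stands in for a FAISS lookup, the indexes are plain lists of (page, text) pairs, and `Hit`, `top1`, `route`, and `build_prompt` are hypothetical names. The prompt text is likewise made up and is not the content of the AnswerWithRAGContextExplanationPrompt class.

```python
import re
from dataclasses import dataclass


@dataclass
class Hit:
    chapter: str
    page: int
    text: str
    score: float


def tokens(text):
    """Lowercased word set used by the toy similarity score."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Toy similarity score standing in for a vector-index lookup."""
    ta, tb = tokens(a), tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def top1(chapter, index, question):
    """Return the single best-matching entry of one chapter index."""
    page, text = max(index, key=lambda entry: jaccard(entry[1], question))
    return Hit(chapter, page, text, jaccard(text, question))


def route(indexes, question):
    """Query every chapter index for its top-1 hit and keep the best overall."""
    candidates = (top1(name, idx, question) for name, idx in indexes.items())
    return max(candidates, key=lambda hit: hit.score)


def build_prompt(hit, question):
    """Assemble an explanation-style prompt that asks for a page citation."""
    return (
        f"Context (chapter '{hit.chapter}', page {hit.page}):\n{hit.text}\n\n"
        f"Question: {question}\n"
        "Explain the answer using only the context and cite the page number."
    )


# Hypothetical per-chapter indexes: lists of (page, text) pairs.
indexes = {
    "positional-encoding": [
        (12, "RoPE rotates query and key vectors by a position dependent angle."),
    ],
    "training": [
        (40, "AdamW is used as the optimizer with a cosine learning rate schedule."),
    ],
}
question = "How does RoPE work?"
best = route(indexes, question)
print(build_prompt(best, question))
```

Retrieving only the top-1 hit per index keeps the routing cheap, at the cost of missing a chapter whose best chunk scores slightly lower than a near-duplicate elsewhere.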

All the code is available on Colab, and the tutorial is linked here. Hope this helps!
