frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•1y ago

Comments

zljdanceholic•1y ago
How can we search the wanted key information from 10,000+ pages of PDFs within 2.5 hours? For fact check, how do we implement it so that answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge, which has 4500+ lines of code for implementing a high-performing RAG system. It might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly—and to motivate myself to learn its ins and outs—I’ve created a complete tutorial on this.

We have a complete graph to explain its workflow where multiple tools are used: Docling for parsing PDFs, LangChain for chunking text, faiss for vectorization and similarity searching, and chatgpt for LLMs.

Besides, I also outline the codeflow, demonstrating the running logic involving multiple python files where starters can easily get lost. Different files are colored differently. The purpose of showing this is not letting you memorize all of these file relationships. It works better for you to check the source code yourself and use this as a reference if you find yourself lost in the code.

The original project of Ilya Rice design its RAG system for answering questions of annual reports from companies, so he only designed three types of question response format for that challenge: a name, a number, or a boolean. But to ask questions about technical stuff, we absolutely ask general questions like How does RoPE work? to know about some concepts and the like Therefore, I further modify the system logic to fit this need by customizing an AnswerWithRAGContextExplanationPrompt class and automatically matching the most related chapter and corresponding pages via searching through all faiss databases (only retrieve the top-1) The final performance is demonstrated below (not cherry-picked, only tested once).

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All code are provided on Colab and the tutorial is referenced here. Hope this helps!

Privacy-Preserving Process Mining

https://hash.ai/blog/privacy-preserving-process-mining
1•tosh•23s ago•0 comments

The Eternal Allure of the Rabbit Hole

https://www.theatlantic.com/technology/2026/06/jfk-assassination-epstein-ufo-conspiracy-theories/...
1•fortran77•1m ago•0 comments

Bun 1.4: The Controversial AI-Driven Rewrite from Zig to Rust

https://grigio.org/bun-1-4-the-controversial-ai-driven-rewrite-from-zig-to-rust/
1•baranul•3m ago•0 comments

Nvidia-ZPPO: Zone of Proximal Policy Optimization

https://byungkwanlee.github.io/ZPPO-page/
1•gmays•4m ago•0 comments

Kremlometr - Using NLP to spot pro-Russian propaganda in czech media comments

https://kremlometr.cz/?lang=en
1•kekqqq•7m ago•0 comments

Show HN: I made AI bets for all World Cup games – bet against or with them

https://www.getaiso.com/tools/world-cup-ai-challenge
1•bentannen•10m ago•0 comments

Show HN: Material Hack – Reboot of the Hacker News Client Materialistic

https://github.com/mxmilkiib/material-hack
1•mxmilkiib•10m ago•1 comments

Canyon HUD helmet for road riding

https://media-centre.canyon.com/en-INT/266866-new-canyon-heads-up-display-helmet-could-be-a-safet...
1•zh3•10m ago•0 comments

US-Iran memorandum of understanding in full

https://www.bbc.com/news/articles/c4gy700j0eko
2•thisislife2•11m ago•2 comments

FIDE AI – Research and public infrastructure for faith-facing AI

https://fideai.substack.com/p/introducing-fide-ai
1•alexchaomander•13m ago•0 comments

Show HN: Web-Based FPGA Viewer

https://edacation.github.io/nextpnr-viewer/
1•malmeloo•13m ago•0 comments

Show HN: Forecastion – A forecasting workbench for analysts and operators

https://forecastion.com/
2•PrimeMentat•16m ago•0 comments

Anthropic build AI so safe the Gov made them delete it (YouTube) – Patrick Boyle

https://www.youtube.com/watch?v=AJP6K2_rr90
1•ziptron•20m ago•0 comments

Why do developers choose one programming language over another? (2002)

https://web.archive.org/web/20020803183731/http://www.joelonsoftware.com/news/20020505.html
1•tosh•23m ago•0 comments

Show HN: FOSS sandbox platform that hides infra secrets from devs and AI agents

https://github.com/octelium/cordium
1•geoctl•24m ago•0 comments

Pandas vs. DuckDB vs. Polars: 20M rows, 10 operations, benchmarked

https://thedatabytes.substack.com/p/pandas-vs-duckdb-vs-polars-i-ran
1•badhayaru•25m ago•0 comments

The Emerging AI Governance Space

https://www.threedeep.tech/ai-governance-stack
1•ethigent•25m ago•0 comments

Show HN: Criterion Closet as a website – pull any of 1,247 films off the shelf

https://the-criterion-closet.vercel.app
1•olievans•25m ago•0 comments

The Agentic Resource Discovery Specification

https://developers.googleblog.com/announcing-the-agentic-resource-discovery-specification/
2•simonpure•29m ago•0 comments

Gotique – AI that gives a first-read appraisal on antiques from photos

https://gotique.ai/en/community
2•gotique•30m ago•0 comments

Five Worlds (2002)

https://www.joelonsoftware.com/2002/05/06/five-worlds/
2•tosh•32m ago•0 comments

Two production Next.js apps, built solo with Cursor+Claude, $13,945

https://technicalstrat.com/articles/enterprise-app-vibecode-recipe
4•matttek•33m ago•0 comments

Getting over Your Skills Issues

https://theoryvc.com/blog
2•sambcui•37m ago•0 comments

India Calls in Military, Blocks Telegram App to Lock Down College-Entrance Exam

https://www.wsj.com/world/india/india-calls-in-military-blocks-app-to-lock-down-college-entrance-...
4•bookofjoe•38m ago•1 comments

Changes that cut our LLM pipeline costs more than model-switching did

2•Abbas_Maka•38m ago•0 comments

Ag.ide Index, rank, and refactor your repo's worst code

https://agide.dev/
2•navs•39m ago•0 comments

Most Affordable Cities to Buy a Home

https://wallethub.com/edu/most-affordable-cities-for-home-buyers/121950
4•panny•58m ago•3 comments

Show HN: Convert your landing pages to powerful visuals for social media

3•umeshmr•59m ago•0 comments

The IPv4 Parser AI Couldn't Have Written

https://extractingcycles.com/blog/the-ipv4-parser-ai-couldnt-have-written/article/
5•csno•1h ago•0 comments

The ERoadBook – Rally-Inspired GPS Device for Riders and Adventurers [video]

https://www.youtube.com/watch?v=1KoQ-l3-tg8
3•arbayi•1h ago•0 comments