frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•1y ago

Comments

zljdanceholic•1y ago
How can we search the wanted key information from 10,000+ pages of PDFs within 2.5 hours? For fact check, how do we implement it so that answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge, which has 4500+ lines of code for implementing a high-performing RAG system. It might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly—and to motivate myself to learn its ins and outs—I’ve created a complete tutorial on this.

We have a complete graph to explain its workflow where multiple tools are used: Docling for parsing PDFs, LangChain for chunking text, faiss for vectorization and similarity searching, and chatgpt for LLMs.

Besides, I also outline the codeflow, demonstrating the running logic involving multiple python files where starters can easily get lost. Different files are colored differently. The purpose of showing this is not letting you memorize all of these file relationships. It works better for you to check the source code yourself and use this as a reference if you find yourself lost in the code.

The original project of Ilya Rice design its RAG system for answering questions of annual reports from companies, so he only designed three types of question response format for that challenge: a name, a number, or a boolean. But to ask questions about technical stuff, we absolutely ask general questions like How does RoPE work? to know about some concepts and the like Therefore, I further modify the system logic to fit this need by customizing an AnswerWithRAGContextExplanationPrompt class and automatically matching the most related chapter and corresponding pages via searching through all faiss databases (only retrieve the top-1) The final performance is demonstrated below (not cherry-picked, only tested once).

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All code are provided on Colab and the tutorial is referenced here. Hope this helps!

A detailed introduction to Kakoune for the aspiring power user

https://ficd.sh/blog/kakoune-is-a-text-editor/
1•birdculture•3m ago•0 comments

Collect as many lottery tickets as you can (2022)

https://adayeo.substack.com/p/collect-as-many-lottery-tickets-as
1•itzlambda•3m ago•0 comments

Open-Source Agentic QA Harness with Memory

https://github.com/vostride/agent-qa
1•pranshuchittora•5m ago•1 comments

I built a free Open Sourced, local audio stem separation

https://github.com/stemdeckapp/stemdeck
1•thclpr•7m ago•0 comments

More Tagged Union Subsets with Comptime in Zig

https://sinclairtarget.com/blog/2026/05/18/even-more-tagged-union-subsets-with-comptime/
1•xngbuilds•8m ago•0 comments

The small sample trap in A/B testing

https://hadid.dev/posts/averages-lie/
1•mustaphah•9m ago•0 comments

Secure Boot Certificate Expiry (Windows and Linux)

https://www.youtube.com/watch?v=_AwzaZmRNsI
1•nullpwr•12m ago•0 comments

The Windows DLL loader lock: how a Rust thread can hang your JVM

https://questdb.com/blog/windows-dll-loader-lock-rust-jni-deadlock/
1•bluestreak•13m ago•0 comments

Prejudice and truth about the effect of testosterone on bargaining behaviour

https://www.nature.com/articles/nature08711
1•mpweiher•17m ago•0 comments

MCP Tool Routing Has a Security Problem Nobody Is Talking About

https://medium.com/@will.jh75/the-hidden-flaws-of-mcp-routing-and-why-we-need-to-talk-about-them-...
1•rogueparticle•17m ago•0 comments

Show HN: Blog post and slideshow automatic generator

https://slidio.xyz/
1•oyaa52•19m ago•0 comments

Bournegol???

https://oldhome.schmorp.de/marc/bournegol.html
1•greyface-•26m ago•0 comments

Blog post: why and how we built local-first with Zero (prev. Replicache)

https://ano.chat/blog/why-we-built-ano-on-zero
1•bill-cupid•27m ago•0 comments

What changes when AI reads you first

https://onomeokajevo.substack.com/p/stop-telling-ai-to-sound-like-you
1•snoren•27m ago•0 comments

One Mars spacecraft, two senators, and a cloud of questions

https://arstechnica.com/space/2026/05/one-mars-spacecraft-two-senators-and-a-cloud-of-questions/
1•rbanffy•28m ago•0 comments

Do you value tight machining in everyday carry knives?

https://www.paragon-knives.com/
1•bgzlsxaz•28m ago•0 comments

Show HN: Resilient, A composable async resilience toolkit for rust

https://github.com/resilient-rs/resilient
2•yofabr•29m ago•0 comments

Extensy – turn any prompt into a monetizable browser extension in 2 minutes

https://extensy.dev/
5•truetemir•32m ago•1 comments

Unprecedented 19 Day Type IV Radio Burst as a Corotating Electron Reservoir

https://iopscience.iop.org/article/10.3847/2041-8213/ae5537
1•fodmap•36m ago•0 comments

Social Media Zero (2017)

https://leejo.github.io/2017/09/27/social_media_zero/
2•chistev•37m ago•0 comments

The US space enterprise is desperately waiting for Starship–will it deliver?

https://arstechnica.com/space/2026/05/the-us-space-enterprise-is-desperately-waiting-for-starship...
2•rbanffy•37m ago•1 comments

I've created a platform where sites get paid not to show ads

https://medium.com/@laurynas.karvelis_95228/is-monetising-your-site-possible-without-serving-ads-...
2•luggage_bazooka•39m ago•0 comments

Elon Musk Loses OpenAI Lawsuit After Jury Finds It Was Filed Too Late

https://firethering.com/elon-musk-openai-lawsuit-lost/
1•steveharing1•42m ago•2 comments

I built a Slack client because:wave: was lagging

https://grant.dev/posts/built-a-slack-client
1•figmert•43m ago•0 comments

AI-driven development – It's a spectrum

https://avohq.io/blog/ai-driven-development-it-s-a-spectrum
1•adrianthedev•44m ago•0 comments

List of price of medieval items (2006)

https://medieval.ucdavis.edu/120D/Money.html
1•downbad_•47m ago•0 comments

Ask HN: How to enforce engineers to understand the code they are shipping

3•hchua•47m ago•2 comments

Why Education Startups Do Not Succeed (2011)

https://avichal.com/2011/10/07/why-education-startups-do-not-succeed/
1•downbad_•48m ago•0 comments

Show HN: Cervantes yet Another HN Reader

https://github.com/nhdez/cervantes
2•pelagicAustral•49m ago•0 comments

The highest ROI activity in AI isn't on your screen

https://layerx.xyz/blog/sim-recap
4•supermalvo•54m ago•0 comments