
An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•11mo ago

Comments

zljdanceholic•11mo ago
How can we find the key information we need across 10,000+ pages of PDFs within 2.5 hours? And for fact-checking, how do we make sure answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge. It implements a high-performing RAG system in 4500+ lines of code, which can seem overwhelming to newcomers who are just beginning to learn this technology. To help you get started quickly, and to motivate myself to learn its ins and outs, I've created a complete tutorial on it.

The tutorial includes a complete diagram of the workflow, which uses several tools: Docling for parsing PDFs, LangChain for chunking text, FAISS for vector indexing and similarity search, and ChatGPT as the LLM.
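As a rough illustration of the retrieval half of that workflow, here is a minimal standard-library sketch. It is not the RAG-Challenge-2 code: the bag-of-words `embed` function stands in for a real embedding model, the linear scan in `search` stands in for a FAISS index, and the names `Chunk`, `chunk_pages`, and `search` are hypothetical. What it does show is how keeping the page number on every chunk lets an answer cite page-level references.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    text: str
    page: int  # page number the chunk came from, kept for citations


def chunk_pages(pages: list[str], max_words: int = 50) -> list[Chunk]:
    """Split each page into fixed-size word windows, remembering the source page."""
    chunks = []
    for page_no, page_text in enumerate(pages, start=1):
        words = page_text.split()
        for start in range(0, len(words), max_words):
            chunks.append(Chunk(" ".join(words[start:start + max_words]), page_no))
    return chunks


def embed(text: str) -> Counter:
    """Toy bag-of-words vector (assumption: a real system calls an embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query: str, chunks: list[Chunk], top_k: int = 3) -> list[tuple[float, Chunk]]:
    """Linear-scan similarity search (assumption: FAISS would replace this at scale)."""
    q = embed(query)
    scored = [(cosine(q, embed(c.text)), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Made-up sample pages, one string per page.
pages = [
    "Revenue grew to 12 million dollars in fiscal year 2023.",
    "The board appointed a new chief executive officer in March.",
    "Operating costs fell because of lower energy prices.",
]
chunks = chunk_pages(pages)
hits = search("Who is the new chief executive officer?", chunks, top_k=1)
cited_pages = [chunk.page for _, chunk in hits]
print(cited_pages)
```

The retrieved chunks would then be passed to the LLM together with their page numbers, so the final answer can list the pages it relied on.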

I also outline the code flow, showing the execution logic across the multiple Python files where beginners can easily get lost. Different files are shown in different colors. The point is not for you to memorize all of these file relationships; it works better to read the source code yourself and use the outline as a reference whenever you get lost.

Ilya Rice's original project was designed to answer questions about companies' annual reports, so it supports only three response formats for that challenge: a name, a number, or a boolean. But for technical material we inevitably ask general questions, such as "How does RoPE work?", to understand concepts. I therefore modified the system logic to fit this need by adding a custom AnswerWithRAGContextExplanationPrompt class and by automatically matching the most relevant chapter and its corresponding pages through a search across all FAISS databases (retrieving only the top-1 result). The final performance is demonstrated below (not cherry-picked, only tested once).
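The chapter-matching step can be sketched as follows. This is a simplified standard-library illustration, not the tutorial's implementation: each "database" is a plain list of (page, text) pairs rather than a FAISS index, word-overlap (Jaccard) scoring replaces embedding similarity, and `route_question` and the sample chapter names are hypothetical. It queries every chapter's store, keeps only the best hit from each, and returns the chapter and page with the highest score.

```python
import re


def tokens(text: str) -> set[str]:
    """Lowercased alphanumeric word set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def score(query: str, passage: str) -> float:
    """Jaccard overlap (assumption: stands in for embedding similarity)."""
    q, p = tokens(query), tokens(passage)
    union = q | p
    return len(q & p) / len(union) if union else 0.0


def route_question(query, databases):
    """Search every chapter store, keep its top-1 hit, return the overall best.

    `databases` maps a chapter name to a list of (page, text) pairs.
    Returns a (chapter, page, score) tuple, or None if all stores are empty.
    """
    best = None
    for chapter, passages in databases.items():
        if not passages:
            continue
        # Top-1 retrieval inside this chapter's store.
        page, text = max(passages, key=lambda item: score(query, item[1]))
        candidate = (chapter, page, score(query, text))
        if best is None or candidate[2] > best[2]:
            best = candidate
    return best


# Hypothetical per-chapter stores with made-up page numbers.
databases = {
    "Positional encoding": [
        (12, "RoPE rotates query and key vectors by a position dependent angle."),
        (13, "Sinusoidal encodings add fixed vectors to token embeddings."),
    ],
    "Training": [
        (40, "AdamW is a common optimizer for training transformer models."),
    ],
}
chapter, page, best_score = route_question("How does RoPE work?", databases)
print(chapter, page)
```

Once the chapter and page are chosen, only that chapter's context is handed to the explanation prompt, which keeps the LLM input small and the page reference easy to verify.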

- What are the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All the code is available on Colab, and the tutorial is at the link above. Hope this helps!
