An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•11mo ago

Comments

zljdanceholic•11mo ago
How can we find the key information in 10,000+ pages of PDFs within 2.5 hours? And for fact-checking, how do we make sure answers are backed by page-level references, so that hallucinations are minimized?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge. It has 4500+ lines of code implementing a high-performing RAG system, which might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly (and to motivate myself to learn its ins and outs), I've created a complete tutorial on it.

The tutorial includes a complete diagram of the workflow and the tools it uses: Docling for parsing PDFs, LangChain for chunking text, faiss for vector indexing and similarity search, and ChatGPT as the LLM.
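To make the chunk-then-retrieve part of that workflow concrete, here is a minimal sketch in plain Python. It is not the project's code: the function names are hypothetical, the bag-of-words `embed` is a toy stand-in for a real embedding model, and the brute-force cosine search only plays the role that faiss plays at scale. What it does show is how keeping the page number on every chunk lets a retrieved passage be cited at page level.

```python
import math
import re
from collections import Counter


def chunk_pages(pages, max_words=40):
    """Split each page into fixed-size word chunks, remembering the page number."""
    chunks = []
    for page_number, text in enumerate(pages, start=1):
        words = text.split()
        for start in range(0, len(words), max_words):
            chunk_text = " ".join(words[start:start + max_words])
            chunks.append({"page": page_number, "text": chunk_text})
    return chunks


def embed(text):
    """Toy bag-of-words vector (assumption: a real system calls an embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(chunks, question, top_k=2):
    """Brute-force similarity search; returns the top_k chunks with their pages."""
    query = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(query, embed(c["text"])), reverse=True)
    return ranked[:top_k]


# Made-up two-page "document" for illustration only.
pages = [
    "Revenue grew in 2023 driven by cloud subscriptions.",
    "The board appointed a new chief financial officer in March.",
]
hits = search(chunk_pages(pages), "Who is the new chief financial officer?", top_k=1)
print(hits[0]["page"], hits[0]["text"])
```

The page number travels with the chunk from parsing through retrieval, so the final answer can always point back to where its evidence came from.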

I also outline the code flow, showing the execution logic across multiple Python files, where beginners can easily get lost. Each file is shown in a different color. The point is not for you to memorize all of these file relationships; it works better to read the source code yourself and use the outline as a reference whenever you get lost.

Ilya Rice's original project was designed to answer questions about companies' annual reports, so it supports only three response formats for that challenge: a name, a number, or a boolean. For technical material, however, we also need to ask open-ended questions such as "How does RoPE work?" to learn about concepts. I therefore modified the system logic to fit this need by adding a custom AnswerWithRAGContextExplanationPrompt class and by automatically matching the most relevant chapter and its corresponding pages through a search across all faiss databases (retrieving only the top-1 result). The final performance is shown below (not cherry-picked, tested only once).
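The top-1 matching across databases can be sketched as follows. This is only an illustration under stated assumptions: each "database" here is a plain list of (page, text) pairs rather than a faiss index, the token-overlap score stands in for vector similarity, and the chapter names and passages are made up.

```python
import re


def tokens(text):
    """Lowercased alphanumeric tokens of a string, as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap_score(question, passage):
    """Toy relevance score: fraction of question tokens found in the passage."""
    question_tokens = tokens(question)
    if not question_tokens:
        return 0.0
    return len(question_tokens & tokens(passage)) / len(question_tokens)


def route_question(question, databases):
    """Return (chapter, page, score) for the single best hit across all databases."""
    best = None
    for chapter, passages in databases.items():
        for page, text in passages:
            score = overlap_score(question, text)
            if best is None or score > best[2]:
                best = (chapter, page, score)
    return best


# Hypothetical per-chapter stores: chapter name -> list of (page, passage text).
databases = {
    "positional-encoding": [
        (12, "RoPE rotates query and key vectors by a position dependent angle."),
    ],
    "training": [
        (40, "AdamW is a common optimizer for training transformer models."),
    ],
}

chapter, page, score = route_question("How does RoPE work?", databases)
print(chapter, page, score)
```

Keeping only the single best hit trades recall for simplicity: the answer is grounded in one chapter and its pages, which keeps the cited references easy to check.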

- What are the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.
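For illustration, an explanation-style answer like the one above could sit next to the three original formats in a single response schema. The sketch below is an assumption, not the project's code: the `Answer` dataclass, its field names, and the `"explanation"` kind are hypothetical, and the actual AnswerWithRAGContextExplanationPrompt class may be structured quite differently.

```python
from dataclasses import dataclass, field
from typing import List, Union

# Hypothetical mapping from answer kind to the Python type(s) it must carry.
ALLOWED_KINDS = {
    "name": str,
    "number": (int, float),
    "boolean": bool,
    "explanation": str,  # the open-ended format added for technical questions
}


@dataclass
class Answer:
    kind: str
    value: Union[str, int, float, bool]
    pages: List[int] = field(default_factory=list)

    def __post_init__(self):
        if self.kind not in ALLOWED_KINDS:
            raise ValueError(f"unknown answer kind: {self.kind!r}")
        # bool is a subclass of int in Python, so reject it explicitly for numbers.
        if self.kind == "number" and isinstance(self.value, bool):
            raise TypeError("a number answer must not be a boolean")
        if not isinstance(self.value, ALLOWED_KINDS[self.kind]):
            raise TypeError(f"value does not match kind {self.kind!r}")
        # Page-level references are what make the answer checkable.
        if not self.pages:
            raise ValueError("an answer must cite at least one page")


answer = Answer(kind="explanation", value="Tokenize the text, then train.", pages=[3, 4])
print(answer.kind, answer.pages)
```

Rejecting answers without page references at construction time is one way to enforce that every response stays traceable to its source pages.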

All the code is available on Colab, and the tutorial is linked here. Hope this helps!
