An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•8mo ago

Comments

zljdanceholic•8mo ago
How can we find key information in 10,000+ pages of PDFs within 2.5 hours? And for fact-checking, how do we make sure answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge. It implements a high-performing RAG system in 4,500+ lines of code, which can seem overwhelming to newcomers who are just beginning to learn this technology. To help you get started quickly, and to motivate myself to learn its ins and outs, I've created a complete tutorial on it.

The tutorial includes a complete diagram of the workflow, which uses several tools: Docling for parsing PDFs, LangChain for chunking text, FAISS for vector indexing and similarity search, and ChatGPT as the LLM.
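As a rough illustration of the chunk-then-search stage with page-level references, here is a minimal pure-Python sketch. It is not the RAG-Challenge-2 code: the bag-of-words `embed` function and the brute-force cosine search are stand-ins for a real embedding model and a FAISS index, and the names `Chunk`, `chunk_pages`, and `search` are hypothetical.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    page: int  # 1-based page number the text came from
    text: str


def chunk_pages(pages, max_words=50):
    """Split each page into chunks of at most max_words words, keeping the page number."""
    chunks = []
    for page_no, text in enumerate(pages, start=1):
        words = text.split()
        for i in range(0, len(words), max_words):
            chunks.append(Chunk(page_no, " ".join(words[i:i + max_words])))
    return chunks


def embed(text):
    """Toy embedding: lowercase word counts (assumption: stands in for a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def search(chunks, query, k=3):
    """Return the k best (score, chunk) pairs; brute force stands in for a FAISS index."""
    q = embed(query)
    scored = [(cosine(q, embed(c.text)), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Two made-up "pages" standing in for parsed PDF output.
pages = [
    "Revenue grew in the fiscal year and the board approved a dividend.",
    "Rotary position embeddings rotate query and key vectors by position.",
]
chunks = chunk_pages(pages)
best_score, best_chunk = search(chunks, "How do rotary position embeddings work?", k=1)[0]
print(best_chunk.page, round(best_score, 3))
```

Because every chunk carries its page number, whatever the LLM later says can be traced back to a specific page, which is what makes page-level citations possible.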

I also outline the code flow, showing how execution moves across multiple Python files, where beginners can easily get lost. Each file is shown in a different color. The point is not to have you memorize these file relationships; it works better to read the source code yourself and use the diagram as a reference when you find yourself lost.

Ilya Rice's original project was designed to answer questions about company annual reports, so it supports only three response formats for that challenge: a name, a number, or a boolean. For technical material, though, we need to ask general questions such as "How does RoPE work?" to learn about concepts. I therefore modified the system logic to fit this need by adding a custom AnswerWithRAGContextExplanationPrompt class and by automatically matching the most relevant chapter and its pages through a search across all FAISS databases (retrieving only the top-1 result). The final performance is demonstrated below (not cherry-picked, only tested once).
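The top-1 routing idea can be sketched roughly as follows. This is an assumption-laden toy, not the tutorial's implementation: each chapter "database" is a plain dict of page number to text, a Jaccard word-overlap score stands in for FAISS similarity, and `ExplanationAnswer`, `top1`, and `route` are hypothetical names (the real AnswerWithRAGContextExplanationPrompt class is not reproduced here).

```python
import re
from dataclasses import dataclass, field


@dataclass
class ExplanationAnswer:
    """Hypothetical free-text answer format, in contrast to name/number/boolean."""
    answer: str
    chapter: str
    pages: list = field(default_factory=list)


def tokens(text):
    """Lowercased set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def top1(index, query):
    """Best (score, page) inside one chapter index, using Jaccard word overlap."""
    q = tokens(query)
    return max(
        (len(q & tokens(text)) / (len(q | tokens(text)) or 1), page)
        for page, text in index.items()
    )


def route(indexes, query):
    """Pick the chapter whose top-1 hit scores highest across all chapter indexes."""
    best = None
    for chapter, index in indexes.items():
        score, page = top1(index, query)
        if best is None or score > best[0]:
            best = (score, chapter, page)
    return best


# Made-up chapter indexes: chapter title -> {page number: page text}.
indexes = {
    "Positional Encoding": {
        12: "RoPE rotates query and key vectors by an angle that depends on position.",
        13: "Sinusoidal encodings add fixed vectors to token embeddings.",
    },
    "Training": {
        40: "AdamW updates parameters after the loss is backpropagated.",
    },
}

score, chapter, page = route(indexes, "How does RoPE work?")
# The LLM call is omitted; the answer text below is only a placeholder.
result = ExplanationAnswer(answer="<LLM explanation goes here>", chapter=chapter, pages=[page])
print(result.chapter, result.pages)
```

Keeping only the single best chapter keeps the context passed to the LLM small, at the cost of missing answers that span several chapters.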

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All the code is provided on Colab, and the tutorial is referenced here. Hope this helps!
