frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•8mo ago

Comments

zljdanceholic•8mo ago
How can we search the wanted key information from 10,000+ pages of PDFs within 2.5 hours? For fact check, how do we implement it so that answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge, which has 4500+ lines of code for implementing a high-performing RAG system. It might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly—and to motivate myself to learn its ins and outs—I’ve created a complete tutorial on this.

We have a complete graph to explain its workflow where multiple tools are used: Docling for parsing PDFs, LangChain for chunking text, faiss for vectorization and similarity searching, and chatgpt for LLMs.

Besides, I also outline the codeflow, demonstrating the running logic involving multiple python files where starters can easily get lost. Different files are colored differently. The purpose of showing this is not letting you memorize all of these file relationships. It works better for you to check the source code yourself and use this as a reference if you find yourself lost in the code.

The original project of Ilya Rice design its RAG system for answering questions of annual reports from companies, so he only designed three types of question response format for that challenge: a name, a number, or a boolean. But to ask questions about technical stuff, we absolutely ask general questions like How does RoPE work? to know about some concepts and the like Therefore, I further modify the system logic to fit this need by customizing an AnswerWithRAGContextExplanationPrompt class and automatically matching the most related chapter and corresponding pages via searching through all faiss databases (only retrieve the top-1) The final performance is demonstrated below (not cherry-picked, only tested once).

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All code are provided on Colab and the tutorial is referenced here. Hope this helps!

Parents Are Going Broke from Their Kids' Sushi Obsession

https://www.wsj.com/lifestyle/parenting-food-diet-kids-sushi-8ff64063
1•kdazzle•28s ago•0 comments

EU countries have approved the Mercosur trade deal after 25 years of talks

https://www.politico.eu/article/eu-countries-approve-mercosur-trade-deal-for-signature/
1•paulpauper•3m ago•0 comments

Betterment users phished with real but unauthorized messages

https://www.reddit.com/u/bettermenthq/s/QaU43KrF4t
1•Ozzie_osman•3m ago•0 comments

Show HN: FeedPod – Convert your RSS feeds to personalized podcasts

https://feedpod.io/
1•thatloststudent•14m ago•0 comments

Deep sequence models tend to memorize geometrically; it is unclear why

https://arxiv.org/abs/2510.26745
1•erhuve•27m ago•0 comments

Show HN: CloudMasters – one TUI to rule them all, rent VPS from 58000 prices

https://github.com/BrowserBox/CloudMasters-Marketplace
1•keepamovin•31m ago•2 comments

The Incidence of Tariffs: Rates and Reality [pdf]

https://bfi.uchicago.edu/wp-content/uploads/2025/12/BFI_WP_2025-151.pdf
1•neehao•35m ago•0 comments

CDC staff 'blindsided' as child vaccine schedule unilaterally overhauled

https://www.unmc.edu/healthsecurity/transmission/2026/01/07/cdc-staff-blindsided-as-child-vaccine...
1•stopbulying•36m ago•0 comments

Anthropic cut off xAI's Claude access in Cursor

https://twitter.com/kyliebytes/status/2009686466746822731
2•meetpateltech•37m ago•0 comments

UN chief says US has 'legal obligation' to fund agencies after Trump withdrawal

https://www.pbs.org/newshour/world/un-chief-says-the-u-s-has-legal-obligation-to-fund-agencies-af...
2•stopbulying•39m ago•2 comments

Private Inference

https://confer.to/blog/2026/01/private-inference/
1•vishnukvmd•43m ago•0 comments

Bash by the Numbers

https://mckern.sh/post/bash-by-the-numbers/
2•thunderbong•43m ago•0 comments

Some first thoughts about live immersive basketball

https://sixcolors.com/post/2026/01/some-first-thoughts-about-live-immersive-basketball/
2•coloneltcb•49m ago•0 comments

Where's the $100k iPhone?

https://boydkane.com/essays/100k-iphone
1•zdw•54m ago•0 comments

MIT Non-AI License

4•dumindunuwan•55m ago•2 comments

Show HN: Understand the Picture of the Day

https://picture.learntosolveit.com
1•orsenthil•56m ago•0 comments

Haraltd – A cross-platform Bluetooth daemon with a JSON-based RPC

https://github.com/bluetuith-org/haraltd
1•darkhz•57m ago•0 comments

The Stick in the Stream

https://randsinrepose.com/archives/the-stick-in-the-stream/
1•zdw•59m ago•0 comments

MAKERphone 2: first modular DIY phone, no soldering

https://circuitmess.com/products/makerphone-2-0
2•nateb2022•59m ago•0 comments

Sodium-ion battery cells near lithium-ion cost parity, set to get cheaper

https://www.ess-news.com/2026/01/09/sodium-ion-battery-cells-already-near-lithium-ion-cost-parity...
1•toomuchtodo•59m ago•1 comments

OpenAI to Buy Pinterest? Strategic Analysis

https://nekuda.substack.com/p/openai-to-buy-pinterest-heres-what
1•gmays•1h ago•0 comments

Vajra BM25 is a fast BM25 implementation in Python

https://twitter.com/aiexplorations/status/2009846407881212136
1•aiexplorations•1h ago•1 comments

Show HN: A website to save moments that remind you of someone

https://thisremindedme.com/
1•Winggo•1h ago•0 comments

Google and chatbot startup Character move to settle teen suicide lawsuits

https://www.washingtonpost.com/technology/2026/01/07/google-character-settle-lawsuits-suicide/
1•1vuio0pswjnm7•1h ago•0 comments

Agent skills: what can go wrong?

https://github.com/pors/skill-audit
2•pors•1h ago•0 comments

You probably don't need Oh My Zsh

https://rushter.com/blog/zsh-shell/
55•fla•1h ago•36 comments

Fix Your Robots.txt or Your Site Disappears from Google

https://www.alanwsmith.com/en/37/wa/jz/s1/
2•qingcharles•1h ago•1 comments

Show HN: VoiceBrainDump – voice-first idea capture, single HTML file, offline

https://voicebraindump.app/
1•digi_wares•1h ago•0 comments

Show HN: Focus timer that turns hours into assets

https://seton.run/
1•keplerjst•1h ago•0 comments

Kazakhstan Launches First Institute of Transport Sciences and Technologies

https://qazinform.com/news/kazakhstan-launches-first-institute-of-transport-sciences-and-technolo...
1•Bolat14•1h ago•0 comments