An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•9mo ago

Comments

zljdanceholic•9mo ago
How can we find the key information we need in 10,000+ pages of PDFs within 2.5 hours? And for fact-checking, how do we make sure answers are backed by page-level references, so that hallucinations are minimized?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge. It implements a high-performing RAG system in 4500+ lines of code, which can seem overwhelming to newcomers who are just beginning to learn this technology. So, to help you get started quickly (and to motivate myself to learn its ins and outs), I've created a complete tutorial on it.

The tutorial includes a complete diagram of the workflow, which uses several tools: Docling for parsing PDFs, LangChain for chunking text, faiss for vector indexing and similarity search, and ChatGPT as the LLM.
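
As a rough illustration of the chunk-then-retrieve part of that workflow, here is a minimal, dependency-free sketch. It is not the project's code: a bag-of-words cosine similarity stands in for the embedding model and the faiss index, a fixed-size word splitter stands in for the LangChain chunker, and the names `Chunk`, `chunk_pages`, and `search` are hypothetical. The sample pages are made up. The only point is that each chunk keeps its page number, so every retrieved passage can be cited at page level.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    page: int  # 1-based page number the text came from
    text: str


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def chunk_pages(pages: list[str], max_words: int = 40) -> list[Chunk]:
    """Split each page into chunks of at most max_words words, keeping the page number."""
    chunks = []
    for page_no, page_text in enumerate(pages, start=1):
        words = page_text.split()
        for start in range(0, len(words), max_words):
            chunks.append(Chunk(page_no, " ".join(words[start:start + max_words])))
    return chunks


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def search(query: str, chunks: list[Chunk], top_k: int = 2) -> list[tuple[float, Chunk]]:
    """Return the top_k (score, chunk) pairs, highest score first."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(c.text))), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Made-up sample pages standing in for parsed PDF output.
pages = [
    "Rotary position embeddings rotate query and key vectors by a position dependent angle.",
    "The optimizer used for training is AdamW with a cosine learning rate schedule.",
]
chunks = chunk_pages(pages)
hits = search("Which optimizer is used for training?", chunks, top_k=1)
best_page = hits[0][1].page
print(f"best match is on page {best_page}")  # prints "best match is on page 2"
```

In a real setup the term counts would be replaced by dense embeddings and the linear scan by an index lookup, but the page number would travel with each chunk in the same way.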

I also outline the code flow, showing how execution moves across the multiple Python files, where beginners can easily get lost. Each file has its own color. The point is not for you to memorize all of these file relationships; it works better to read the source code yourself and use the outline as a reference whenever you get lost.

Ilya Rice's original project was designed to answer questions about companies' annual reports, so it only supports three response formats for that challenge: a name, a number, or a boolean. But when asking about technical material, we naturally want to ask general questions such as "How does RoPE work?" to learn about concepts. I therefore modified the system logic to fit this need by adding a custom AnswerWithRAGContextExplanationPrompt class and by automatically matching the most relevant chapter and its pages through a search across all faiss databases (retrieving only the top-1 result). The final performance is shown below (not cherry-picked, only tested once).
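
To make the chapter-matching idea concrete, here is a small sketch of top-1 routing across several indexes. It is an illustration under assumptions, not the tutorial's implementation: plain Python lists stand in for the per-chapter faiss databases, Jaccard token overlap stands in for embedding similarity, and `route_query`, `chapter_indexes`, the chapter names, and the page numbers are all hypothetical.

```python
import re


def tokens(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap_score(query: str, passage: str) -> float:
    """Jaccard overlap of token sets; a toy stand-in for embedding similarity."""
    q, p = tokens(query), tokens(passage)
    return len(q & p) / len(q | p) if q | p else 0.0


def route_query(query, chapter_indexes):
    """Take the top-1 hit from every chapter index and return the best one overall.

    chapter_indexes maps a chapter name to a list of (page, text) pairs.
    Returns (chapter, page, score), or None if every index is empty.
    """
    best = None
    for chapter, passages in chapter_indexes.items():
        if not passages:
            continue
        # Top-1 within this chapter's index.
        page, text = max(passages, key=lambda item: overlap_score(query, item[1]))
        score = overlap_score(query, text)
        if best is None or score > best[2]:
            best = (chapter, page, score)
    return best


# Made-up chapters and pages, for illustration only.
chapter_indexes = {
    "positional-encoding": [
        (12, "RoPE encodes position by rotating query and key vectors."),
        (13, "Sinusoidal encodings add fixed position vectors to token embeddings."),
    ],
    "training": [
        (40, "The training loop computes the loss and updates parameters with AdamW."),
    ],
}
chapter, page, score = route_query("How does RoPE work?", chapter_indexes)
print(chapter, page)  # prints "positional-encoding 12"
```

Keeping only the top-1 hit per index keeps the comparison cheap, at the cost of missing chapters whose relevant text is spread over several weaker matches.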

- What are the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All the code is provided on Colab, and the tutorial is at the link above. Hope this helps!
