frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•9mo ago

Comments

zljdanceholic•9mo ago
How can we search the wanted key information from 10,000+ pages of PDFs within 2.5 hours? For fact check, how do we implement it so that answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge, which has 4500+ lines of code for implementing a high-performing RAG system. It might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly—and to motivate myself to learn its ins and outs—I’ve created a complete tutorial on this.

We have a complete graph to explain its workflow where multiple tools are used: Docling for parsing PDFs, LangChain for chunking text, faiss for vectorization and similarity searching, and chatgpt for LLMs.

Besides, I also outline the codeflow, demonstrating the running logic involving multiple python files where starters can easily get lost. Different files are colored differently. The purpose of showing this is not letting you memorize all of these file relationships. It works better for you to check the source code yourself and use this as a reference if you find yourself lost in the code.

The original project of Ilya Rice design its RAG system for answering questions of annual reports from companies, so he only designed three types of question response format for that challenge: a name, a number, or a boolean. But to ask questions about technical stuff, we absolutely ask general questions like How does RoPE work? to know about some concepts and the like Therefore, I further modify the system logic to fit this need by customizing an AnswerWithRAGContextExplanationPrompt class and automatically matching the most related chapter and corresponding pages via searching through all faiss databases (only retrieve the top-1) The final performance is demonstrated below (not cherry-picked, only tested once).

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All code are provided on Colab and the tutorial is referenced here. Hope this helps!

Show HN: Boolean-query-parser – From a 4-hour hack to 3k downloads

https://github.com/Piergiuseppe/boolean-query-parser
1•TheBuc•34s ago•0 comments

RCT: Vaporized cannabis versus placebo for acute migraine

https://headachejournal.onlinelibrary.wiley.com/doi/10.1111/head.70025
1•PaulHoule•36s ago•0 comments

Show HN: Local Voice Assistant

1•armcat•59s ago•0 comments

Sentinel – watch over your Tailscale network and notify of changes

https://github.com/jaxxstorm/sentinel
1•jaxxstorm•1m ago•0 comments

Temporal Raises $300M Series D to Make Agentic AI Real for Companies

https://temporal.io/news/temporal-raises-300M-to-make-agentic-ai-real-for-companies
1•eatonphil•1m ago•0 comments

Show HN: MAKO – Open protocol for LLM-optimized web content (93% fewer tokens)

https://makospec.vercel.app/en
1•juanisidoro•2m ago•1 comments

Show HN: Cai – AI actions on your clipboard, runs locally (macOS, open source)

https://github.com/soyasis/cai
1•soyasis•3m ago•0 comments

Show HN: Kremis – Deterministic memory graph for AI agents (Rust)

https://github.com/M2Dr3g0n/kremis
1•M2Dr3g0n•4m ago•0 comments

Instagram boss defends app in trial over alleged harms to kids

https://www.latimes.com/california/story/2026-02-11/instagram-adam-mosseri-social-media-lawsuit-t...
1•1vuio0pswjnm7•6m ago•0 comments

Java.evolved: Java has evolved. Your code can too

https://javaevolved.github.io
2•jongalloway2•8m ago•0 comments

Vibe coding broke the Ballmer Peak

https://www.adriankrebs.ch/blog/the-new-ballmer-peak/
1•hubraumhugo•8m ago•0 comments

Quiet: A private, P2P alternative to Slack and Discord built on Tor and IPFS

https://tryquiet.org/index.html
1•hliyan•9m ago•0 comments

Many consumer electronics manufacturers will bankrupt due to AI memory crisis

https://www.pcgamer.com/hardware/memory/many-consumer-electronics-manufacturers-will-go-bankrupt-...
1•taubek•10m ago•0 comments

EU launches probe into xAI over sexualized images

https://arstechnica.com/tech-policy/2026/02/eu-launches-probe-into-xai-over-sexualized-images/
1•ndsipa_pomu•10m ago•0 comments

Ukraine recaptures 201km² after shutdown of Russian forces' access to Starlink

https://www.france24.com/en/europe/20260216-ukraine-makes-fastest-battlefield-gain-in-2-5-years
1•barredo•10m ago•0 comments

Microsoft Is Auto-Enabling Passkeys in March 2026

https://entra.news/p/microsoft-is-auto-enabling-passkeys
1•vdelitz•10m ago•0 comments

Dutch Lawmakers Advance 36% Capital Gains Tax on Crypto

https://cryptonews.com/news/dutch-lawmakers-advance-36-capital-gains-tax-on-crypto/
1•brodouevencode•10m ago•0 comments

Show HN: CasperAI – A local MCP server for cross-platform engineering context

https://github.com/chose166/CasperAI
1•chose166•12m ago•1 comments

Show HN: Broomy – Open-source app for working with many AI agents at once

https://broomy.org/
1•robotelvis•12m ago•0 comments

Ask HN: How have your security policies kept up with AI?

1•frenchtoast8•12m ago•0 comments

Dutch cops arrest man after sending him confidential files by mistake

https://www.theregister.com/2026/02/16/dutch_cops_breach/
2•tchalla•12m ago•0 comments

The Answer Isn't Macro

https://www.mountaineagle.net/articles/display/?entry_short=the-answer-isnt-macro
1•retrocog•13m ago•0 comments

Lit: Version control where prompts are the source of truth

https://clintonboys.com/projects/lit/
1•mtsolitary•13m ago•0 comments

The oldest known vertebrates had two pairs of eyes

https://newatlas.com/biology/the-worlds-oldest-known-vertebrates-had-two-pairs-of-eyes/
1•Brajeshwar•14m ago•0 comments

5k-year-old bacteria thawed in Romanian ice cave

https://www.popsci.com/science/bacteria-ice-cave-romania/
1•Brajeshwar•14m ago•0 comments

Anthropic's CEO says we're in the 'centaur phase' of software engineering

https://www.businessinsider.com/anthropic-ceo-dario-amodei-centaur-phase-of-software-engineering-...
1•smurda•14m ago•2 comments

AI and the Age of Probabilistic Programming

https://www.abrahammarinperez.com/post/ai-and-the-age-of-probabilistic-programming
1•youknownothing•14m ago•0 comments

Robinhood Ventures Fund I

https://www.sec.gov/ix?doc=/Archives/edgar/data/0002085091/000162828026008313/ck0002085091-202602...
1•sahin•14m ago•0 comments

Show HN: PokeDex++ – I rebuilt my Pokémon app as a web app

https://pokedexplus.shop
1•meimeixoxi•15m ago•0 comments

Show HN: Kanban_P2P – A P2P Kanban board contained in a single HTML file

https://github.com/LukeB42/kanban_p2p
1•LukeB42•15m ago•0 comments