frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•9mo ago

Comments

zljdanceholic•9mo ago
How can we search the wanted key information from 10,000+ pages of PDFs within 2.5 hours? For fact check, how do we implement it so that answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge, which has 4500+ lines of code for implementing a high-performing RAG system. It might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly—and to motivate myself to learn its ins and outs—I’ve created a complete tutorial on this.

We have a complete graph to explain its workflow where multiple tools are used: Docling for parsing PDFs, LangChain for chunking text, faiss for vectorization and similarity searching, and chatgpt for LLMs.

Besides, I also outline the codeflow, demonstrating the running logic involving multiple python files where starters can easily get lost. Different files are colored differently. The purpose of showing this is not letting you memorize all of these file relationships. It works better for you to check the source code yourself and use this as a reference if you find yourself lost in the code.

The original project of Ilya Rice design its RAG system for answering questions of annual reports from companies, so he only designed three types of question response format for that challenge: a name, a number, or a boolean. But to ask questions about technical stuff, we absolutely ask general questions like How does RoPE work? to know about some concepts and the like Therefore, I further modify the system logic to fit this need by customizing an AnswerWithRAGContextExplanationPrompt class and automatically matching the most related chapter and corresponding pages via searching through all faiss databases (only retrieve the top-1) The final performance is demonstrated below (not cherry-picked, only tested once).

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All code are provided on Colab and the tutorial is referenced here. Hope this helps!

US Military disrupts cell phones in Texas after UFO reports

https://www.dailymail.co.uk/sciencetech/article-15557509/black-outs-Texas-El-Paso-airspace-shutdo...
1•Bender•1m ago•0 comments

Show HN: Mimir – Cursor for Product Managers

https://www.mimir.build/
1•schreibertuc•3m ago•0 comments

The first Android 17 beta is now available on Pixel devices

https://arstechnica.com/gadgets/2026/02/the-first-android-17-beta-is-now-available-on-pixel-devices/
1•Bender•3m ago•0 comments

I have been banned from Gemini

https://twitter.com/sschueller/status/2022417041555444142
3•sschueller•5m ago•1 comments

Chemical habitability of Earth and rocky planets prescribed by core formation

https://www.nature.com/articles/s41550-026-02775-z
1•Tomte•6m ago•0 comments

15× vs. ~1.37×: Recalculating GPT-5.3-Codex-Spark on SWE-Bench Pro

https://twitter.com/nvanlandschoot/status/2022385829596078100
1•nvanlandschoot•6m ago•1 comments

Show HN: MCP tools that let AI agents dispatch batteries and prove carbon

https://energyatit.com/developers
1•kasathur•8m ago•0 comments

Driverless trucks can now travel farther distances faster than human drivers

https://techcrunch.com/2026/02/12/auroras-driverless-trucks-can-now-travel-farther-distances-fast...
1•jimt1234•8m ago•0 comments

AI safety leader says 'world is in peril' and quits to study poetry

https://www.bbc.com/news/articles/c62dlvdq3e3o
1•darod•8m ago•0 comments

Drive the 'ice road', Estonians told – just don't fasten your seatbelt

https://www.theguardian.com/world/2026/feb/10/estonia-ice-road-frozen-sea-saaremaa-hiiumaa
1•eatonphil•8m ago•0 comments

Software Provenance

https://blog.jsbarretto.com/post/provenance
1•smartmic•9m ago•0 comments

Show HN: AccessiGuard – Web accessibility scanner with AI fix suggestions

https://accessiguard.app
1•PrimeStark•10m ago•0 comments

VC-backed unicorns are losing their horns

https://www.axios.com/2026/02/13/vc-unicorn-companies
1•toomuchtodo•16m ago•1 comments

ArsTechnica seemingly using AI to write an article about AI impersonation

https://arstechnica.com/ai/2026/02/after-a-routine-code-rejection-an-ai-agent-published-a-hit-pie...
3•AdmiralAsshat•18m ago•0 comments

How Nintendo Became the Most Fun Video Game Company

https://www.nytimes.com/2026/02/06/books/review/podcast-keza-macdonald-nintendo.html
2•CharlesW•19m ago•0 comments

Txtbrd

https://txtbrd.com
2•1o1o1o1o1•20m ago•0 comments

Perfect Squares and Pythagorean Triples on the Ulam Spiral

https://www.youtube.com/watch?v=x4ooQSrdz6g
1•nyc111•22m ago•0 comments

Possible identification of the Luna 9 Moon landing site using machine learning

https://www.nature.com/articles/s44453-025-00020-x
1•geox•24m ago•0 comments

We allowed remote code execution (but safely)

https://tumuchdata.club/post/coding-challenge-infrastructure/
1•todsacerdoti•24m ago•0 comments

Sovereign Code from the Heart of Suffering: Injecting Logic into AI

Https://paragraph.com/@0x4fd3729a4fedf54a74b73d93f7f775a1ef520cec/sovereign-logic-injection-how-t...
1•suffering•24m ago•1 comments

Resurrected nitrogenases recapitulate N-isotope biosignatures over 2B years

https://pmc.ncbi.nlm.nih.gov/articles/PMC9755046/
1•PaulHoule•24m ago•1 comments

Where will China get its compute in 2026?

https://www.the-substrate.net/p/where-will-china-get-its-compute
1•erwald•24m ago•0 comments

Show HN: Superposition, open source access to Claude Code or Codex from anywhere

https://github.com/trezm/superposition
1•trezm•25m ago•0 comments

The EU moves to kill infinite scrolling

https://www.politico.eu/article/tiktok-meta-facebook-instagram-brussels-kill-infinite-scrolling/
50•danso•26m ago•47 comments

No Here on Slack

https://noathere.org/
2•jcmuller•28m ago•0 comments

Humans as Constancy Anchors: A Response to 'Something Big Is Happening'

2•mrev2•30m ago•1 comments

Show HN: ARA-Engine – Modeling the Alberta power grid transition in Python

https://github.com/ada33934/ARA-Engine
1•ada33934•30m ago•0 comments

The AI hater's guide to code with LLMs

https://aredridel.dinhe.net/2026/02/12/the-ai-haters-guide-to-code-with-llms/
3•speckx•33m ago•0 comments

Show HN: An MCP server that gives AI assistants a live Mermaid diagram canvas

https://github.com/iishyfishyy/mermaid-live-mcp
1•ishyfishyy•34m ago•0 comments

Ask HN: Are there examples of 3D printing data onto physical surfaces?

1•catapart•34m ago•0 comments