frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•8mo ago

Comments

zljdanceholic•8mo ago
How can we search the wanted key information from 10,000+ pages of PDFs within 2.5 hours? For fact check, how do we implement it so that answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge, which has 4500+ lines of code for implementing a high-performing RAG system. It might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly—and to motivate myself to learn its ins and outs—I’ve created a complete tutorial on this.

We have a complete graph to explain its workflow where multiple tools are used: Docling for parsing PDFs, LangChain for chunking text, faiss for vectorization and similarity searching, and chatgpt for LLMs.

Besides, I also outline the codeflow, demonstrating the running logic involving multiple python files where starters can easily get lost. Different files are colored differently. The purpose of showing this is not letting you memorize all of these file relationships. It works better for you to check the source code yourself and use this as a reference if you find yourself lost in the code.

The original project of Ilya Rice design its RAG system for answering questions of annual reports from companies, so he only designed three types of question response format for that challenge: a name, a number, or a boolean. But to ask questions about technical stuff, we absolutely ask general questions like How does RoPE work? to know about some concepts and the like Therefore, I further modify the system logic to fit this need by customizing an AnswerWithRAGContextExplanationPrompt class and automatically matching the most related chapter and corresponding pages via searching through all faiss databases (only retrieve the top-1) The final performance is demonstrated below (not cherry-picked, only tested once).

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All code are provided on Colab and the tutorial is referenced here. Hope this helps!

Ask HN: Thoughts on maintaining anonymity against state-level actors?

1•adamanteye•2m ago•0 comments

Goatmire Elixir 2026 Announced

https://underjord.io/goatmire-2-announced.html
1•lawik•3m ago•0 comments

The hunt for a stolen Jackson Pollock – and answers to a family's pain

https://www.washingtonpost.com/entertainment/art/interactive/2026/jackson-pollock-theft-isaacs-fa...
1•prismatic•4m ago•0 comments

We asked four AI coding agents to rebuild Minesweeper–the results were explosive

https://arstechnica.com/ai/2025/12/the-ars-technica-ai-coding-agent-test-minesweeper-edition/
1•keyle•5m ago•0 comments

What Would AI Do?

https://www.amazon.com/dp/B0F99PJ7YK
1•jjuliano•8m ago•0 comments

Vespa.ai Blog: Embedding Tradeoffs, Quantified

https://blog.vespa.ai/embedding-tradeoffs-quantified/
1•goinglong•14m ago•0 comments

Show HN: A browser-based evolutionary simulation with emergent behavior

https://soupof.life/
1•maybe-tomorrow•16m ago•0 comments

Agent Skills are now available in Google Antigravity

https://antigravity.google/docs/skills
1•pretext•17m ago•0 comments

Iran Jamming of Starlink and Ways to Overcome Jamming

https://www.nextbigfuture.com/2026/01/iran-jamming-of-starlink-and-ways-to-overcome-jamming.html
1•ensocode•20m ago•0 comments

Show HN: Serverless GraphQL analytics framework for AWS

https://github.com/acikelli/oc-graphql
1•oacikelli•24m ago•0 comments

My parenting screen time philosophy

https://mikemcquaid.com/my-parenting-screen-time-philosophy/
1•wrxd•26m ago•0 comments

Pentagon Reportedly Testing Radio Wave Device Linked to 'Havana Syndrome'

https://www.scientificamerican.com/article/pentagon-reportedly-testing-radio-wave-device-linked-t...
1•beardyw•26m ago•0 comments

How to make users know that this is the heartopia wiki website they need?

https://www.heartopia-tips.com/
1•pacewang•27m ago•0 comments

CoreWeave Hid Data Center Delays, Investors Say

https://www.law360.com/articles/2429441/coreweave-hid-data-center-delays-investors-say
1•zerosizedweasle•28m ago•0 comments

Popular Python Libraries Used in Hugging Face Models

https://www.theregister.com/2026/01/13/ai_python_library_bugs_allow/
1•jruohonen•31m ago•1 comments

I Manage My Personal Infrastructure in 2026

https://taoofmac.com/space/blog/2026/01/09/1900
2•rcarmo•32m ago•0 comments

AI as Entertainment

https://arxiv.org/abs/2601.08768
2•50kIters•33m ago•1 comments

Sargasso Sea

https://en.wikipedia.org/wiki/Sargasso_Sea
2•thunderbong•35m ago•0 comments

Better `pre-commit`, re-engineered in Rust

https://github.com/j178/prek
2•vismit2000•35m ago•0 comments

Show HN: Drop The Adverb – Replace weak verb+adverb pairs

https://droptheadverb.com
1•ClozeLine•35m ago•1 comments

How to De-Risk from America

https://www.ft.com/content/331ec18b-9960-4954-8728-f2ff5bb5ee7b
4•KnuthIsGod•35m ago•1 comments

WordPress as a Headless CMS: A Production-Ready Plugin Stack

https://nesmachny.com/post/my-wordpress-stack-for-headless
1•taubek•36m ago•0 comments

Scott Adams, 'Dilbert' Creator, Dies at 68

https://variety.com/2026/artisans/people-news/scott-adams-dead-dilbert-creator-1236630162/
1•Rant423•40m ago•0 comments

Bad Science, Good Politics

https://shaemclaughlin.substack.com/p/bad-science-good-politics
1•MaysonL•40m ago•0 comments

The Internet Archive Crawler

https://github.com/internetarchive/heritrix3
1•dvrp•56m ago•0 comments

US State Department Threatens UK over Probe into Elon Musk's X

https://www.politico.eu/article/us-state-department-threaten-uk-probe-elon-musk-x-grok/
6•saubeidl•58m ago•1 comments

Ask HN: How do you apply for jobs in the age of AI?

1•surrTurr•58m ago•0 comments

I've created a prototype for the front-end of a website inside an AI chatbot

1•5color•59m ago•1 comments

Claude Cowork Runs Linux VM via Apple Virtualization Framework

https://gist.github.com/simonw/35732f187edbe4fbd0bf976d013f22c8
2•jumploops•1h ago•0 comments

Show HN: Gilda runs multiple LLMs, compares them, and merges the result

https://gildaapp.com/
1•osgohe•1h ago•1 comments