frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•9mo ago

Comments

zljdanceholic•9mo ago
How can we search the wanted key information from 10,000+ pages of PDFs within 2.5 hours? For fact check, how do we implement it so that answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge, which has 4500+ lines of code for implementing a high-performing RAG system. It might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly—and to motivate myself to learn its ins and outs—I’ve created a complete tutorial on this.

We have a complete graph to explain its workflow where multiple tools are used: Docling for parsing PDFs, LangChain for chunking text, faiss for vectorization and similarity searching, and chatgpt for LLMs.

Besides, I also outline the codeflow, demonstrating the running logic involving multiple python files where starters can easily get lost. Different files are colored differently. The purpose of showing this is not letting you memorize all of these file relationships. It works better for you to check the source code yourself and use this as a reference if you find yourself lost in the code.

The original project of Ilya Rice design its RAG system for answering questions of annual reports from companies, so he only designed three types of question response format for that challenge: a name, a number, or a boolean. But to ask questions about technical stuff, we absolutely ask general questions like How does RoPE work? to know about some concepts and the like Therefore, I further modify the system logic to fit this need by customizing an AnswerWithRAGContextExplanationPrompt class and automatically matching the most related chapter and corresponding pages via searching through all faiss databases (only retrieve the top-1) The final performance is demonstrated below (not cherry-picked, only tested once).

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All code are provided on Colab and the tutorial is referenced here. Hope this helps!

Agentic Engineering Best Practices

1•kingJulio•46s ago•0 comments

I tripled my SaaS prices after 2 weeks and signups didn't drop

https://web-production-71423.up.railway.app/
1•Shmungus•58s ago•1 comments

Show HN: One async PHP process serving web, REST API, and MCP for AI agents

https://pascualmg.dev/blog/pascual/one-async-php-process-web-server-rest-api-and-mcp-for-ai-agents
1•passh•1m ago•0 comments

Improving Interactive In-Context Learning from Natural Lang Feedback – DeepMind

https://arxiv.org/abs/2602.16066
1•zerop•5m ago•0 comments

Show HN: Behavr – Run realistic user simulations on your prototypes in minutes

1•Behavrai•10m ago•1 comments

Microsoft: Anti-phishing rules mistakenly blocked emails, Teams messages

https://www.bleepingcomputer.com/news/microsoft/microsoft-anti-phishing-rules-mistakenly-blocked-...
1•exploraz•12m ago•0 comments

Arpa.net

http://www.arpa.net/
1•TigerUniversity•14m ago•0 comments

Cloudflare uses ClickHouse to scale analytics at quadrillion-row scale

https://clickhouse.com/blog/cloudflare
2•samaysharma•14m ago•0 comments

Laurie Spiegel's pioneering '80s music software, Music Mouse, returns updated

https://djmag.com/news/laurie-spiegels-pioneering-80s-music-making-software-music-mouse-returns-m...
3•coffeeyesplease•15m ago•0 comments

A Constructive Look at TempleOS

http://www.codersnotes.com/notes/a-constructive-look-at-templeos/
1•TigerUniversity•17m ago•0 comments

Accenture combats AI refuseniks by linking promotions to log-ins

https://www.ft.com/content/ac672f97-a603-4c56-afa3-4a5273d45674
2•cianmm•24m ago•1 comments

Show HN: I Emulated My Childhood

https://sklivvz.com/posts/i-finally-emulated-my-childhood
1•sklivvz1971•28m ago•0 comments

Show HN: 17MB pronunciation scorer beats human experts at phoneme level

2•fabiosuizu•29m ago•0 comments

The Great Locomotive Chase

https://en.wikipedia.org/wiki/Great_Locomotive_Chase
2•keiferski•29m ago•0 comments

State of Generative Media

https://fal.ai/gen-media-report-volume-1
1•mdrzn•29m ago•0 comments

Brave Iranians gather in central Iran to honour those killed in uprising

https://twitter.com/IranIntl_En/status/2024767317893075330
3•ukblewis•30m ago•0 comments

Trump to order declassification of UFO/UAP related files

https://twitter.com/TrumpDailyPosts/status/2024661955479556382
1•lucasRW•31m ago•0 comments

Show HN: SaveTheTrade – a simple trade journal and performance tracker

https://survivethetrade.com/
1•daniellax•32m ago•0 comments

Wikipedia has deprecated and will blacklist archive.today

https://en.wikipedia.org/wiki/Wikipedia:Archive.today_guidance
4•gyrovague-com•35m ago•2 comments

TamboUI: A Modern Terminal UI Framework for Java (GraalVM Native)

https://github.com/tamboui/tamboui
1•mikepapadim•36m ago•0 comments

Optimism Plunges 28% as Base Drifts from OP Stack: What's Next?

https://timescrypto.com/cryptonews/altcoins/optimism-plunges-28-as-base-drifts-from-op-stack-what...
1•Alan_Writer•37m ago•0 comments

Show HN: Aismond – attack-surface monitoring for MSP client fleets

https://www.aismond.com/
1•mirceamitu•38m ago•0 comments

I discovered a hidden tragedy tied to Russia's most famous painting

https://www.theguardian.com/artanddesign/2026/feb/20/an-unknown-woman-how-i-discovered-a-hidden-t...
2•n1b0m•40m ago•0 comments

I have included AI in Corpus Lifetime free. net worth in one place

https://icorpus.vercel.app
1•mathan_karthik•43m ago•1 comments

Retrofitting Bridges with "Smart" Steel

https://www.empa.ch/web/s604/bruecken-mit-intelligentem-stahl-sanieren
1•JeanKage•44m ago•0 comments

CHERIoT Rust status update #0

https://rust.cheriot.org/2026/02/15/status-update.html
1•fanf2•44m ago•0 comments

Pens, pen clips, design engineer, obsession. Surprisingly interesting [video]

https://www.youtube.com/shorts/3i9FGaakX-Y
2•lifeisstillgood•46m ago•0 comments

Trump directs US govt to release files on 'alien and extraterrestrial life'

https://news.sky.com/story/trump-directs-us-government-to-release-files-on-alien-and-extraterrest...
2•iamben•48m ago•1 comments

Show HN: Delulu9 - SEO keyword research for Claude Code content pipeline

https://delulu9.com/
1•last_layer•58m ago•0 comments

Show HN: Chowser – A lightweight macOS browser chooser

https://github.com/bsreeram08/chowser
1•bsreeram08•1h ago•0 comments