frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Are readers generating fiction with AI models?

https://arxiv.org/abs/2606.22748
1•ilamont•1m ago•0 comments

Devin Security Swarm

https://devin.ai/blog/security-swarm-eval/
1•meco•1m ago•0 comments

Wisk, Boeing's air taxi firm, rushed software testing, ex-employee claims

https://www.seattletimes.com/business/boeing-aerospace/wisk-boeings-air-taxi-firm-rushed-software...
1•Jtsummers•4m ago•0 comments

The Website Is Down

https://www.thewebsiteisdown.com/
1•kretaceous•7m ago•0 comments

Tech giants lose $2T in SpaceX's IPO month

https://english.elpais.com/economy-and-business/2026-07-01/tech-giants-lose-2-trillion-in-spacexs...
3•01-_-•8m ago•1 comments

The Regret We Get Wrong

https://jordangrumet.substack.com/p/the-regret-we-get-wrong
1•jader201•8m ago•0 comments

Show HN: Coding Agent Survey – Which coding agents do you use?

https://codingagentsurvey.org/
1•jacobgold•8m ago•1 comments

What do you mean by "Event-Driven"? (2017)

https://martinfowler.com/articles/201701-event-driven.html
1•adletbalzhanov•8m ago•0 comments

Show HN: I Made TS Compiler Graph MCP: 10x Fewer Tokens in Claude Code and Codex

https://github.com/samchon/ttsc/tree/master/packages/graph
1•autobe•12m ago•0 comments

FFmpeg's native AAC encoder has just been rewritten, and beats fdk_aac

https://xcancel.com/FFmpeg/status/2072320220509741087
2•wyattblue•12m ago•0 comments

Who needs a museum when there's a banana room in town?

https://www.nytimes.com/2026/07/01/arts/design/museum-of-ice-cream-companies-art.html
1•thebigship•14m ago•0 comments

Fedora: Future of Community Initiatives and AI Deveoper Desktop

https://discussion.fedoraproject.org/t/fedora-council-statement-on-the-future-of-community-initia...
2•logic•16m ago•0 comments

What are you, Claude Fable 5?

https://slug-kebabs.dev/blog/what-are-you/
1•jedwidz•16m ago•1 comments

Salt v1.0.0 – a systems language with Z3 theorem proving in the compiler

https://salt-lang.dev
2•bneb-dev•16m ago•0 comments

A complete ClickHouse OLAP engine, compiled to WebAssembly

https://wasm.chdb.io/
1•porridgeraisin•17m ago•0 comments

The average vibe coder experience

https://www.reddit.com/r/vibecoding/s/xqZ2MpJJ8R
1•eeko_systems•17m ago•0 comments

Belkin's Cable Won't Charge Your Switch 2 More Quickly

https://www.lttlabs.com/articles/2026/07/01/does-the-belkin-cable-charge-the-nintendo-switch-2-fa...
1•LabsLucas•17m ago•0 comments

When does redundancy become redundant?

https://www.bengodfrey.dev/blog/redundancy/
1•sudo-bendg•17m ago•0 comments

Consciousness: How 'working memory' may mysteriously give rise to it

https://theconversation.com/consciousness-how-working-memory-may-mysteriously-give-rise-to-it-283823
2•anarbadalov•17m ago•0 comments

Differentiation drives the erosion of positivity on social media

https://www.pnas.org/doi/10.1073/pnas.2527316123
2•marojejian•18m ago•1 comments

Codex reasoning-token clustering at 516 may be leading to degraded performance

https://github.com/openai/codex/issues/30364
1•0x_rs•19m ago•0 comments

I prompt AI to write at a 7th-grade level

https://anatoliybabushka.com/blog/ela7-writing-rule.html
1•bbsnly•19m ago•0 comments

Govt issues notice to WhatsApp over username feature

https://www.thehindu.com/sci-tech/technology/govt-issues-notice-to-whatsapp-over-username-feature...
3•thisislife2•19m ago•0 comments

LLMs are stuck in a groupthink groove. This startup is trying to get them out

https://www.technologyreview.com/2026/07/01/1140003/llms-are-stuck-in-a-groupthink-rut-this-start...
1•joozio•20m ago•0 comments

The Second Coming of the Command Line

https://cautomaton.com/articles/second-coming-of-the-command-line/
1•jaaron•20m ago•0 comments

PlayStation 5 Linux project gets upgraded to support new firmware and PS5 Slim

https://www.gamingonlinux.com/2026/07/playstation-5-linux-project-gets-upgraded-to-support-new-fi...
1•watermelon0•21m ago•0 comments

AI's next bottleneck is power

https://www.businessinsider.com/why-creator-economy-startup-founder-pivot-data-center-power-tar-2...
2•toredo1729_2•24m ago•0 comments

NanoShell: A preemptively multi-tasked 32-bit OS with a windowed GUI

https://github.com/iProgramMC/NanoShellOS
1•mrunix•24m ago•0 comments

Show HN: Classify mechanical faults using Contrastive Language-Audio Pretraining

https://github.com/adam-s/car-diagnosis
2•dataviz1000•24m ago•0 comments

Please Do Not Touch

https://readme.dm/please-do-not-touch/
1•dmadisetti•28m ago•0 comments
Open in hackernews

An Enterprise-Level Retrieval-Augmented Generation System

https://comfyai.app/article/llm-applications/enterprise-level-rag-hands-on-practice-II
6•zljdanceholic•1y ago

Comments

zljdanceholic•1y ago
How can we search the wanted key information from 10,000+ pages of PDFs within 2.5 hours? For fact check, how do we implement it so that answers are backed by page-level references, minimizing hallucinations?

RAG-Challenge-2 is a great open-source project by Ilya Rice that ranked 1st at the Enterprise RAG Challenge, which has 4500+ lines of code for implementing a high-performing RAG system. It might seem overwhelming to newcomers who are just beginning to learn this technology. Therefore, to help you get started quickly—and to motivate myself to learn its ins and outs—I’ve created a complete tutorial on this.

We have a complete graph to explain its workflow where multiple tools are used: Docling for parsing PDFs, LangChain for chunking text, faiss for vectorization and similarity searching, and chatgpt for LLMs.

Besides, I also outline the codeflow, demonstrating the running logic involving multiple python files where starters can easily get lost. Different files are colored differently. The purpose of showing this is not letting you memorize all of these file relationships. It works better for you to check the source code yourself and use this as a reference if you find yourself lost in the code.

The original project of Ilya Rice design its RAG system for answering questions of annual reports from companies, so he only designed three types of question response format for that challenge: a name, a number, or a boolean. But to ask questions about technical stuff, we absolutely ask general questions like How does RoPE work? to know about some concepts and the like Therefore, I further modify the system logic to fit this need by customizing an AnswerWithRAGContextExplanationPrompt class and automatically matching the most related chapter and corresponding pages via searching through all faiss databases (only retrieve the top-1) The final performance is demonstrated below (not cherry-picked, only tested once).

- What's the steps to train a nanoGPT from scratch? The answer is:

Training nanoGPT from scratch involves several clearly defined steps. First, set up the environment by installing necessary libraries, using either Anaconda or Google Colab, and then download the dataset (e.g., tinyShakespeare). Next, tokenize the text into numerical representations and split the data into training and validation sets. Define the model architecture including token/positional embeddings, transformer blocks with multi-head self-attention and feed-forward networks, and layer normalization. Configure training hyperparameters and set up an optimizer (such as AdamW). Proceed with a training loop that performs forward passes, computes loss, backpropagates, and updates parameters, while periodically evaluating performance on both training and validation data. Finally, use the trained model to generate new text from a given context.

All code are provided on Colab and the tutorial is referenced here. Hope this helps!