Show HN: The biggest achievement of my life so far

https://github.com/adityaprasad-sudo/Explore-Singapore
6•ambitious_potat•1h ago
Hello everyone,

I have always loved coding, and I had been thinking of making an open source project. It turned out to be awesome, and I hope you like it.

I present Explore Singapore, an open-source intelligence engine I created to run retrieval-augmented generation (RAG) over Singapore's public policy documents, legal statutes, and historical archives.

The objective was to build a domain-specific search engine that lets LLM systems reduce errors by using government documents as their exclusive information source.

What my project does:- It provides legal information faster and more reliably (thanks to RAG) without making you go through long PDFs on government websites, and it helps travellers get insights about Singapore faster.

Target audience:- Python developers who keep hearing about "RAG" and AI agents but haven't built one yet, or who are building one and are stuck somewhere. Also Singaporeans (obviously!).

Comparison:- Raw LLM vs RAG-based LLM. To test the RAG implementation, I compared the output of the standard models (Gemini/Arcee AI/Groq) against the same models with custom system instructions and RAG. The results were shocking.

Query: "Can I fly a drone in a public park?"
Standard LLM response: gave generic advice about "checking local laws" and safety guidelines.
Customized LLM with RAG: cited the Air Navigation Act, specified the 5 km no-fly zones, and linked to the CAAS permit page.

The difference was clear, and the RAG answer was grounded in cited sources rather than hallucinated.
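
To make the comparison concrete, here is a minimal sketch of how the two prompts might differ. It is not taken from the repository: the function names, the prompt wording, and the placeholder chunk (file name, page, text) are all assumptions for illustration.

```python
def build_plain_prompt(question: str) -> str:
    """Prompt for the standard LLM: the question and nothing else."""
    return f"Question: {question}\nAnswer:"


def build_rag_prompt(question: str, chunks: list[dict]) -> str:
    """Prompt for the RAG setup: retrieved passages plus citation rules."""
    context = "\n".join(
        f"[{i}] ({c['source']}, p. {c['page']}) {c['text']}"
        for i, c in enumerate(chunks, start=1)
    )
    return (
        "Answer using ONLY the numbered passages below and cite them like [1].\n"
        "If the passages do not contain the answer, say so.\n\n"
        f"{context}\n\nQuestion: {question}\nAnswer:"
    )


# Hypothetical retrieved chunk: file name, page, and text are placeholders.
chunks = [{"source": "example_act.pdf", "page": 12, "text": "Placeholder passage text."}]
question = "Can I fly a drone in a public park?"

plain = build_plain_prompt(question)
rag = build_rag_prompt(question, chunks)
```

The only difference between the two calls is the context block and the instruction to cite it, which is what lets the model name a specific act instead of giving generic advice.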

Ingestion:- The RAG architecture covers about 594 PDFs of Singaporean laws and acts, which contain roughly 33,000 pages in total.

How did I do it:- I used Google Colab to build the vector database and metadata. Converting the PDFs to vectors took nearly 1 hour.

How accurate is it:- It's still in the development phase, but it already provides near-accurate information because it uses multi-query retrieval. For example, if a user asks "ease of doing business in Singapore", the logic breaks the query into the keywords "ease", "business", and "Singapore" and returns the relevant documents from the PDFs along with the page number. It's a little hard to explain, but you can check it on my webpage. It's not perfect, but hey, I am still learning.
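
The keyword splitting described above could look roughly like the sketch below. This is an assumption-heavy toy: the stopword list, the substring matching (a stand-in for the real vector search), and the sample pages are all invented for illustration and are not the project's code.

```python
from collections import Counter

# Hypothetical stopword list, chosen so the example query splits as described.
STOPWORDS = {"of", "in", "the", "a", "an", "to", "for", "on", "is", "doing"}


def split_query(query: str) -> list[str]:
    """Break a question into lowercase keywords, dropping stopwords."""
    words = [w.strip("?.,!\"'").lower() for w in query.split()]
    return [w for w in words if w and w not in STOPWORDS]


def multi_query_search(query: str, pages: list[dict], top_k: int = 3) -> list[tuple]:
    """Run one lookup per keyword and rank pages by how many keywords hit."""
    hits = Counter()
    for keyword in split_query(query):
        for idx, page in enumerate(pages):
            # Substring match stands in for a per-keyword vector search.
            if keyword in page["text"].lower():
                hits[idx] += 1
    ranked = sorted(hits.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(pages[i]["source"], pages[i]["page"], n) for i, n in ranked[:top_k]]


# Placeholder pages; real entries would come from the parsed PDFs.
pages = [
    {"source": "doc_a.pdf", "page": 3, "text": "Ease of starting a business in Singapore"},
    {"source": "doc_b.pdf", "page": 7, "text": "Business registration fees"},
    {"source": "doc_c.pdf", "page": 1, "text": "Public park opening hours"},
]

keywords = split_query("ease of doing business in Singapore")
results = multi_query_search("ease of doing business in Singapore", pages)
```

Each result carries the source file and page number, so the answer can point back to the exact page it came from.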

The Tech Stack:
Ingestion: Python scripts using PyPDF2 to parse various PDF formats.
Embeddings: Hugging Face BGE-M3 (1024 dimensions).
Vector Database: FAISS for similarity search.
Orchestration: LangChain.
Backend: Flask.
Frontend: React and Framer.
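
For readers new to vector search, the sketch below shows the idea behind the similarity search step using plain Python. It is a brute-force stand-in, not the FAISS API, and it uses made-up 3-dimensional vectors instead of the 1024-dimensional BGE-M3 embeddings.

```python
import math


def normalize(vec: list[float]) -> list[float]:
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def top_k(query_vec: list[float], doc_vecs: list[list[float]], k: int = 1) -> list[tuple]:
    """Return (index, score) for the k document vectors closest to the query."""
    q = normalize(query_vec)
    scored = []
    for idx, vec in enumerate(doc_vecs):
        d = normalize(vec)
        scored.append((sum(a * b for a, b in zip(q, d)), idx))
    # Highest similarity first; ties broken by the lower index.
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [(idx, score) for score, idx in scored[:k]]


# Toy vectors; real ones would be embeddings of text chunks.
doc_vecs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.7, 0.7, 0.0]]
best = top_k([0.9, 0.1, 0.0], doc_vecs, k=2)
```

A vector index does the same ranking, only much faster over tens of thousands of chunks.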

The RAG pipeline works as follows:
Chunking: The source text is divided into chunks of 150 tokens with an overlap of 50 tokens to maintain context across boundaries.
Retrieval: When a user asks a question (e.g., "What is the policy on HDB grants?"), the system queries the vector database for the top k chunks (k=1).
Synthesis: The system adds these chunks to the LLM prompt, and the LLM produces the final response with citation information.

Why did I say LLMs:- I wanted the system to be as crash-resistant as possible. Gemini is my primary LLM, but if it fails because of API request limits or any other reason, the backup model (Arcee AI Trinity Large) handles the request.
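
The chunking step can be sketched as a sliding window. This is a minimal illustration that assumes the tokens are already available as a list; the tokenizer the project actually uses is not stated, and the function name is hypothetical.

```python
def chunk_tokens(tokens: list[str], size: int = 150, overlap: int = 50) -> list[list[str]]:
    """Split tokens into windows of `size` that share `overlap` tokens."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    step = size - overlap  # each window starts 100 tokens after the previous one
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + size])
        if start + size >= len(tokens):
            break  # the last window already reached the end of the text
    return chunks


# Dummy tokens standing in for a parsed PDF page.
tokens = [f"t{i}" for i in range(400)]
chunks = chunk_tokens(tokens)
```

With 400 tokens this gives four chunks, and the last 50 tokens of each chunk reappear at the start of the next, so a sentence cut at a boundary is still whole in one of them.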

Don't worry:- I have implemented different system instructions for different models, so the result stays a good-quality product whichever model answers.
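
A primary-plus-backup setup with per-model system instructions might be wired up like the sketch below. The model callables are stubs, not real Gemini or Arcee AI client calls, and the prompt texts and function names are assumptions for illustration.

```python
from typing import Callable

# Hypothetical per-model system instructions.
SYSTEM_PROMPTS = {
    "primary": "Answer from the provided passages and cite page numbers.",
    "backup": "Use only the passages given. Keep the answer short and cite pages.",
}

ModelCall = Callable[[str, str], str]


def answer_with_fallback(question: str, models: list[tuple[str, ModelCall]]) -> tuple[str, str]:
    """Try each model in order and return (model name, reply) from the first success."""
    errors = []
    for name, call in models:
        try:
            return name, call(SYSTEM_PROMPTS[name], question)
        except Exception as exc:  # e.g. rate limits or network failures
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all models failed: " + "; ".join(errors))


def flaky_primary(system: str, question: str) -> str:
    """Stub that simulates the primary model hitting a rate limit."""
    raise TimeoutError("rate limit")


def stub_backup(system: str, question: str) -> str:
    """Stub that simulates the backup model answering."""
    return f"[backup] {question}"


used, reply = answer_with_fallback(
    "What is the policy on HDB grants?",
    [("primary", flaky_primary), ("backup", stub_backup)],
)
```

Keeping the system instructions in a dictionary keyed by model name means each model gets its own wording without changing the fallback logic.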

Current Challenges: I am working on optimizing the ranking strategy of the RAG architecture. I would value insights from anyone who has encountered RAG returning irrelevant documents.

Feedback is the backbone of improving a platform, so it is most welcome.

Repository:- https://github.com/adityaprasad-sudo/Explore-Singapore

Show HN: I created a Mars colony RPG based on Kim Stanley Robinson's Mars books

https://underhillgame.com/
79•ariaalam•4h ago•33 comments

Show HN: It took 4 years to sell my startup. I wrote a book about it

https://derekyan.com/ma-book/
157•zhyan7109•4d ago•44 comments

Show HN: Filtron – parse human-friendly filter strings into structured queries

https://filtron.dev/
2•jbergstroem•42m ago•0 comments

Show HN: Envon - cross-shell CLI for activating Python virtual environments

https://github.com/Userfrom1995/envon
2•userfrom1995•49m ago•0 comments

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

https://github.com/localgpt-app/localgpt
308•yi_wang•19h ago•143 comments

Show HN: SendRec – Self-hosted async video for EU data sovereignty

https://github.com/sendrec/sendrec
2•alexneamtu•2h ago•1 comments

Show HN: Kekkai – Interactive security triage in the terminal

5•kirumachi•5d ago•3 comments

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

https://github.com/Momciloo/fun-with-clip-path
134•momciloo•1d ago•35 comments

Show HN: Fine-tuned Qwen2.5-7B on 100 films for probabilistic story graphs

https://cinegraphs.ai/
83•graphpilled•9h ago•20 comments

Show HN: Hivewire – A news feed where you control your algorithm weights

https://hivewire.news
3•bchaps•3h ago•2 comments

Show HN: Bhagavan – a calm, approachable app for exploring Hinduism

https://www.bhagavan.io
4•AkhilSonthi•4h ago•0 comments

Show HN: Readability API – Unrender

https://unrender.page/
2•zintus•6h ago•1 comments

Show HN: A luma dependent chroma compression algorithm (image compression)

https://www.bitsnbites.eu/a-spatial-domain-variable-block-size-luma-dependent-chroma-compression-...
56•mbitsnbites•4d ago•9 comments

Show HN: Claude Code skill that uses Codex as MCP server for code review

https://github.com/pauhu/claude-codex-review
3•pauhu•7h ago•0 comments

Show HN: Smooth CLI – Token-efficient browser for AI agents

https://docs.smooth.sh/cli/overview
104•antves•3d ago•71 comments

Show HN: Kappal – CLI to Run Docker Compose YML on Kubernetes for Local Dev

https://github.com/sandys/kappal
48•sandGorgon•3d ago•28 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
319•isitcontent•1d ago•40 comments

Show HN: R3forth, a ColorForth-inspired language with a tiny VM

https://github.com/phreda4/r3
89•phreda4•1d ago•17 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
389•eljojo•2d ago•223 comments

Show HN: I spent 4 years building a UI design tool with only the features I use

https://vecti.com
381•vecti•2d ago•175 comments

Show HN: A sandboxed execution environment for AI agents via WASM

https://github.com/Parassharmaa/agent-sandbox
2•paraaz•8h ago•0 comments

Show HN: Real-time path tracing of medical CT volumes in the browser via WebGPU

https://grenzwert.net/
4•MickGorobets•9h ago•1 comments

Show HN: Portview, A diagnostic-first port viewer for Linux (~930 KB, zero deps)

https://github.com/Mapika/portview
5•Mapika•9h ago•0 comments

Show HN: Google Maps but for your repo (Open Source)

https://github.com/zacharykapank/repomap
2•zacharykapank•9h ago•0 comments

Show HN: Artifact Keeper – Open-Source Artifactory/Nexus Alternative in Rust

https://github.com/artifact-keeper
161•bsgeraci•2d ago•66 comments

Show HN: Elysia JIT "Compiler", why it's one of the fastest JavaScript framework

https://elysiajs.com/internal/jit-compiler
3•saltyaom•10h ago•0 comments

Show HN: BioTradingArena – Benchmark for LLMs to predict biotech stock movements

https://www.biotradingarena.com/hn
33•dchu17•2d ago•14 comments

Show HN: HalalCodeCheck – Verify food ingredients offline

https://halalcodecheck.com/
3•pythonbase•11h ago•0 comments

Show HN: FlashMesh – An encrypted file mesh across Google Drive and Dropbox

https://flashmesh.netlify.app
2•Elevanix•12h ago•0 comments

Show HN: Slack CLI for Agents

https://github.com/stablyai/agent-slack
59•nwparker•2d ago•13 comments