frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I made it fast and easy to launch your own RAG-powered AI chatbots

https://www.chatrag.ai
1•carlos_marcial•20m ago

Comments

carlos_marcial•18m ago
I built the tech stack behind ChatRAG to handle the increasing number of clients I started getting about a year ago who needed Retrieval Augmented Generation (RAG) powered chatbots.

After a lot of trial and error, I settled on this tech stack for ChatRAG:

Frontend

- Next.js 16 (App Router) Latest React framework with server components and streaming

- React 19 + React Compiler: Automatic memoization, no more useMemo/useCallback hell

- Zustand: Lightweight state management (3kb vs Redux bloat)

- Tailwind CSS + Framer Motion: Styling + buttery animations

- Embed a chat widget version of your RAG chatbot on any web page, apart from creating a ChatGPT or Claude looking web UI

AI / LLM Layer

- Vercel AI SDK 5 – Unified streaming interface for all providers

- OpenRouter – Single API for Claude, GPT-4, DeepSeek, Gemini, etc.

- MCP (Model Context Protocol) – Tool use and function calling across models

RAG Pipeline

- Text chunking → documents split for optimal retrieval

- OpenAI embeddings (1536 dim vectors) – Semantic search representation

- pgvector with HNSW indexes – Fast approximate nearest neighbor search directly in Postgres

Database & Auth

- Supabase (PostgreSQL) – Database, auth, realtime, storage in one

- GitHub & Google OAuth via Supabase – Third party sign in providers managed by Supabase

- Row Level Security – Multi-tenant data isolation at the DB level

Multi-Modal Generation

- Use Fal.ai or Replicate.ai API keys for generating image, video and 3D assets inside of your RAG chatbot

Integrations

- WhatsApp via Baileys – Chat with your RAG from WhatsApp

- Stripe / Polar – Payments and subscriptions

Infra

- Fly.io / Koyeb – Edge deployment for WhatsApp workers

- Vercel – Frontend hosting with edge functions

My special sauce: pgvector HNSW indexes (m=64, ef_construction=200) give you sub-100ms semantic search without leaving Postgres. No Pinecone/Weaviate vendor lock-in.

Single-tenant vs Multi-tenant RAG setups: Why not both?

ChatRAG supports both deployment modes depending on your use case:

Single-tenant

- One knowledge base → many users

- Ideal for celebrity/expert AI clones or brand-specific agents

- e.g., "Tony Robbins AI chatbot" or "Deepak Chopra AI"

- All users interact with the same dataset and the same personality layer

Multi-tenant

- Users have workspace/project isolation — each with its own knowledge base, project-based system prompt and settings

- Perfect for SaaS products or platform builders that want to offer AI chatbots to their customers

- Every customer gets private data and their own RAG

My long term vision is to keep evolving ChatRAG so I can eventually release a fully open-source version for everyone to build with.

Show HN: The AI homepage – A news homepage for AI related articles

https://www.theaihomepage.com/
1•maverick98•1m ago•0 comments

Game Theory Explains How Algorithms Can Drive Up Prices

https://www.wired.com/story/game-theory-explains-how-algorithms-can-drive-up-prices/
1•quapster•13m ago•0 comments

Microsoft says it will run Windows 11 File Explorer in background to load faster

https://www.windowslatest.com/2025/11/22/microsoft-says-it-will-always-run-windows-11-file-explor...
1•tosh•14m ago•0 comments

US Department of Transportation unveils first female-modeled crash test dummy

https://www.theguardian.com/world/2025/nov/21/transportation-department-first-female-crash-dummy
1•binning•14m ago•0 comments

The Many – and Contradictory – Histories of Mt. Rushmore

https://lithub.com/on-the-many-and-contradictory-histories-of-mt-rushmore/
1•bryanrasmussen•14m ago•0 comments

The battle between science and postmodernism: from Boyle's air pump to Dawkins

https://susanpickard.substack.com/p/the-battle-between-science-and-postmodernism
1•binning•18m ago•0 comments

Like the New Yorker but Better

https://thelambsconduitreview.neocities.org
1•rishirulzeworld•19m ago•0 comments

South Africa declares gender-based violence and femicide a national disaster

https://www.theguardian.com/society/2025/nov/22/south-africa-g20-protests-gender-based-violence-n...
2•binning•20m ago•0 comments

Show HN: I made it fast and easy to launch your own RAG-powered AI chatbots

https://www.chatrag.ai
1•carlos_marcial•20m ago•1 comments

Ask HN: Why GenAI is immoral but vibe coding is ok?

1•jb_briant•25m ago•2 comments

Russia's drone revolution heaps pressure on Ukrainian defenses

https://www.cnn.com/2025/11/22/europe/russia-rubicon-unit-drone-revolution-ukraine-intl-cmd
2•rguiscard•26m ago•0 comments

Should you dry someone else's abandoned laundry?

https://www.nytimes.com/interactive/2025/11/21/magazine/wet-laundry-hodgman-quiz.html
1•bookofjoe•29m ago•1 comments

Lando Norris, Oscar Piastri Disqualified from Las Vegas Grand Prix

https://www.formula1.com/en/latest/article/norris-and-piastri-disqualified-from-las-vegas-gp-as-m...
1•tosh•31m ago•0 comments

Show HN: A little browser game I made after seeing a viral dance video

https://immersivewarmup.com/
1•davidyu37•31m ago•0 comments

Alcove: An RSS Reader for the Open Web

https://bearblog.stevedylan.dev/alcove-an-rss-reader-for-the-open-web/
2•stevedsimkins•34m ago•0 comments

Let's focus on things we can control

https://ramezanpour.net/post/2025/11/22/worry-about-things-you-can-change
1•ramezanpour•36m ago•0 comments

Customer Service Pelita Air

1•Fransiskoelani•39m ago•11 comments

Ask HN: Do LLMs make you feel like you've lost your edge?

1•galfarragem•39m ago•2 comments

Gnome 49 Introduces Donation Request Notification

https://forum.zorin.com/t/gnome-49-introduces-donation-request-notification/54035
1•shaunpud•42m ago•0 comments

Silicon Valley startups: being evil, again and again

https://notesfrombelow.org/article/silicon-valley-startups-doing-evil-again-and-again
2•iSpiderman•42m ago•0 comments

Gum disease and tooth decay hit men harder – and now we know why

https://newatlas.com/health-wellbeing/gum-disease-inflammasome-males/
1•breve•43m ago•1 comments

Gnome Plans New Donation Reminder Pop-Up in Upcoming Release

https://linuxiac.com/gnome-plans-new-donation-reminder-pop-up-in-upcoming-release/
2•shaunpud•45m ago•0 comments

BeatsToRapOn – A music-only marketplace and AI tools for artists

https://beatstorapon.com
1•Chet-Fitzgerald•46m ago•1 comments

Olmo 3 is a fully open LLM

https://simonwillison.net/2025/Nov/22/olmo-3/
3•lumpa•47m ago•1 comments

Show HN: Video Notes for YouTube

https://chromewebstore.google.com/detail/video-notes/phgnkidiglnijkpmmdjcgdkekfoelcom
1•prameshbajra•50m ago•0 comments

Toad Attacks via Entra Guest Invites

https://taggart-tech.com/entra-toad/
1•computersuck•55m ago•3 comments

A New Bridge Links the Math of Infinity to Computer Science

https://www.quantamagazine.org/a-new-bridge-links-the-strange-math-of-infinity-to-computer-scienc...
1•pseudolus•57m ago•0 comments

Hedging (1942) [video]

https://www.youtube.com/watch?v=WoprVhpOKIk
1•pbalau•57m ago•0 comments

A Startup's Bid to Dim the Sun

https://www.newyorker.com/news/the-lede/a-startups-bid-to-dim-the-sun
1•pseudolus•58m ago•2 comments

Ask HN: Looking for a donated PC or laptop for freelance work

3•PikoBeam•59m ago•0 comments