frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: RAG chatbot using Qwen3 with custom thinking UI

4•Arindam1729•8mo ago
Hey Folks,

I've been playing around with the new Qwen3 models recently (from Alibaba). They’ve been leading a bunch of benchmarks recently, especially in coding, math, reasoning tasks and I wanted to see how they work in a Retrieval-Augmented Generation (RAG) setup. So I decided to build a basic RAG chatbot on top of Qwen3 using LlamaIndex.

Here’s the setup:

Model: Qwen3-235B-A22B (the flagship model via Nebius Ai Studio)

RAG Framework: LlamaIndex

Docs: Load → transform → create a VectorStoreIndex using LlamaIndex

Storage: Works with any vector store (I used the default for quick prototyping)

UI: Streamlit (It's the easiest way to add UI for me)

One small challenge I ran into was handling the <think> </think> tags that Qwen models sometimes generate when reasoning internally. Instead of just dropping or filtering them, I thought it might be cool to actually show what the model is “thinking”.

So I added a separate UI block in Streamlit to render this. It actually makes it feel more transparent, like you’re watching it work through the problem statement/query.

Nothing fancy with the UI, just something quick to visualize input, output, and internal thought process. The whole thing is modular, so you can swap out components pretty easily (e.g., plug in another model or change the vector store).

Here’s the full code if anyone wants to try or build on top of it: GitHub: https://github.com/Arindam200/awesome-ai-apps/tree/main/rag_...

And I did a short walkthrough/demo here: YouTube: https://www.youtube.com/watch?v=L7P8RcKcdzI

Would love to hear if anyone else is using Qwen3 or doing something fun with LlamaIndex or RAG stacks. What’s worked for you?

Comments

tomasen9987•8mo ago
I have tried Gemma before haven't got a chance to try Qwen3 yet.

What do you think is difference between Gemma and Qwen when it comes to RAG performance?

Arindam1729•8mo ago
I haven't tried comparing both, but Qwen's reasoning quality is better.
cenktekin•8mo ago
Thanks for sharing this! I've also been using Qwen3 recently and I'm really impressed with its performance, especially in terms of speed and consistency. The idea of visualizing the tags is brilliant! I'll definitely check out your code. What kind of tasks are you primarily using Qwen3 for?

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

https://github.com/localgpt-app/localgpt
197•yi_wang•7h ago•74 comments

Show HN: Analyzing 9 years of HN side projects that reached $500/month

2•haileyzhou•6m ago•0 comments

Show HN: Free Bank Statement Analyzer to Find Spending Leaks and Save Money

https://www.whereismymoneygo.com/
2•raleobob•27m ago•1 comments

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

https://github.com/Momciloo/fun-with-clip-path
97•momciloo•14h ago•22 comments

Show HN: OpenCode for Meetings

https://getscripta.app
2•whitemyrat•1h ago•1 comments

Show HN: A luma dependent chroma compression algorithm (image compression)

https://www.bitsnbites.eu/a-spatial-domain-variable-block-size-luma-dependent-chroma-compression-...
42•mbitsnbites•3d ago•6 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
307•isitcontent•1d ago•39 comments

Show HN: Open-source AI powered Kubernetes IDE

https://github.com/agentkube/agentkube
2•saiyampathak•2h ago•0 comments

Show HN: Lucid – Use LLM hallucination to generate verified software specs

https://github.com/gtsbahamas/hallucination-reversing-system
2•tywells•2h ago•0 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
375•eljojo•1d ago•218 comments

Show HN: I spent 4 years building a UI design tool with only the features I use

https://vecti.com
376•vecti•1d ago•173 comments

Show HN: Seedance 2.0 – The Most Powerful AI Video Generator

https://seedance.ai/
2•bigbromaker•4h ago•0 comments

Show HN: Kappal – CLI to Run Docker Compose YML on Kubernetes for Local Dev

https://github.com/sandys/kappal
45•sandGorgon•3d ago•27 comments

Show HN: Smooth CLI – Token-efficient browser for AI agents

https://docs.smooth.sh/cli/overview
100•antves•2d ago•70 comments

Show HN: High-performance bidirectional list for React, React Native, and Vue

https://suhaotian.github.io/broad-infinite-list/
2•jeremy_su•5h ago•0 comments

Show HN: R3forth, a ColorForth-inspired language with a tiny VM

https://github.com/phreda4/r3
87•phreda4•1d ago•17 comments

Show HN: Artifact Keeper – Open-Source Artifactory/Nexus Alternative in Rust

https://github.com/artifact-keeper
160•bsgeraci•2d ago•65 comments

Show HN: BioTradingArena – Benchmark for LLMs to predict biotech stock movements

https://www.biotradingarena.com/hn
32•dchu17•1d ago•13 comments

Show HN: Slack CLI for Agents

https://github.com/stablyai/agent-slack
56•nwparker•2d ago•13 comments

Show HN: Django-rclone: Database and media backups for Django, powered by rclone

https://github.com/kjnez/django-rclone
2•cui•9h ago•1 comments

Show HN: Axiomeer – An open marketplace for AI agents

https://github.com/ujjwalredd/Axiomeer
12•ujjwalreddyks•5d ago•2 comments

Show HN: Gigacode – Use OpenCode's UI with Claude Code/Codex/Amp

https://github.com/rivet-dev/sandbox-agent/tree/main/gigacode
25•NathanFlurry•1d ago•11 comments

Show HN: Witnessd – Prove human authorship via hardware-bound jitter seals

https://github.com/writerslogic/witnessd
2•davidcondrey•9h ago•2 comments

Show HN: Micropolis/SimCity Clone in Emacs Lisp

https://github.com/vkazanov/elcity
174•vkazanov•2d ago•50 comments

Show HN: ARM64 Android Dev Kit

https://github.com/denuoweb/ARM64-ADK
18•denuoweb•2d ago•2 comments

Show HN: PalettePoint – AI color palette generator from text or images

https://palettepoint.com
2•latentio•11h ago•0 comments

Show HN: I built a <400ms latency voice agent that runs on a 4gb vram GTX 1650"

https://github.com/pheonix-delta/axiom-voice-agent
2•shubham-coder•13h ago•1 comments

Show HN: Stacky – certain block game clone

https://www.susmel.com/stacky/
3•Keyframe•14h ago•0 comments

Show HN: A toy compiler I built in high school (runs in browser)

https://vire-lang.web.app
3•xeouz•15h ago•1 comments

Show HN: Daily-updated database of malicious browser extensions

https://github.com/toborrm9/malicious_extension_sentry
15•toborrm9•1d ago•8 comments