Show HN: An offline document search engine for my university's messy PDFs

https://github.com/Yigtwxx/FiratUniversityChatbot
2•Yigtwx•2h ago

Comments

Yigtwx•2h ago
Hi HN,

I’m a 3rd-year software engineering student. Every semester, I waste hours trying to find specific rules, passing grades, or course details buried inside my university’s endless, poorly formatted PDFs. I just wanted a fast, completely offline way to search through them, so I built this.

Instead of jumping straight into a heavy LLM or RAG setup, I decided to keep it simple and lightweight. The backend is built with FastAPI. I used `pdfplumber` to extract the text from the PDFs (which is a nightmare on its own) and implemented BM25 for the core search engine.
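For readers who have not met BM25 before, here is a minimal sketch of the ranking idea in plain Python. It is not taken from the repository: the `BM25` class, the `tokenize` helper, the `k1=1.5` and `b=0.75` defaults, and the Lucene-style IDF variant are all assumptions chosen for illustration, and the input is assumed to be page text that has already been extracted.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Hypothetical helper: lowercase, then split on runs of word characters.
    return re.findall(r"\w+", text.lower())


class BM25:
    """Illustrative Okapi BM25 ranker over a small in-memory corpus."""

    def __init__(self, docs, k1=1.5, b=0.75):
        self.k1 = k1  # term-frequency saturation (assumed default)
        self.b = b    # strength of length normalization (assumed default)
        self.term_freqs = [Counter(tokenize(d)) for d in docs]
        self.doc_lens = [sum(tf.values()) for tf in self.term_freqs]
        # Fall back to 1.0 so an empty corpus never divides by zero.
        self.avgdl = (sum(self.doc_lens) / len(docs) if docs else 0.0) or 1.0
        # Document frequency: in how many documents each term appears.
        df = Counter()
        for tf in self.term_freqs:
            df.update(tf.keys())
        n = len(docs)
        # Lucene-style IDF, which stays non-negative for very common terms.
        self.idf = {
            term: math.log(1 + (n - freq + 0.5) / (freq + 0.5))
            for term, freq in df.items()
        }

    def score(self, query, idx):
        tf = self.term_freqs[idx]
        # Longer-than-average documents get a larger denominator.
        norm = self.k1 * (1 - self.b + self.b * self.doc_lens[idx] / self.avgdl)
        total = 0.0
        for term in tokenize(query):
            f = tf.get(term, 0)
            if f:
                total += self.idf[term] * f * (self.k1 + 1) / (f + norm)
        return total

    def search(self, query, top_k=5):
        # Return (score, document index) pairs, best first, dropping non-matches.
        scored = [(self.score(query, i), i) for i in range(len(self.term_freqs))]
        scored = [pair for pair in scored if pair[0] > 0]
        scored.sort(key=lambda pair: (-pair[0], pair[1]))
        return scored[:top_k]


docs = [
    "passing grade for the course is 60",
    "course registration deadlines",
    "library opening hours",
]
index = BM25(docs)
results = index.search("passing grade")
```

Everything is precomputed at index time, so a query only touches the term counts and IDF table. That is the main reason this kind of search can stay fast and offline without a model in the loop.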

It works completely offline and handles Turkish text surprisingly well for a pure retrieval system. It does exactly what I need without the latency, hardware requirements, or hallucinations of running local AI models.
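One Turkish-specific detail that tends to matter for keyword retrieval is case folding of the dotted and dotless I. The sketch below shows one common workaround. It is an assumption about how such a tokenizer could look, not a description of what the project does, and `turkish_lower` and `tokenize_tr` are hypothetical names.

```python
import re


def turkish_lower(text):
    # Python's default lower() maps "I" to "i" and "\u0130" (capital dotted I)
    # to "i" plus a combining dot, neither of which matches Turkish spelling.
    # Map both capitals explicitly before the generic lower().
    text = text.replace("\u0130", "i")   # capital dotted I -> i
    text = text.replace("I", "\u0131")   # capital dotless I -> dotless i
    return text.lower()


def tokenize_tr(text):
    # Split on runs of Unicode word characters after Turkish-aware lowercasing.
    return re.findall(r"\w+", turkish_lower(text))


tokens = tokenize_tr("\u0130STANBUL I\u015eIK")
```

The trade-off is that English words written in capitals inside the same PDFs are folded with Turkish rules too, so "INFO" would become "ınfo". Whether that is acceptable depends on how mixed the corpus is.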

It’s my first time properly using BM25 for a real-world problem, so the codebase might still be a bit rough.

If anyone has war stories or tips on extracting clean text from terribly formatted academic PDFs, or ways to improve BM25 search relevance for a specific language without bloating the system, I’d love to hear them!

Show HN: Claude-File-Recovery, recover files from your ~/.claude sessions

https://github.com/hjtenklooster/claude-file-recovery
39•rikk3rt•8h ago•14 comments

Show HN: I built a self-hosted course platform in Clojure

https://clojure.stream
32•jacekschae•1d ago•4 comments

Show HN: RetroTick – Run classic Windows EXEs in the browser

https://retrotick.com/
173•lqs_•12h ago•52 comments

Show HN: Accept.md now supports SvelteKit – return Markdown from any page

https://www.accept.md/
2•hval•34m ago•0 comments

Show HN: I made a website to write online math as fast as paper

https://www.scratchpad-math.com/
3•maxtobiasen•52m ago•2 comments

Show HN: Badge that shows how well your codebase fits in an LLM's context window

https://github.com/qwibitai/nanoclaw/tree/main/repo-tokens
79•jimminyx•9h ago•40 comments

Show HN: Unfucked – version every change between commits - local-first

https://www.unfudged.io/
62•cyrusradfar•1d ago•37 comments

Show HN: OpenTimelineEngine – Shared local memory for Claude Code and codex

https://github.com/JOELJOSEPHCHALAKUDY/open-timeline-engine
5•joeljoseph_•1h ago•3 comments

Show HN: The Silent Filter, The Delegation of Synthesis and Linguistic Drift

https://juanpabloaj.com/2026/02/27/the-silent-filter/
5•juanpabloaj•3h ago•0 comments

Show HN: Notemac++ – A Notepad++-inspired code editor for macOS and the web

https://github.com/sergioadevita/notemac-plus-plus
2•sergioadevita•2h ago•2 comments

Show HN: MCP server for AI compliance documentation (Colorado AI Act)

https://github.com/jeremytuite/aop-mcp-server
3•jeremytuite•2h ago•0 comments

Show HN: Host a Real time collaborative spreadsheet right from your pocket

https://gitlab.com/here_forawhile/spreadsheet
3•smalltorch•2h ago•0 comments

Show HN: An offline document search engine for my university's messy PDFs

https://github.com/Yigtwxx/FiratUniversityChatbot
2•Yigtwx•2h ago•1 comments

Show HN: Your Expensive Lawyer Is Making Mistakes

https://findthefuckup.com/
12•jaggednad•3h ago•3 comments

Show HN: AgentGames.co – my game creator

https://agentgames.co
3•imightbekyle•3h ago•0 comments

Show HN: Hacker Smacker – Spot great (and terrible) HN commenters at a glance

https://hackersmacker.org
138•conesus•3d ago•162 comments

Show HN: I Built a $1 Escalating Internet Billboard – Called Space

https://www.spacefilled.com/
3•clarkage•4h ago•3 comments

Show HN: Mac hardware toys – pipe your accelerometer into your keyboard lights

https://github.com/pirate/mac-hardware-toys
3•nikisweeting•4h ago•1 comments

Show HN: Linex – A daily challenge: placing pieces on a board that fights back

https://www.playlinex.com/
79•Humanista75•3d ago•34 comments

Show HN: Deff – Side-by-side Git diff review in your terminal

https://github.com/flamestro/deff
116•flamestro•1d ago•64 comments

Show HN: SignalCend – API that resolves conflicting IoT device state in 47ms

https://signalcend.com/
4•TylerArrows•7h ago•0 comments

Show HN: Interactive Resume/CV Game

https://breezko.dev
3•breezk0•5h ago•1 comments

Show HN: Respectify – A comment moderator that teaches people to argue better

https://respectify.org/
219•vintagedave•2d ago•229 comments

Show HN: Terminal Phone – E2EE Walkie Talkie from the Command Line

https://gitlab.com/here_forawhile/terminalphone
315•smalltorch•1d ago•80 comments

Show HN: PDF reader with interactive visualizations for research papers

https://zerodistract.com
7•vignesh_warar•7h ago•2 comments

Show HN: BananaOS, vibecoded operating system that boots on a 486 with ~11MB RAM

4•madamdo•7h ago•0 comments

Show HN: Goatpad

https://www.goatpad.xyz
8•martialg•7h ago•0 comments

Show HN: Beehive – Multi-Workspace Agent Orchestrator

https://storozhenko98.github.io/beehive/
46•mst98•3d ago•22 comments

Show HN: A real-time strategy game that AI agents can play

https://llmskirmish.com/
218•__cayenne__•2d ago•81 comments

Show HN: CodeLeash: framework for quality agent development, NOT an orchestrator

https://codeleash.dev/
11•cadamsdotcom•21h ago•4 comments