frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: I built a 2B-page search engine, independent of Google/Bing

6•Chief_Searcha•8h ago
Hi HN, For the last 18 months, I've been working solo on building a completely independent search engine from scratch. Today, I'm opening it up for beta testing and would love to get your feedback. The project powers two public sites from the same 2-billion-page index: Searcha.Page: A session-aware search engine that uses a persistent browser key (not a cookie) for better context. Seek.Ninja: A 100% stateless, privacy-first version with no identifiers at all. The entire stack is self-hosted on a single ~$4k bare-metal EPYC server in my laundry room (no cloud, no VC funding). The search pipeline is a hybrid model, using a traditional lexical index for the heavy lifting and lightweight LLMs for specific tasks like query expansion and re-ranking. It's an experiment in capital efficiency and digital sovereignty—proving you don't need Big Tech APIs to compete. I’m looking for feedback on search result relevance, speed, and the clarity of the privacy models. Please try it out and let me know what you think. Links: https://searcha.page https://seek.ninja Thanks, Ryan

Comments

mindctrl-org•8h ago
Hi, cool project. A little feedback, as I only spent a few seconds there. The search results need to populate considerably faster. The weird delay, animation, and layout shifting makes it all feel cheap.

The first results were good for the search I did. However, I went back and searched for something else entirely, and I got results similar to the first search. That seems bad, since they were unrelated searches.

Now I can't search anything because it's timing out.

Chief_Searcha•8h ago
Thank you, I will check for errors re the time out. This is just the type of feedback I am looking for. Regarding the overall speed, I have a plan but it won't be easy so it could be a few months to tackle the single biggest issue. Much appreciated!
FerkiHN•8h ago
Great work, especially since you did it yourself, you really put your heart and soul into the project and not in vain, it is truly unique and I appreciate it, good luck with further development, you can add local pages (like the most popular services) this will save mobile traffic.
Chief_Searcha•8h ago
Thank you, I appreciate it. For local pages I do have that planned, it will take a bit of time to make a native local index due to many problems such as businesses closing and not reporting it. This is one area that Google has such a strong advantage - user submitted content - and one I would need to find a way to automate. I have various plans and maybe one of them would work. The other option is to outsource it to an API, at least to buy time.
phillipseamore•7h ago
This is great. May I suggest you post a direct link to seek.ninja if this doesn't get the upvotes it deserves today.

Did you crawl yourself or using common crawl?

Chief_Searcha•4h ago
Thank you! Both, I use common crawl as the bulk, but it has long delays so I need to crawl myself too. I'm planning on upgrading my own crawling capabilities but not yet ready since it will require an investment. I also have plans to have a smarter crawl system which I may implement in the next half year.
ofalkaed•4h ago
Really appreciate that when searching for commodities/goods the results are not dominated by amazon/etsy/ebay sellers, very low on multiple results from a single domain in general. Solves my biggest gripe with search engines but have only done a few quick searches with seek.ninja so far.

The one thing that I would like to see is the "More results from <site>," links for sites with a lot of results. :site works so not a deal breaker, just a would be nice.

Show HN: PlutoFilter- A single-header, zero-allocation image filter library in C

https://github.com/sammycage/plutofilter
46•sammycage•3d ago•8 comments

Show HN: Easy alternative to giflib – header-only decoder in C

https://github.com/Ferki-git-creator/TurboStitchGIF-HeaderOnly-Fast-ZeroAllocation-PlatformIndependent-Embedded-C-GIF-Decoder
15•FerkiHN•13h ago•4 comments

Show HN: Improving search ranking with chess Elo scores

https://www.zeroentropy.dev/blog/improving-rag-with-elo-scores
182•ghita_•1d ago•63 comments

Show HN: FavBox is a local-firs browser extension for bookmark management

https://github.com/dd3v/favbox
3•dm_dd3v•4h ago•0 comments

Show HN: 0xDEAD//TYPE – A fast-paced typing shooter with retro vibes

https://0xdeadtype.theden.sh/
109•theden•4d ago•26 comments

Show HN: A browser-based accessibility checker that integrates into web projects

https://accented.dev
2•pomerantsev•5h ago•0 comments

Show HN: BloomSearch – Keyword search with hierarchical Bloom filters

https://github.com/danthegoodman1/bloomsearch
63•dangoodmanUT•4d ago•12 comments

Show HN: Object database for LLMs that persists across chats (MCP server)

https://dry.ai/mcp-object-database
6•kooshaazim•6h ago•2 comments

Show HN: A 'Choose Your Own Adventure' written in Emacs Org Mode

https://tendollaradventure.com/sample/
151•dskhatri•1d ago•24 comments

Show HN: Shoggoth Mini – A soft tentacle robot powered by GPT-4o and RL

https://www.matthieulc.com/posts/shoggoth-mini
584•cataPhil•2d ago•107 comments

Show HN: I built a cute focus timer where you can grow an infinite garden

https://www.growdoro.com/
4•dqnamo•7h ago•0 comments

Show HN: AI tool to remove backgrounds,change clothes,and animate product photos

https://getaicraft.com
2•SaaSified•7h ago•1 comments

Show HN: The HTML Maze – Escape an eerie labyrinth built with HTML pages

https://htmlmaze.com/
62•kyrylo•3d ago•18 comments

Show HN: Claude‑CMD – A CLI for managing Claude Code commands and workflows

https://github.com/kiliczsh/claude-cmd
4•kilic•7h ago•0 comments

Show HN: I Wrote a 680-Page Interactive Book on Data Structures and Algorithms

https://cartesian.app
16•EliasY•11h ago•3 comments

Show HN: I built this to talk Danish to my girlfriend – works with any language

https://menerdu.vercel.app/
201•lil_csom•4d ago•107 comments

Show HN: Detailed explanation and guide to understanding gene editing treatments

https://www.aditharun.com/p/understanding-the-science-behind
4•tinymagician•8h ago•0 comments

Show HN: I built a 2B-page search engine, independent of Google/Bing

6•Chief_Searcha•8h ago•7 comments

Show HN: Cobble – A hard daily word game

https://wilf.live/cobble/
25•wolfred•1d ago•17 comments

Show HN: An MCP server that gives LLMs temporal awareness and time calculation

https://github.com/jlumbroso/passage-of-time-mcp
87•lumbroso•1d ago•50 comments

Show HN: We made our own inference engine for Apple Silicon

https://github.com/trymirai/uzu
177•darkolorin•2d ago•45 comments

Show HN: Beyond Z²+C, Plot Any Fractal

https://www.juliascope.com/
99•akunzler•2d ago•26 comments

Show HN: DataRamen, a Fast SQL Explorer with Automatic Joins and Data Navigation

https://dataramen.xyz/
46•oleksandr_dem•1d ago•54 comments

Show HN: Conductor, a Mac app that lets you run a bunch of Claude Codes at once

https://conductor.build/
12•Charlieholtz•10h ago•11 comments

Show HN: LangWhich – a 30‑second daily challenge to recognize languages

https://langwhich.app
3•jdmelin•10h ago•0 comments

Show HN: WordPress Without PHP – Build Apps and CLI Tools in TypeScript

https://github.com/rnaga/wp-node
3•rnaga•10h ago•0 comments

Show HN: kiln – Git-native, decentralized secret management using age

https://kiln.sh/
12•pacmansyyu•10h ago•2 comments

Show HN: A directory of 800 free APIs, no auth required

https://freeapis.juheapi.com/apis
2•LeoWood42•10h ago•0 comments

Show HN: Timep – a next-gen profiler and flamegraph-generator for bash code

https://github.com/jkool702/timep
24•jkool702•2d ago•1 comments

Show HN: A Git(1) implementation written in Python

https://github.com/xqb64/legit
2•xqb64•11h ago•0 comments