There are different thresholds to "trusted": the highest that'd be useful to search would be just authoritative go-to sources for something, e.g. the official docs for an API or library. But that'd be perhaps too narrow: including stackoverflow links around that API or library usage would be fine. In other domains like product reviews, you'd include only sites that have a reputation for actually testing the products.
Use-cases: - Reliably good search results... Google et al are full of SEO spam now and soon will be full of AI slop - Better grounding for LLMs than current search grounding... even if the LLM doesn't hallucinate it can cite junk on the web - Better pre-training data... I actually don't understand how LLMs will themselves filter out their own slop from future pre-training runs
I'm not sure what form this should take if it doesn't exist yet. Maybe a github project or wiki curating links per domain (yahoo directory reinvented?), each of us curating our own bookmarks and sharing it (delicious reinvented?), something else?
k310•3h ago
Perhaps a "Hackerpedia" of archived articles, not just Algolia search, but organized in some {use your imagination} way.
TBH,when I search, I choose Wikipedia first, then as close to the originator as possible.
Everyone's search results seem heavily weighted by "Shit for sale". And just as an aside, ebay seems to offer every search term for sale, including Richard Feynmann and a slave ship.
zhyder•9m ago