frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Deep Learning Is Applied Topology

https://theahura.substack.com/p/deep-learning-is-applied-topology
210•theahura•3h ago•111 comments

Show HN: 90s.dev - game maker that runs on the web

https://90s.dev/blog/finally-releasing-90s-dev.html
111•90s_dev•2h ago•47 comments

27000 Dragons and 10'000 Lights: GPU-Driven Clustered Forward Renderer

https://logdahl.net/p/gpu-driven
44•logdahl•1h ago•13 comments

Robin: A multi-agent system for automating scientific discovery

https://arxiv.org/abs/2505.13400
25•nopinsight•1h ago•6 comments

Show HN: A Tiling Window Manager for Windows, Written in Janet

https://agent-kilo.github.io/jwno/
97•agentkilo•2h ago•23 comments

The Dawn of Nvidia's Technology

https://blog.dshr.org/2025/05/the-dawn-of-nvidias-technology.html
17•wmf•41m ago•2 comments

Ashby (YC W19) Is Hiring Engineering Managers

https://www.ashbyhq.com/careers?utm_source=hn&ashby_jid=933570bc-a3d6-4fcc-991d-dc399c53a58a
1•abhikp•44m ago

Show HN: Juvio – UV Kernel for Jupyter

https://github.com/OKUA1/juvio
27•okost1•1h ago•10 comments

The Fractured Entangled Representation Hypothesis

https://github.com/akarshkumar0101/fer
27•akarshkumar0101•1h ago•6 comments

OpenAI Codex Review

https://zackproser.com/blog/openai-codex-review
56•fragmede•3h ago•24 comments

Teachable Machine

https://teachablemachine.withgoogle.com/
22•tosh•1h ago•6 comments

Launch HN: Opusense (YC X25) – AI assistant for construction inspectors on site

16•rcody•2h ago•4 comments

The emoji problem (2022)

https://artofproblemsolving.com/community/c2532359h2760821_the_emoji_problem__part_i?srsltid=AfmBOor9TbMq_A7hGHSJGfoWaa2HNzducSYZu35d_LFlCSNLXpvt-pdS
246•mtsolitary•7h ago•37 comments

The Lisp in the Cellar: Dependent types that live upstairs [pdf]

https://zenodo.org/records/15424968
56•todsacerdoti•4h ago•8 comments

Show HN: Olelo Foil - NACA Airfoil Sim

https://foil.olelohonua.com/
7•rbrownmh•1h ago•4 comments

Show HN: Astra – a new js2exe compiler

https://github.com/astracompiler/cli
39•qwertycodepl•2h ago•17 comments

A simple search engine from scratch

https://bernsteinbear.com/blog/simple-search/
189•bertman•7h ago•36 comments

Making Video Games (Without an Engine) in 2025

https://noelberry.ca/posts/making_games_in_2025/
404•selvan•11h ago•176 comments

Google is quietly giving Amazon a leg up in digital book sales

https://www.washingtonpost.com/technology/2025/05/16/google-amazon-ebooks-apps/
43•bookofjoe•3d ago•17 comments

llm-d, Kubernetes native distributed inference

https://llm-d.ai/blog/llm-d-announce
72•smarterclayton•5h ago•13 comments

Compiling OCaml to the TI-84 CE Calculator

https://farlow.dev/2025/05/17/ocaml-on-calculator
72•farlow•2d ago•3 comments

Production tests: a guidebook for better systems and more sleep

https://martincapodici.com/2025/05/13/production-tests-a-guidebook-for-better-systems-and-more-sleep/
18•mcapodici•3d ago•0 comments

DDoSecrets publishes 410 GB of heap dumps, hacked from TeleMessage

https://micahflee.com/ddosecrets-publishes-410-gb-of-heap-dumps-hacked-from-telemessages-archive-server/
590•micahflee•16h ago•165 comments

Have I Been Pwned 2.0

https://www.troyhunt.com/have-i-been-pwned-2-0-is-now-live/
780•LorenDB•20h ago•257 comments

The Last Letter

https://aeon.co/essays/how-the-last-letters-of-the-condemned-can-teach-us-how-to-live
8•HR01•35m ago•1 comments

Hypervisor as a Library

https://seiya.me/blog/hypervisor-as-a-library
20•ingve•11h ago•1 comments

Show HN: Text to 3D simulation on a map (does history pretty well)

https://mused.com/map/
37•lukehollis•6h ago•32 comments

Jules: An Asynchronous Coding Agent

https://jules.google/
469•travisennis•20h ago•192 comments

Finland announces migration of its rail network to international gauge

https://www.trenvista.net/en/news/rnhs/finland-migration-standard-gauge/
380•axelfontaine•10h ago•339 comments

Show HN: JavaFactory – IntelliJ plugin to generate Java code

https://github.com/JavaFactoryPluginDev/javafactory-plugin
35•javafactory•6h ago•10 comments
Open in hackernews

A simple search engine from scratch

https://bernsteinbear.com/blog/simple-search/
189•bertman•7h ago

Comments

franczesko•6h ago
On the topic of search engines, I really liked classes by David Evans. The task was also building a simple search engine from scratch. It's really for beginners, as the emphasis is on coding in general, but I've found it to be very approachable.

https://www.cs.virginia.edu/~evans/courses/

marginalia_nu•3h ago
The SeIRP-book, free online as a PDF, is also a fantastic resource on traditional search engines and information retrieval in general.

[1] https://ciir.cs.umass.edu/irbook/

franczesko•2h ago
Due to dead links, this is more appropriate url:

https://www.cs.virginia.edu/~evans/courses/cs101/

StefanBatory•7m ago
Server not found. Did HN gave it hug of death?
ktallett•4h ago
I always wonder if the days of search engines for specific topics could return. With LLM's providing less than accurate results in some areas, and Google, bing, etc being taken over by adverts or well organised SEO, there feels like a place for accurate, specialised search.
datadrivenangel•4h ago
The curation of an index of resources is what's needed for niche search
dcist•4h ago
WestLaw and Lexis Nexis provide this for legal search, but quite frankly, these services are subpar. It's amazing that these two companies rake in hundreds of millions but they are both slower than Google, Bing, Yandex, or any LLM service (ChatGPT, Claude, Gemini, etc.) while scouring a universe of text that is orders of magnitude smaller. The user experience is also terrible (you have to login and specify a client each and every time you attempt to use the service and both services log you out after a short -- in my opinion -- period of inactivity, creating friction and needless annoyance to the user). There's an opportunity there.
ktallett•4h ago
I haven't personally used the mentioned services as they aren't in my field, however what is the accuracy of their results? Are they double checked? I don't find LLMs particularly accurate in my field (that's being kind), if anything I find they make up sources that simply don't exist.

I mean poor UX has no excuse but slow speed can be reasoned if it makes the quality of the service better.

ordersofmag•4h ago
Here’s a place to start if you want to go down the rabbit hole of how search at places like this is approached. https://haystackconf.com/us2022/talk-12/

https://www.youtube.com/watch?v=9vCMFIJRiKk

ahi•3h ago
LN and Westlaw's real service is their ubiquity. Every law student has access to it and every firm expects proficiency. While they generally suck, the last time I used it (looong time ago), their boolean search was quite nice. That kind of text search has mostly been replaced by non-deterministic black boxes which aren't great for legal research.
piker•2h ago
You forgot to mention their claim of copyright over the bulk of, e.g. obscure state case law.
ehecatl42•1h ago
So, you have to pay to access the law that you are subject to?
piker•9m ago
If you want it digitized, yes, odd as that seems. You can go find individual prints of it or perhaps digital copies of opinions elsewhere, but those are also technically copyrighted in a lot of cases too.
throwup238•51m ago
They've also got the Microsoft effect going on. Usually at least one of their products like their personal information aggregator used for locating people (like when serving lawsuits) is mandatory for a firm so it's just easier for them bundle everything else in.
raydenvm•3h ago
Which is not scalable, right?
cosmicgadget•1h ago
It's scalable if you are okay with not searching exhaustively.
cosmicgadget•1h ago
My hope is that content self-indexes so instead curation it just has to be aggregated.
fanwood•4h ago
I already directly search on Wikipedia for most topics (with a search shortcut on URL bar)
ktallett•4h ago
Wikipedia is useful up to a point for sure. I feel whether it could be a expansion of Wikipedia in it's current use case, but for emerging research and niche topics it can sometimes be less useful.
wolfgang42•58m ago
Yeah, the (relative) rise of Kagi and Marginalia show that from a technical perspective, this is within the grasp of a dedicated hobbyist.[1] If Google continues their current trajectory, and overwhelming numbers of AI crawlers don’t cause an unsurmountable rise in CAPTCHA pages, I hope to see an upsurgence of niche search engines that focus on some specialty small enough that one or a few people can curate the content and produce a much better experience than the current crop of general Web search engines.

Self-plug: I run such a search engine (for programmers) in my living room, at <https://search.feep.dev/>. I don’t spend a ton of time maintaining it, so I’m interested to see what someone really dedicated could do.

[1] I wrote a 2004-vs-2014 comparison, and things have only gotten better since then: https://search.feep.dev/blog/post/2022-07-23-write-your-own

sp0rk•3h ago
The SVG equation is very difficult to read if you're using a dark OS theme because the blog uses the OS preference for dark/light theme (and doesn't seem to give an option to change it manually, either.)
tekknolagi•3h ago
Fixed, I think? Let me know
DylanSp•12m ago
Works now (I noticed the same issue).
cosmicgadget•2h ago
This was a really nice read. Now I have no excuse not to upgrade my blog search. I do feel that I'll have a ton of long tail words like 'prank'.
snowstormsun•2h ago
Nice idea, but this approach does not handle out of vocabulary words well which is one major motivation for using a vector-based search. It might not perform significantly better compared to lexical matching like tf-idf or BM25, and being slower because of linear complexity. But cool regardless.
netdevphoenix•2h ago
It is supposed to be a simple search engine. Keyword: simple.

As long as it does what it is meant to, as a simple search engine, it seems fine

snowstormsun•2h ago
Using tfidf or bm25 would actually be simpler than a vector search.

I understand this is just for fun, just wanted to point that out.

haasisnoah•1h ago
How would you handle those in wordvec?

And isn’t a big advantage that synonyms are handled correctly. This implementation still has that advantage.

swyx•1h ago
this embeds words with word2vec, which is like 10 years old. at least use BERT or sentencetransformers :)
kaycebasques•41m ago
> The idea behind the search engine is to embed each of my posts into this domain by adding up the embeddings for the words in the post.

Ah, OK! I never really grokked how to use word-level embeddings. Makes more sense now.

skarz•13m ago
Is 'grokked' a common verb now? I had never even heard the word until Musk's AI.
StefanBatory•6m ago
It was a word before, as far as I remember. Saw it a few times here.
skarz•3m ago
What does it even mean?
kaycebasques•3m ago
A common verb "now"??

> Grok (/ˈɡrɒk/) is a neologism coined by the American writer Robert A. Heinlein for his 1961 science fiction novel Stranger in a Strange Land. While the Oxford English Dictionary summarizes the meaning of grok as "to understand intuitively or by empathy, to establish rapport with" and "to empathize or communicate sympathetically (with); also, to experience enjoyment",[1] Heinlein's concept is far more nuanced, with critic Istvan Csicsery-Ronay Jr. observing that "the book's major theme can be seen as an extended definition of the term."[2] The concept of grok garnered significant critical scrutiny in the years after the book's initial publication. The term and aspects of the underlying concept have become part of communities such as computer science.

https://en.wikipedia.org/wiki/Grok

skarz•50s ago
Yes, "now." According to Google Trends[0] there was little to no search interest in the term until December 2023.

[0] https://trends.google.com/trends/explore?date=all&geo=US&q=g...

kevinsync•2m ago
I never knew the etymology[0] but always knew the word for as long as I've been into computing (90's) .. apparently it's from the 1960's from a Heinlein novel!

[0] - https://en.wikipedia.org/wiki/Grok