frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•9mo ago

Comments

yxhuvud•9mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•9mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•9mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
burntsushi•9mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, dependening on the workload.
kayodelycaon•9mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.

DmitryOlshansky•9mo ago
I wonder how std.regex of dlang would fare in such test. Sadly due to a tiny bit of D’s GC use it’s hard to provide as a library for other languages. If there is an interest I might take it through the tests.

Sizing chaos

https://pudding.cool/2026/02/womens-sizing/
389•zdw•7h ago•212 comments

27-year-old Apple iBooks can connect to Wi-Fi and download official updates

https://old.reddit.com/r/MacOS/comments/1r8900z/macos_which_officially_supports_27_year_old/
237•surprisetalk•8h ago•114 comments

Anthropic officially bans using subscription auth for third party use

https://code.claude.com/docs/en/legal-and-compliance
138•theahura•2h ago•152 comments

15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern

https://nicolasdickenmann.com/blog/the-great-fp64-divide.html
53•fp64enjoyer•3h ago•16 comments

How to Choose Between Hindley-Milner and Bidirectional Typing

https://thunderseethe.dev/posts/how-to-choose-between-hm-and-bidir/
60•thunderseethe•3d ago•7 comments

Cosmologically Unique IDs

https://jasonfantl.com/posts/Universal-Unique-IDs/
317•jfantl•10h ago•99 comments

Tailscale Peer Relays is now generally available

https://tailscale.com/blog/peer-relays-ga
361•sz4kerto•12h ago•184 comments

Zero-day CSS: CVE-2026-2441 exists in the wild

https://chromereleases.googleblog.com/2026/02/stable-channel-update-for-desktop_13.html
288•idoxer•12h ago•147 comments

How AI is affecting productivity and jobs in Europe

https://cepr.org/voxeu/columns/how-ai-affecting-productivity-and-jobs-europe
34•pseudolus•4h ago•14 comments

Minecraft Java is switching from OpenGL to Vulkan

https://www.gamingonlinux.com/2026/02/minecraft-java-is-switching-from-opengl-to-vulkan-for-the-v...
116•tuananh•3h ago•31 comments

DNS-Persist-01: A New Model for DNS-Based Challenge Validation

https://letsencrypt.org/2026/02/18/dns-persist-01.html
229•todsacerdoti•11h ago•107 comments

Step 3.5 Flash: Fast Enough to Think. Reliable Enough to Act

https://static.stepfun.com/blog/step-3.5-flash/
19•kristianp•2h ago•4 comments

A Pokémon of a Different Color

https://matthew.verive.me/blog/color/
74•Risse•3d ago•8 comments

Electrobun v1: Build fast, tiny, and cross-platform desktop apps with TypeScript

https://blackboard.sh/blog/electrobun-v1/
31•merlindru•1h ago•7 comments

The Perils of ISBN

https://rygoldstein.com/posts/perils-of-isbn
97•evakhoury•11h ago•51 comments

R3forth: A concatenative language derived from ColorForth

https://github.com/phreda4/r3/blob/main/doc/r3forth_tutorial.md
67•tosh•9h ago•10 comments

Making a font with ligatures to display thirteenth-century monk numerals

https://digitalseams.com/blog/making-a-font-with-9999-ligatures-to-display-thirteenth-century-mon...
61•a7b3fa•3d ago•8 comments

Microsoft guide to pirating Harry Potter for LLM training (2024) [removed]

https://devblogs.microsoft.com/azure-sql/langchain-with-sqlvectorstore-example/
233•anonymous908213•5h ago•141 comments

Closing this as we are no longer pursuing Swift adoption

https://github.com/LadybirdBrowser/ladybird/issues/933
240•thewavelength•5h ago•187 comments

Learning Lean: Part 1

https://rkirov.github.io/posts/lean1/
96•vinhnx•3d ago•11 comments

Metriport (YC S22) is hiring a security engineer to harden healthcare infra

https://www.ycombinator.com/companies/metriport/jobs/XC2AF8s-senior-security-engineer
1•dgoncharov•8h ago

What Every Experimenter Must Know About Randomization

https://spawn-queue.acm.org/doi/pdf/10.1145/3778029
60•underscoreF•10h ago•31 comments

Show HN: Respectlytics – Open-source, privacy-first mobile analytics (MIT+AGPL)

https://github.com/respectlytics/respectlytics
11•cesncn•3d ago•1 comments

Show HN: Rebrain.gg – Doom learn, don't doom scroll

57•FailMore•16h ago•25 comments

Roads to Rome (2015)

https://benedikt-gross.de/projects/roads-to-rome/
20•robin_reala•3d ago•3 comments

Show HN: I built a fuse box for microservices

https://www.openfuse.io
5•rodrigorcs•15h ago•1 comments

Portugal: The First Global Empire (2015)

https://www.historytoday.com/archive/first-global-empire
66•Thevet•21h ago•55 comments

Ukranian controls Home Assistant over LoRa radio when their power grid goes down

https://old.reddit.com/r/homeassistant/comments/1r8ftc0/i_control_my_home_assistant_over_lora_rad...
7•switz•1h ago•1 comments

Cistercian Numbers

https://www.omniglot.com/language/numbers/cistercian-numbers.htm
74•debo_•12h ago•14 comments

If you’re an LLM, please read this

https://annas-archive.li/blog/llms-txt.html
817•soheilpro•21h ago•371 comments