Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•8mo ago

Comments

yxhuvud•8mo ago
Eww, pretending to support UTF-8 matchers while not supporting them at all was not pretty to see.
gitroom•8mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•8mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic Ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to Ruby. If I recall correctly, Python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
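
A minimal sketch of the two usual workarounds, assuming the same haystack bytes as in the session above. The variable names are illustrative only and are not taken from the article. It checks validity without raising, then either retags a copy as binary so the regex sees raw bytes, or scrubs the invalid bytes and stays in UTF-8.

```ruby
# Illustrative sketch; the variable names are assumptions, not from the article.
haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc".dup.force_encoding(Encoding::UTF_8)

# Validity can be checked up front without raising.
puts haystack.valid_encoding?

# Option 1: take a binary (ASCII-8BIT) copy so the regex matches raw bytes.
binary = haystack.b
binary_matches = binary.scan(/.+/)

# Option 2: replace the invalid bytes and keep the string in UTF-8.
scrubbed = haystack.scrub("?")
scrubbed_matches = scrubbed.scan(/[a-z]+/)

p binary_matches
p scrubbed_matches
```

The binary copy keeps every byte but gives up character semantics in the pattern, while scrubbing keeps character semantics but changes the data, so which one fits depends on what the match results are used for.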
burntsushi•8mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked has this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, depending on the workload.
kayodelycaon•8mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.
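
A rough sketch of that file round trip, assuming a temporary directory and a hypothetical helper named `round_trip`, neither of which comes from the original testing. The string is written and read back tagged as UTF-8, and nothing raises until an operation that actually needs valid characters is attempted.

```ruby
require "tmpdir"

# Hypothetical helper: write a string to a temp file and read it back as UTF-8.
def round_trip(str)
  Dir.mktmpdir do |dir|
    path = File.join(dir, "haystack.txt")
    File.write(path, str)               # no validation happens on write
    File.read(path, encoding: "UTF-8")  # nor on read; the result is only tagged UTF-8
  end
end

haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc".dup.force_encoding(Encoding::UTF_8)
copy = round_trip(haystack)

puts copy.encoding          # still tagged as UTF-8
puts copy.valid_encoding?   # the invalid bytes survived the round trip
```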

DmitryOlshansky•8mo ago
I wonder how std.regex of dlang would fare in such a test. Sadly, due to a tiny bit of D's GC use, it's hard to provide as a library for other languages. If there is interest I might take it through the tests.
