frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•1y ago

Comments

yxhuvud•1y ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•1y ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•1y ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
burntsushi•1y ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, dependening on the workload.
kayodelycaon•1y ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.

DmitryOlshansky•1y ago
I wonder how std.regex of dlang would fare in such test. Sadly due to a tiny bit of D’s GC use it’s hard to provide as a library for other languages. If there is an interest I might take it through the tests.

Dropbox CEO Drew Houston to step down

https://www.cnbc.com/2026/05/26/dropbox-ceo-drew-houston-ashraf-alkarmi.html
50•aghuang•4h ago•7 comments

Is "colorectal cancer" rising in "young people"?

https://dynomight.net/crc-rates/
51•surprisetalk•2h ago•26 comments

The Ballad of TIGIT

https://www.owlposting.com/p/the-ballad-of-tigit
39•crescit_eundo•2h ago•2 comments

Launch HN: Minicor (YC P26) – Windows desktop automations at scale

https://www.minicor.com/
41•fchishtie•3h ago•28 comments

What color is your function? (2015)

https://journal.stuffwithstuff.com/2015/02/01/what-color-is-your-function/
36•tosh•2h ago•23 comments

Language Models Need Sleep

https://arxiv.org/abs/2605.26099
109•juxtapose•2h ago•81 comments

Spain blocks prediction markets Polymarket, Kalshi over lack of gambling licence

https://www.reuters.com/business/spain-blocks-prediction-markets-polymarket-kalshi-over-lack-gamb...
362•thm•4h ago•179 comments

Sage Care (YC S24) Is Hiring Software Engineers

https://www.ycombinator.com/companies/sagecare/jobs/xtloH8r-senior-software-engineer
1•ian-gillis•1h ago

Using AI to write better code more slowly

https://nolanlawson.com/2026/05/25/using-ai-to-write-better-code-more-slowly/
1028•signa11•18h ago•390 comments

C64 Basic: Game Map Overhead "Camera View"

https://retrogamecoders.com/overhead-camera-view/
47•ibobev•4h ago•5 comments

A few interesting modern pixel fonts

https://unsung.aresluna.org/a-few-interesting-modern-pixel-fonts/
12•zdw•21h ago•3 comments

Outsourcing plus local AI will soon become more economical vs. frontier labs

https://www.signalbloom.ai/posts/outsourcing-plus-localai-will-soon-become-more-economical-vs-fro...
136•GodelNumbering•5h ago•146 comments

The real cost of owning a home

https://ericturner.dev/posts/cost-of-home-ownership/
55•ggcr•2h ago•120 comments

Opaque Types in Python

https://blog.glyph.im/2026/05/opaque-types-in-python.html
77•lumpa•3d ago•29 comments

Netherlands blocks US takeover of vital digital supplier

https://www.politico.eu/article/netherlands-blocks-us-takeover-vital-digital-supplier/
368•vrganj•6h ago•127 comments

Rosalind: A genomics toolkit in Rust running whole-genome pipelines on a laptop

https://github.com/logannye/rosalind
9•samuell•5d ago•0 comments

DynIP – Dynamic DNS with RFC 2136, IPv6, DNSSEC, and BYOD

https://dynip.dev/
261•dynip•10h ago•108 comments

Uber, Lyft drivers in Massachusetts form first US ride-share union

https://www.reuters.com/business/world-at-work/uber-lyft-drivers-massachusetts-form-first-us-ride...
162•onemoresoop•2h ago•74 comments

Taking a walk may lead to more creativity than sitting, study finds (2014)

https://www.apa.org/news/press/releases/2014/04/creativity-walk
517•bilsbie•19h ago•206 comments

Performance of Rust Language [pdf]

https://github.com/yugr/rust-slides/
92•tanelpoder•18h ago•69 comments

Are we self-sovereign PKI yet?

https://buffrr.dev/blog/are-we-self-sovereign-pki-yet/
31•ca98am79•4d ago•9 comments

Phantasy Star IV – 1993 Developer Interviews

https://shmuplations.com/phantasystariv/
113•speckx•4d ago•42 comments

Don't put aria-label on generic elements like divs

https://www.matuzo.at/blog/2026/aria-label-generic-elements
74•cyanbane•4d ago•54 comments

Use boring languages with LLMs

https://jry.io/writing/use-boring-languages-with-llms/
68•evakhoury•4d ago•64 comments

The user is visibly frustrated

https://pscanf.com/s/354/
222•croes•13h ago•199 comments

How Shamir's Secret Sharing Works

https://ente.com/blog/how-shamirs-secret-sharing-works/
337•subract•19h ago•59 comments

Incident with Actions and Pages

https://www.githubstatus.com/incidents/gnftqj9htp0g
71•hakube•6h ago•20 comments

How do you build a semiconductor company on something that's free?

https://www.siliconimist.com/p/the-open-source-silicon-business
57•johncole•4d ago•19 comments

Eagle 3.1: Collaboration Between the EAGLE Team, vLLM Team, and TorchSpec Team

https://vllm.ai/blog/2026-05-26-eagle-3-1
60•berlianta•6h ago•20 comments

Don't Subscribe So Casually

https://thebestworstcase.substack.com/p/dont-subscribe-so-casually
77•shmublu•3h ago•63 comments