Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•8mo ago

Comments

yxhuvud•8mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•8mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•8mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic Ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to Ruby. If I recall correctly, Python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
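
A minimal sketch of the behavior discussed above, assuming a UTF-8 source file (the Ruby default). It reuses the byte string from the session above; the variable names are illustrative. `String#valid_encoding?`, `String#b`, and `String#scrub` are standard Ruby methods, and which workaround is appropriate depends on whether the original bytes must be preserved.

```ruby
# The haystack from the session above: six bytes that are not valid UTF-8,
# followed by "abc". The literal is tagged UTF-8 but never validated on creation.
haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"

haystack.valid_encoding? # => false

# Scanning a UTF-8 string with invalid bytes raises ArgumentError
# ("invalid byte sequence in UTF-8", as quoted in the comment).
error_message = nil
begin
  haystack.scan(/.+/)
rescue ArgumentError => e
  error_message = e.message
end

# Workaround 1: search a binary (ASCII-8BIT) copy. The bytes are unchanged,
# and the regex engine no longer requires valid UTF-8.
binary = haystack.b
binary_matches = binary.scan(/.+/)

# Workaround 2: replace the invalid bytes first. The result is valid UTF-8,
# but the original bytes are lost.
scrubbed = haystack.scrub("?")
scrubbed_matches = scrubbed.scan(/.+/)
```

The binary copy keeps the input intact at the cost of byte-level rather than character-level matching; scrubbing keeps character semantics but alters the haystack.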
burntsushi•8mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked has this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, depending on the workload.
kayodelycaon•8mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.
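
A rough sketch of that file round trip, under the assumption that no encoding conversion is configured on the IO (no internal encoding, no transcoding options). The file names and variable names are hypothetical; `Dir.mktmpdir`, `File.binwrite`, `File.read`, `File.write`, and `File.binread` are standard library calls.

```ruby
require "tmpdir"

# Same invalid byte sequence as in the session above.
invalid = "\xfc\xa1\xa1\xa1\xa1\xa1abc"

read_back = nil
copied_bytes = nil

Dir.mktmpdir do |dir|
  source = File.join(dir, "in.txt")   # hypothetical file names
  copy   = File.join(dir, "out.txt")

  # Put the raw bytes on disk.
  File.binwrite(source, invalid)

  # Reading tags the string as UTF-8 but does not validate it.
  read_back = File.read(source, encoding: "UTF-8")

  # Writing the still-invalid string back out does not raise either,
  # since no transcoding is involved.
  File.write(copy, read_back)
  copied_bytes = File.binread(copy)
end

read_back.valid_encoding? # => false
```

Validation only comes into play once an operation needs to interpret the characters, such as a regex search on the UTF-8 tagged string.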

DmitryOlshansky•8mo ago
I wonder how std.regex of dlang would fare in such a test. Sadly, due to a tiny bit of D's GC use, it's hard to provide as a library for other languages. If there is interest I might take it through the tests.
