Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•9mo ago

Comments

yxhuvud•9mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•9mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•9mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic Ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to Ruby. If I recall correctly, Python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]

This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
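
For anyone who wants to try this outside irb, here is a minimal sketch of the same idea, assuming a UTF-8 source file. `scan_bytes` is a hypothetical helper name (not from the article or this thread); it falls back to a binary-tagged copy only when the string is not valid in its current encoding. `scrub` is shown as an alternative that replaces the invalid bytes instead of retagging the string.

```ruby
# Hypothetical helper: scan a string that may hold invalid UTF-8 bytes.
def scan_bytes(haystack, pattern)
  # String#b returns a copy tagged ASCII-8BIT; the original is left untouched.
  subject = haystack.valid_encoding? ? haystack : haystack.b
  subject.scan(pattern)
end

haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"

# Scanning the UTF-8 tagged string directly is expected to raise
# "invalid byte sequence in UTF-8"; capture the message instead of crashing.
error_message =
  begin
    haystack.scan(/.+/)
    nil
  rescue ArgumentError => e
    e.message
  end

# Byte-oriented scan: works because the copy is no longer tagged as UTF-8.
matches = scan_bytes(haystack, /[a-z]+/)

# Alternative: keep the UTF-8 tag and replace the invalid bytes.
scrubbed = haystack.scrub("?")
```

The trade-off is that the binary copy is matched byte by byte, so multi-byte characters elsewhere in the haystack are no longer treated as single characters by `.` or character classes.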
burntsushi•9mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked has this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, depending on the workload.
kayodelycaon•9mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.
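
A rough sketch of that file round trip, in case anyone wants to reproduce it. The temp file prefix and variable names are made up for illustration, and this only covers files, not sockets.

```ruby
require "tempfile"

# Six invalid lead/continuation bytes followed by plain ASCII.
bytes = "\xfc\xa1\xa1\xa1\xa1\xa1abc".b

round_tripped = nil
Tempfile.create("haystack") do |file|
  file.binmode
  file.write(bytes)
  file.flush
  # Reading tags the result as UTF-8 but does not check the bytes.
  round_tripped = File.read(file.path, encoding: "UTF-8")
end

# No exception so far; the string only reports the problem when asked.
valid_after_read = round_tripped.valid_encoding?
```

So the invalid bytes survive the write and the read untouched, and the error only shows up once an operation that needs valid characters (such as a regex search) runs on the string.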

DmitryOlshansky•9mo ago
I wonder how std.regex of dlang would fare in such a test. Sadly, due to a tiny bit of D’s GC use, it’s hard to provide as a library for other languages. If there is interest I might take it through the tests.
