frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•9mo ago

Comments

yxhuvud•9mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•9mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•9mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
burntsushi•9mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, dependening on the workload.
kayodelycaon•9mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.

DmitryOlshansky•9mo ago
I wonder how std.regex of dlang would fare in such test. Sadly due to a tiny bit of D’s GC use it’s hard to provide as a library for other languages. If there is an interest I might take it through the tests.

Welcoming Discord users amidst the challenge of Age Verification

https://matrix.org/blog/2026/02/welcome-discord/
70•foresto•45m ago•28 comments

GPT‑5.3‑Codex‑Spark

https://openai.com/index/introducing-gpt-5-3-codex-spark/
374•meetpateltech•3h ago•171 comments

Gemini 3 Deep Think

https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-deep-think/
443•tosh•4h ago•266 comments

An AI agent published a hit piece on me

https://theshamblog.com/an-ai-agent-published-a-hit-piece-on-me/
1093•scottshambaugh•5h ago•500 comments

Polis: Open-source platform for large-scale civic deliberation

https://pol.is/home2
93•mefengl•3h ago•20 comments

Major European payment processor can't send email to Google Workspace users

https://atha.io/blog/2026-02-12-viva
379•thatha7777•7h ago•239 comments

What's the difference between a "disc" and a "disk"? (2023)

https://support.apple.com/en-gb/100749
26•IndySun•1h ago•31 comments

Rari – Rust-powered React framework

https://rari.build/
54•bvanvugt•2h ago•32 comments

Launch HN: Omnara (YC S25) – Run Claude Code and Codex from anywhere

71•kmansm27•4h ago•102 comments

Improving 15 LLMs at Coding in One Afternoon. Only the Harness Changed

http://blog.can.ac/2026/02/12/the-harness-problem/
465•kachapopopow•8h ago•199 comments

Anthropic raises $30B in Series G funding at $380B post-money valuation

https://www.anthropic.com/news/anthropic-raises-30-billion-series-g-funding-380-billion-post-mone...
154•ryanhn•2h ago•160 comments

Fixing retail with land value capture

https://worksinprogress.co/issue/fixing-retail-with-land-value-capture/
15•marojejian•58m ago•7 comments

How to Have a Bad Career – David Patterson (2016) [video]

https://www.youtube.com/watch?v=Rn1w4MRHIhc
18•rombr•2h ago•0 comments

A brief history of barbed wire fence telephone networks (2024)

https://loriemerson.net/2024/08/31/a-brief-history-of-barbed-wire-fence-telephone-networks/
105•keepamovin•6h ago•22 comments

I was insulted today – AI style

https://forkingmad.blog/insulted-today-ai/
34•speckx•1h ago•25 comments

Beginning fully autonomous operations with the 6th-generation Waymo driver

https://waymo.com/blog/2026/02/ro-on-6th-gen-waymo-driver
85•ra7•5h ago•55 comments

Apache Arrow is 10 years old

https://arrow.apache.org/blog/2026/02/12/arrow-anniversary/
152•tosh•8h ago•37 comments

Shut Up: Comment Blocker

https://rickyromero.com/shutup/
64•mefengl•4h ago•25 comments

ICE, CBP Knew Facial Recognition App Couldn't Do What DHS Says It Could

https://www.techdirt.com/2026/02/12/ice-cbp-knew-facial-recognition-app-couldnt-do-what-dhs-says-...
27•cdrnsf•49m ago•1 comments

Culture Is the Mass-Synchronization of Framings

https://aethermug.com/posts/culture-is-the-mass-synchronization-of-framings
106•mrcgnc•7h ago•55 comments

Show HN: Generate Web Interfaces from Data

https://github.com/puffinsoft/syntux
16•Goose78•1h ago•4 comments

The "Crown of Nobles" Noble Gas Tube Display (2024)

https://theshamblog.com/the-crown-of-nobles-noble-gas-tube-display/
115•Ivoah•9h ago•25 comments

Partial 8-Piece Tablebase

https://lichess.org/@/Lichess/blog/op1-partial-8-piece-tablebase-available/1ptPBDpC
7•qsort•3d ago•0 comments

The Future for Tyr, a Rust GPU Driver for Arm Mali Hardware

https://lwn.net/Articles/1055590/
99•todsacerdoti•7h ago•23 comments

ai;dr

https://www.0xsid.com/blog/aidr
435•ssiddharth•4h ago•185 comments

How to make a living as an artist

https://essays.fnnch.com/make-a-living
226•gwintrob•17h ago•119 comments

The Science of the Perfect Second (2023)

https://harpers.org/archive/2023/04/the-science-of-the-perfect-second/
10•NaOH•5d ago•1 comments

Show HN: Geo Racers – Race from London to Tokyo on a single bus pass

https://geo-racers.com/
61•pattle•11h ago•55 comments

Sex Workers Are Built Different (Graph Dump)

https://aella.substack.com/p/sex-workers-are-built-different-graph
7•surprisetalk•1h ago•1 comments

Run Pebble OS in Browser via WASM

https://ericmigi.github.io/pebble-qemu-wasm/
102•goranmoomin•8h ago•14 comments